Training Detr¶
This tutorial explains how to use the LitDetr module to train the DETR-R50 architecture from scratch, using the COCO 2017 detection dataset as input.
Goals
Learn the different ways to instantiate the LitDetr class
Train the DETR-R50 architecture
Load trained weights and run inference with pre-trained weights
1. LitDetr argument levels¶
Aloception is developed on top of the PyTorch Lightning framework and provides different modules that facilitate the use of datasets and the training of models.
See also
All the relevant information is available in the Aloception API documentation.
There are multiple ways to instantiate the module. Let's start with the most common one: using the default parameters.
[ ]:
from alonet.detr import LitDetr

# Instantiate the module with its default parameters
litdetr = LitDetr()
Like all Aloception modules based on PyTorch Lightning, LitDetr provides a static method that appends its default parameters to an existing argument parser, so they can be combined with those of other modules.
[ ]:
from argparse import ArgumentParser

parser = ArgumentParser()
parser = LitDetr.add_argparse_args(parser)  # Static method: no instance required
parser.parse_args([])
However, if we want to change a specific parameter, we can pass it explicitly when the class is instantiated:
[ ]:
from argparse import Namespace

def params2Namespace(litdetr):
    return Namespace(
        accumulate_grad_batches=litdetr.accumulate_grad_batches,
        gradient_clip_val=litdetr.gradient_clip_val,
        model_name=litdetr.model_name,
        weights=litdetr.weights,
    )

litdetr = LitDetr(gradient_clip_val=0.6)
params2Namespace(litdetr)
These parameters can also be set from the console if we pass the parsed arguments to the module:
[ ]:
args = parser.parse_args([])  # Remove [] to read the real command line when running as a script
litdetr = LitDetr(args)
params2Namespace(litdetr)
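To simulate a console call, the flags can be passed directly to parse_args. This is a minimal sketch assuming the generated flags mirror the attribute names listed above (e.g. --gradient_clip_val):
[ ]:
# Hypothetical console call, equivalent to `python script.py --gradient_clip_val 0.5`
# (the flag name is assumed to mirror the LitDetr attribute of the same name)
args = parser.parse_args(["--gradient_clip_val", "0.5"])
litdetr = LitDetr(args)
params2Namespace(litdetr)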
We can also combine both approaches, fixing some parameters explicitly while taking the rest from the values entered via the command line:
[ ]:
from alonet.detr import LitDetr, DetrR50Finetune

my_model = DetrR50Finetune(num_classes=2)
litdetr = LitDetr(args, model_name="finetune", model=my_model)
params2Namespace(litdetr)
Attention
All parameters passed explicitly replace the corresponding values in the args variable.
See also
Since LitDetr is a PyTorch Lightning based module, all its functionality can be extended by inheriting from LitDetr as a parent class. See the PyTorch Lightning Module documentation for more information.
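For instance, a custom training behavior can be obtained by overriding the standard Lightning hooks. Below is a minimal sketch assuming the usual PyTorch Lightning API; the optimizer and its hyper-parameters are illustrative choices, not the LitDetr defaults:
[ ]:
import torch
from alonet.detr import LitDetr

class MyLitDetr(LitDetr):
    # configure_optimizers is a standard PyTorch Lightning hook.
    # The optimizer and hyper-parameters below are illustrative, not LitDetr defaults.
    def configure_optimizers(self):
        return torch.optim.AdamW(self.model.parameters(), lr=1e-4, weight_decay=1e-4)

litdetr = MyLitDetr()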
2. Train process¶
See also
The training process is based on the PyTorch Lightning Trainer module. For more information, please consult its online documentation.
As an example, let's take the COCO detection 2017 dataset as the training base. The common pipeline is described below:
[ ]:
from argparse import ArgumentParser
import alonet
from alonet.detr import CocoDetection2Detr, LitDetr
import torch
device = torch.device("cuda:0") if torch.cuda.is_available() else torch.device("cpu")
# Parameters definition
# Build parser (concatenates arguments to modify the entire project)
parser = ArgumentParser(conflict_handler="resolve")
parser = CocoDetection2Detr.add_argparse_args(parser)
parser = LitDetr.add_argparse_args(parser)
parser = alonet.common.add_argparse_args(parser) # Add common arguments in train process
args = parser.parse_args([])
# Dataset used to train
coco_loader = CocoDetection2Detr(args)
lit_detr = LitDetr(args)
# Train process
# args.save = True # Uncomment this line to store trained weights
lit_detr.run_train(
data_loader=coco_loader,
args=args,
project="detr",
expe_name="coco_detr",
)
Attention
This code has a high computational cost and demands several hours of training, since the model is initialized from scratch. We recommend skipping to the next section to see the results of the trained network.
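If args.save is enabled, the trained weights are stored under the ~/.aloception/project_run_id/run_id path, which is how they are retrieved in the next section.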
3. Make inferences¶
Once the training is finished, we can load the trained weights given the project and run id (stored under the ~/.aloception/project_run_id/run_id path). For this, a function from the common module of Aloception can be used:
[ ]:
from argparse import Namespace
from alonet.common import load_training
args = Namespace(project_run_id="project_run_id", run_id="run_id")
lit_detr = load_training(LitDetr, args = args)
Moreover, LitDetr can download and load pre-trained weights. This is achieved with the weights attribute:
[ ]:
lit_detr = LitDetr(weights="detr-r50")
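Note that since weights is one of the parameters exposed through add_argparse_args (see section 1), the same pre-trained weights should also be selectable from the console, presumably through the corresponding --weights flag.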
Finally, we have a pre-trained model ready to make some detections.
[ ]:
%matplotlib inline
from alonet.detr import CocoDetection2Detr, LitDetr
import torch
device = torch.device("cuda:0") if torch.cuda.is_available() else torch.device("cpu")
# Dataset used for the example (samples are taken from its validation set)
coco_loader = CocoDetection2Detr()
lit_detr = LitDetr(weights="detr-r50")
lit_detr.model = lit_detr.model.eval().to(device)
# Check a random result
frame = next(iter(coco_loader.val_dataloader()))
frame = frame[0].batch_list(frame).to(device)  # Batch the list of frames and move them to the device
pred_boxes = lit_detr.inference(lit_detr(frame))[0] # Inference from forward result
gt_boxes = frame[0].boxes2d
frame.get_view(
    [
        gt_boxes.get_view(frame[0], title="Ground truth boxes"),
        pred_boxes.get_view(frame[0], title="Predicted boxes"),
    ],
    size=(1920, 1080),
).render()
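The same model can also be used on a custom image. Below is a minimal sketch assuming the usual aloscene Frame API; the image path is a placeholder to replace with your own:
[ ]:
from aloscene import Frame

# Open an image as a Frame and apply the ResNet normalization expected by DETR
# ("path/to/image.jpg" is a placeholder)
frame = Frame("path/to/image.jpg").norm_resnet()
frames = Frame.batch_list([frame]).to(device)

# Inference from the forward result, as above
pred_boxes = lit_detr.inference(lit_detr(frames))[0]
pred_boxes.get_view(frames[0], title="Predicted boxes").render()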
What is next?
Learn how to train a custom architecture in the Finetuning DETR tutorial. You can also discover a more complex model, based on the deformable attention module, in the Training Deformable tutorial.