Implementation of : HIC-YOLOv5: Improved YOLOv5 for Small Object Detection #12264

aash1999 · 2023-10-21T17:10:34Z

This repository contains the code for HIC-YOLOv5, an improved version of YOLOv5 tailored for small object detection. The improvements are based on the paper HIC-YOLOv5: Improved YOLOv5 For Small Object Detection.

HIC-YOLOv5 incorporates the Convolutional Block Attention Module (CBAM) and Involution modules to improve small object detection. The implementation supports both CPU and GPU training.

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

📊 Key Changes

New hyperparameter file for small object detection, hyp.hic-yolov5s.yaml, tailored to the VisDrone Dataset.
Introduction of ChannelAttention and SpatialAttention modules in common.py to enhance feature representation.
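Implementation of the CBAM module, which applies channel attention followed by spatial attention to produce richer feature maps.
Addition of the Involution module, an operation whose kernels are generated per spatial position and shared across channels, which addresses limitations of standard convolutions.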
Creation of yolov5s-cbam-involution.yaml architecture with CBAM and Involution integrated into the YOLOv5s model.
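
As a rough illustration of the ChannelAttention, SpatialAttention and CBAM modules listed above, the following PyTorch sketch follows the general CBAM design (channel attention first, then spatial attention). It is not the code from this PR: the reduction ratio of 16, the 7x7 spatial kernel, and the class bodies are assumptions made for the example.

```python
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """Re-weights channels using globally pooled statistics (illustrative)."""

    def __init__(self, channels: int, reduction: int = 16):  # reduction is an assumed default
        super().__init__()
        hidden = max(channels // reduction, 1)
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.max_pool = nn.AdaptiveMaxPool2d(1)
        # Shared two-layer MLP, written as 1x1 convolutions.
        self.mlp = nn.Sequential(
            nn.Conv2d(channels, hidden, kernel_size=1, bias=False),
            nn.ReLU(),
            nn.Conv2d(hidden, channels, kernel_size=1, bias=False),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        scale = torch.sigmoid(self.mlp(self.avg_pool(x)) + self.mlp(self.max_pool(x)))
        return x * scale  # scale has shape (N, C, 1, 1) and broadcasts over H, W


class SpatialAttention(nn.Module):
    """Re-weights spatial positions using channel-wise mean and max maps."""

    def __init__(self, kernel_size: int = 7):  # 7x7 kernel is an assumed default
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        avg_map = x.mean(dim=1, keepdim=True)
        max_map = x.max(dim=1, keepdim=True).values
        scale = torch.sigmoid(self.conv(torch.cat([avg_map, max_map], dim=1)))
        return x * scale  # scale has shape (N, 1, H, W) and broadcasts over C


class CBAM(nn.Module):
    """Channel attention followed by spatial attention."""

    def __init__(self, channels: int, reduction: int = 16, kernel_size: int = 7):
        super().__init__()
        self.channel = ChannelAttention(channels, reduction)
        self.spatial = SpatialAttention(kernel_size)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.spatial(self.channel(x))


if __name__ == "__main__":
    features = torch.randn(2, 64, 40, 40)
    print(CBAM(64)(features).shape)
```

Because both attention maps pass through a sigmoid, the block only rescales features and keeps the input shape, so it can be dropped between existing layers without changing channel counts.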

🎯 Purpose & Impact

The PR aims to improve YOLOv5's ability to detect small objects, a common challenge in drone and surveillance applications.
Attention mechanisms (CBAM) and involution help in capturing better feature representations without significantly increasing computational cost.
Users can expect improved performance on datasets with small objects without major changes to their existing workflows.

🌟 Summary

"YOLOv5 enhancements with attention mechanisms and involution for boosting small object detection performance." 🛸🔍

github-actions

👋 Hello @aash1999, thank you for submitting a YOLOv5 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

✅ Verify your PR is up-to-date with ultralytics/yolov5 master branch. If your PR is behind you can update your code by clicking the 'Update branch' button or by running git pull and git merge master locally.
✅ Verify all YOLOv5 Continuous Integration (CI) checks are passing.
✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." — Bruce Lee

Ilyabasharov · 2023-10-24T12:47:50Z

@aash1999 Hello! Thanks for your work! Did you evaluate the results? What about metrics?

aash1999 · 2023-10-24T13:59:57Z

Hi @Ilyabasharov
I implemented the model as described in the paper: https://arxiv.org/pdf/2309.16393.pdf
The reported results can be found there. I am also training the model myself to validate them.

Thank you

glenn-jocher · 2023-10-24T14:23:56Z

Hi @aash1999,

Thank you for your interest in HIC-YOLOv5! As mentioned in the paper, the results and metrics can be found by referring to the research paper at https://arxiv.org/pdf/2309.16393.pdf. You can find the specific details and evaluations there. Additionally, I am currently training and validating the model to further validate the results.

If you have any further questions or need assistance, feel free to ask.

Thank you!

aash1999 · 2023-10-24T14:29:06Z

Hi @glenn-jocher

Thank you for your prompt response. I'm currently encountering some issues with the checks, as they are failing. Could you please provide some guidance on how I can rectify these issues and ensure the checks pass successfully?

Your assistance is greatly appreciated.

Thank you

Ilyabasharov · 2023-10-24T15:56:05Z

Hi @Ilyabasharov I implemented as mentioned by the paper : https://arxiv.org/pdf/2309.16393.pdf You can find the results as mentioned in them. And also I am training for the same to validate the results.

thank you

Did you manage to reproduce the metrics from the article? I'm also interested in these results, but according to the article, TPH-YOLOv5 (paper, GitHub) gives better performance on the VisDrone dataset.

aash1999 · 2023-10-24T16:16:29Z

@Ilyabasharov

I am currently training the model with the hyperparameters from the paper, but I cannot run it at the batch size the paper uses because of GPU RAM constraints.

glenn-jocher · 2023-10-24T18:04:40Z

@aash1999 hi,

Thank you for reaching out. I understand that you are experiencing GPU RAM constraints while trying to run the model with the mentioned batch size. GPU RAM limitations can indeed be a challenge.

To address this issue, you can try the following potential solutions:

Reduce the batch size: You can decrease the batch size until it fits within the available GPU RAM. However, please keep in mind that reducing the batch size may affect training performance.
Utilize gradient accumulation: Instead of updating the model weights after every batch, you can accumulate gradients over multiple batches before performing a weight update. This allows you to effectively simulate a larger batch size without exceeding the GPU RAM limit.
Utilize mixed precision training: By using mixed precision training, you can take advantage of GPU tensor cores and reduce the memory requirement. Tools like Nvidia's Automatic Mixed Precision (AMP) can help streamline this process.

Please note that these are general suggestions, and the optimal solution may vary depending on your specific use case and the resources available to you.
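
To make the gradient accumulation suggestion concrete, here is a minimal, framework-free sketch. It uses a toy one-parameter model `y = w * x` with made-up data (both are assumptions for illustration) and shows that summing the gradients of equally sized micro-batches, each scaled by `1 / accum_steps`, reproduces the gradient of the full batch. In PyTorch the same idea is usually written by dividing the loss by the number of accumulation steps before `backward()` and calling `optimizer.step()` only after the last micro-batch.

```python
def grad_mse(w, batch):
    """Gradient of the mean squared error of y = w * x with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


# Toy data and starting weight, chosen only for the example.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
w = 0.5

# Gradient computed on the whole batch at once.
full_grad = grad_mse(w, data)

# Same gradient obtained from two micro-batches of half the size.
accum_steps = 2
micro_size = len(data) // accum_steps
accumulated = 0.0
for step in range(accum_steps):
    micro_batch = data[step * micro_size:(step + 1) * micro_size]
    # Scale each micro-batch gradient so the sum is an average, not a total.
    accumulated += grad_mse(w, micro_batch) / accum_steps

print(full_grad, accumulated)
```

The equivalence holds for the gradient of a mean loss with equal micro-batch sizes. Layers that compute per-batch statistics, such as batch normalization, still see only the micro-batch, so training is not identical to using a genuinely larger batch.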

I hope this information helps! If you have any further questions or need additional assistance, please let me know.

Thank you!

aash1999 · 2023-10-25T15:33:07Z

Hi @glenn-jocher, @Ilyabasharov

I ran the model on an A100 for 300 epochs as mentioned in the paper with the same hyperparameters, but only with a batch size of 70. I obtained the following results:

HIC-YOLOv5 (test): 35.16 mAP@0.5, 20.23 mAP@[0.5:0.95]
HIC-YOLOv5 (val): 44.02 mAP@0.5, 25.82 mAP@[0.5:0.95]

Meanwhile, YOLOv5 (test) achieved 27.57 mAP@0.5 and 14.43 mAP@[0.5:0.95] on the VisDrone dataset.
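
For reference, the gap between the two test-set rows can be worked out directly from the numbers in this comment. The snippet below only does that arithmetic; it adds no new measurements, and the comment does not say which baseline YOLOv5 variant was used.

```python
# (mAP at IoU 0.5, mAP at IoU 0.5:0.95), copied from the comment above.
hic_test = (35.16, 20.23)
yolov5_test = (27.57, 14.43)

# Absolute gain in mAP points.
delta_map50 = round(hic_test[0] - yolov5_test[0], 2)
delta_map5095 = round(hic_test[1] - yolov5_test[1], 2)

# Relative gain over the baseline, in percent.
rel_map50 = 100 * delta_map50 / yolov5_test[0]
rel_map5095 = 100 * delta_map5095 / yolov5_test[1]

print(f"mAP@0.5: +{delta_map50:.2f} points ({rel_map50:.1f}% relative)")
print(f"mAP@[0.5:0.95]: +{delta_map5095:.2f} points ({rel_map5095:.1f}% relative)")
```

On the test split this is a gain of 7.59 points at IoU 0.5 and 5.80 points averaged over IoU 0.5 to 0.95.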

Thanks

aash1999 · 2023-10-25T17:40:18Z

@glenn-jocher @Ilyabasharov
Are we good to merge?

glenn-jocher · 2023-10-25T18:15:17Z

@aash1999 thank you for considering merging the changes. We appreciate your contribution to the YOLOv5 repository. Before merging, we need to ensure that the changes align with the project's guidelines and requirements.

Please provide more details about the changes you made and any relevant information, such as how the changes impact the overall functionality and performance of the model. Once we have a clearer understanding, we can proceed with the review process and determine if the changes are ready for merging.

Thank you again for your contribution. We look forward to reviewing your changes.

aash1999 · 2023-10-25T18:33:30Z

Hi @glenn-jocher,

I'd like to provide you with a comprehensive overview of the code changes I've made in this PR:

CBAM and Involution modules: I introduced two modules, CBAM and Involution, and integrated them into the YOLOv5 model as follows:
- In models/common.py, I added the CBAM module, which is integrated into the backbone of the model. CBAM relies on two other modules, ChannelAttention and SpatialAttention, which are also implemented in common.py. I've documented these changes for clarity.
- To integrate Involution, I included code in the prediction head of the model, so that both CBAM and Involution are part of the architecture.
These changes preserve the performance and functionality of the existing workflow.
Modification in models/yolo.py: I made a single change in models/yolo.py so that the model parser recognizes the CBAM and Involution modules.
Configuration files: I added two configuration files, models/yolov5s-cbam-involution.yaml and data/hyps/cbam.hyp.yaml, which are needed to build the architecture as described in the paper.
Update in utils/general.py: nn.AdaptiveAvgPool2d lacks a deterministic backward implementation on the GPU, which raises an error during GPU training when deterministic algorithms are enforced. To address this, I made a small change in utils/general.py: I added the call torch.use_deterministic_algorithms(False, warn_only=True). If there's an alternative solution to this problem, I'm open to making those adjustments.

These code changes have been meticulously designed to enhance the YOLOv5 model by incorporating CBAM and Involution modules, as outlined in the referenced paper.
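
One possible alternative to disabling determinism entirely, sketched here under the assumption of PyTorch 1.11 or later, is to keep deterministic mode on and pass `warn_only=True`. In that mode, operations without a deterministic implementation emit a warning instead of raising an error, while all other operations stay deterministic. The helper name `relax_determinism` is hypothetical and is not part of YOLOv5.

```python
import torch


def relax_determinism(keep_deterministic: bool = True) -> None:
    """Hypothetical helper showing two ways to avoid the determinism error."""
    if keep_deterministic:
        # Deterministic kernels are still used where they exist; ops that have
        # none only trigger a warning.
        torch.use_deterministic_algorithms(True, warn_only=True)
    else:
        # Switch determinism enforcement off altogether.
        torch.use_deterministic_algorithms(False)


if __name__ == "__main__":
    relax_determinism(keep_deterministic=True)
    print(torch.are_deterministic_algorithms_enabled())
```

With the first option, runs remain reproducible except for the specific operations that trigger the warning, which makes it easier to see exactly where nondeterminism enters.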

For performance, refer to: #12264 (comment)

glenn-jocher · 2023-10-25T18:44:39Z

@aash1999 hi,

Thank you for providing a comprehensive overview of the code changes you made in this PR. I appreciate the effort you put into integrating the CBAM and Involution modules into the YOLOv5 model.

I have carefully reviewed your changes, and they seem well-documented and aligned with the goals of enhancing the model's performance. I also took a look at the performance metrics you shared in the linked comment, and the results look promising.

The modifications you made in models/common.py and models/yolo.py to integrate the CBAM and Involution modules, as well as the addition of the two configuration files, appear to be well thought out and essential for implementing the architecture described in the paper.

I see that you also addressed an issue related to backward implementation during GPU training in utils/general.py by including the code torch.use_deterministic_algorithms(False, warn_only=True). If there are any alternative solutions to tackle this problem, it would be beneficial to explore them.

Overall, I think your changes align with our project's objectives and will enhance the performance of YOLOv5. However, before merging, I would appreciate it if you could address any open issues and ensure that all tests and checks pass successfully.

Thank you for your contribution. Keep up the great work!

Best,

aash1999 · 2023-10-25T18:58:27Z

@glenn-jocher
Thank you for your efforts in reviewing. I am encountering an issue with the checks. All the tests pass, but two hooks fail in pre-commit:

fix end of files.........................................................Passed
trim trailing whitespace.................................................Failed
- hook id: trailing-whitespace
- exit code: 1
- files were modified by this hook

Fixing utils/general.py

check for case conflicts.................................................Passed
check docstring is first.................................................Passed
fix double quoted strings................................................Passed
detect private key.......................................................Passed
Upgrade code.............................................................Passed
Sort imports.............................................................Passed
YAPF formatting..........................................................Failed
- hook id: yapf
- files were modified by this hook
MD formatting............................................................Passed
PEP8.....................................................................Passed
codespell................................................................Passed

It would be helpful if you could guide me on how to fix it.

glenn-jocher · 2023-10-25T20:50:43Z

@aash1999 the issue you're encountering with the checks is related to the "trailing-whitespace" and "yapf" hooks. Based on the output you provided, it seems that the pre-commit hook is failing because there are trailing whitespaces and the yapf formatter has made modifications to files.

To fix the trailing whitespace issue, you can open the "utils/general.py" file and remove any extra spaces or tabs at the end of a line. It's important to ensure that there are no trailing whitespaces in the file.

For the yapf formatter issue, you can run the yapf formatter on the modified files to make sure they comply with the yapf formatting style. You can use the command yapf -i file.py to apply the yapf formatting.

After making these changes, you can rerun the checks and pre-commit hook to ensure that all tests pass successfully.
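
As a small illustration of what the trailing-whitespace fix amounts to, the sketch below strips spaces and tabs from the end of every line of a string. The function name is hypothetical and this is not the hook's own implementation; the hook applies a similar change to each file it reports as modified.

```python
def strip_trailing_whitespace(text: str) -> str:
    """Remove spaces and tabs at the end of each line, keeping line breaks."""
    # Splitting on "\n" keeps a final empty element when the text ends with a
    # newline, so the trailing newline survives the round trip.
    return "\n".join(line.rstrip(" \t") for line in text.split("\n"))


if __name__ == "__main__":
    source = "seed = 0   \n\treturn seed\t\n"
    print(repr(strip_trailing_whitespace(source)))
```

Leading indentation is left untouched, so only the invisible characters flagged by the hook change.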

I hope this guidance helps you resolve the issue. Let me know if you need any further assistance or have any other questions.

aash1999 · 2023-10-25T22:13:46Z

@glenn-jocher Hi

I made the changes you mentioned, and now it passes all the checks.
It's ready for merging.

thanks

glenn-jocher · 2023-10-25T22:51:06Z

Hi @aash1999,

Thank you for making the necessary changes and addressing the issues with the pre-commit checks. I'm glad to hear that the modifications have passed all the checks and that your changes are now ready for merging.

Your contribution is greatly appreciated. I will review your changes again and proceed with the merging process if everything looks good. Once merged, the enhancements you made to the YOLOv5 model will be available for everyone to benefit from.

Thank you again for your hard work and dedication. Keep up the excellent work!

Best,

aash1999 · 2023-10-28T09:53:53Z

@glenn-jocher hi
Any updates on merging this branch to master?
thanks

aash1999 · 2023-11-06T16:33:11Z

@glenn-jocher
Hi, are there any updates on merging this into the master branch? This PR was opened two weeks ago.

glenn-jocher · 2023-11-06T17:47:53Z

@aash1999 hi,

Thank you for your patience. The merging process for pull requests can sometimes take longer due to various factors such as review time, code complexity, and team capacity. The YOLOv5 repository receives a high volume of pull requests, and the team is working diligently to review and merge them as efficiently as possible.

I understand your eagerness to have your pull request merged, and I assure you that we are actively reviewing it. We appreciate your contribution and thank you for your patience. Please rest assured that we will provide an update as soon as possible.

Thank you again for your understanding.

Kind regards,

deanmark · 2023-11-16T08:27:54Z

@aash1999 hi,

Can you please add the train and val commands, along with the trained weights? I would like to validate the results.
Thanks

ExtReMLapin · 2023-11-29T11:31:03Z

Tbh, what you call "SODH" is just the P2 layer, and it does most of the work in this paper.
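
To illustrate why a P2 head matters for small objects: in YOLOv5's naming, P2 is the feature level with stride 4, while the default heads sit at P3, P4 and P5 (strides 8, 16 and 32). The sketch below computes the grid resolution at each level and how many grid cells a small object covers. The 640 px input and the 12 px object are assumed values for the example, not figures from the paper.

```python
# Strides of the feature levels in YOLOv5's naming convention.
STRIDES = {"P2": 4, "P3": 8, "P4": 16, "P5": 32}


def grid_size(image_size: int, stride: int) -> int:
    """Number of grid cells along one side of the feature map."""
    return image_size // stride


def cells_spanned(object_px: float, stride: int) -> float:
    """How many grid cells an object of the given width covers along one axis."""
    return object_px / stride


if __name__ == "__main__":
    for level, stride in STRIDES.items():
        g = grid_size(640, stride)
        span = cells_spanned(12, stride)
        print(f"{level}: stride {stride}, grid {g}x{g}, 12 px object spans {span:.2f} cells")
```

At P2 a 12 px object still covers three cells per axis, while at P5 it occupies well under one cell, which is consistent with the extra high-resolution head accounting for much of the gain on a dataset like VisDrone.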

glenn-jocher · 2023-11-29T14:55:22Z

@ExtReMLapin thank you for sharing your insights! It's great to hear your perspective and the importance you attribute to the P2 layer. Your expertise adds valuable context to this discussion. Keep the great feedback coming!

mahilaMoghadami · 2023-12-30T13:23:37Z

hello,
How can I convert these changes (HIC-YOLO) from yolov5s to yolov5l?
thank you

glenn-jocher · 2023-12-30T17:26:03Z

@mahilaMoghadami to convert the HIC YOLO changes from YOLOv5S to YOLOv5L, you can adjust the model architecture settings in the YOLOv5 configuration files (yolov5s.yaml and yolov5l.yaml).

In these files, you can modify the "backbone" and "head" sections to match the larger YOLOv5L architecture. Specifically, you would need to update the number of layers, channels, and other architecture-specific parameters to align with YOLOv5L specifications.

After making the necessary adjustments, you can use the YOLOv5L configuration files for training and inference. Be sure to update the command-line arguments for training and evaluation to use the YOLOv5L configuration and model weights.

If you need further assistance, you can refer to the YOLOv5 documentation at https://docs.ultralytics.com/yolov5/ or feel free to ask for specific guidance.

I hope this helps! Let me know if you have any more questions.

aash1999 · 2024-01-02T14:15:07Z

hello, how to convert this changes (HIC yolo) from yolov5s to yolov5l? thank you

You can change the architecture of the 5L model by adding CBAM and Involution in the same way as in this branch. Also check that the dimensions match, and test the result. Please reach out if you encounter any issues.

Regards

glenn-jocher · 2024-01-05T12:01:30Z

Hello @aash1999,

Exactly, to adapt the HIC YOLO improvements from the YOLOv5S to the YOLOv5L model, you'll need to integrate the Convolutional Block Attention Module (CBAM) and Involution modules into the YOLOv5L architecture. This involves:

Editing the YOLOv5L configuration file (typically named yolov5l.yaml) to include the CBAM and Involution layers at the appropriate positions within the network.
Ensuring that the dimensions of the layers match the expected input and output shapes, especially when scaling up from YOLOv5S to YOLOv5L.
Testing the modified architecture to verify that the network trains correctly and that the performance improvements are consistent with those observed in the smaller model.
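
As background for the dimension check, the stock YOLOv5 model files differ between sizes mainly in two scalars: `depth_multiple` and `width_multiple` (0.33 and 0.50 for yolov5s, 1.0 and 1.0 for yolov5l). The sketch below mirrors, in simplified form, how the model parser scales repeat counts and channel widths from those values. The function `scale_layer` is a hypothetical helper written for this example, and whether the CBAM and Involution layers from this PR are scaled the same way depends on how they are registered in models/yolo.py, which is an assumption here.

```python
import math

# (depth_multiple, width_multiple) from the stock YOLOv5 model YAML files.
SCALES = {"yolov5s": (0.33, 0.50), "yolov5l": (1.0, 1.0)}


def make_divisible(x: float, divisor: int = 8) -> int:
    """Round a channel count up to the nearest multiple of divisor."""
    return math.ceil(x / divisor) * divisor


def scale_layer(repeats: int, channels: int, depth_multiple: float, width_multiple: float):
    """Scale one YAML layer entry the way the model parser does (simplified)."""
    n = max(round(repeats * depth_multiple), 1) if repeats > 1 else repeats
    c = make_divisible(channels * width_multiple, 8)
    return n, c


if __name__ == "__main__":
    # A layer declared in the YAML with 9 repeats and 512 output channels.
    for name, (depth, width) in SCALES.items():
        print(name, scale_layer(9, 512, depth, width))
```

Under these assumptions, a first experiment could be to copy yolov5s-cbam-involution.yaml, set both multiples to 1.0, and check the printed layer shapes when the model is built, before editing individual layers by hand.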

If you run into any issues or have further questions, don't hesitate to ask. The community is here to help!

highquanglity · 2024-03-04T01:47:13Z

@aash1999 I trained the HIC model following your configuration but did not get the same result. After 300 epochs, I only got 25 mAP@0.5. This is the model summary:

glenn-jocher · 2024-05-19T19:24:09Z

Hello @highquanglity,

It looks like your mAP results are lower than expected. A few things to consider:

Double-check your dataset quality and annotations.
Ensure that the hyperparameters in hyp.hic-yolov5s.yaml are correctly set for your specific dataset.
Experiment with different learning rates or training for more epochs.

If the issue persists, could you share more details about your training dataset and the exact command you used for training? This might help in diagnosing the problem more effectively. 🛠️

Keep up the great work, and let's get those numbers up!

aash1999 and others added 8 commits October 21, 2023 22:18

imp of CBAM + Involution at common.py

e7e1cdb

import CBAm and Involution into yolo.py

16fd02c

handle GPU err on

7eff0ef

use_deterministic_algorithms

Merge pull request #1 from aash1999/cbam-imp

b7715ca

Done necessary code changes for implementing CBAM and Involution to yolov5.

added arch. backbone to /models/

55ea408

readme update

02469f2

Merge pull request #2 from aash1999/cbam-imp

a27e8d1

Added yolo5m-cbam-involution.yaml

[pre-commit.ci] auto fixes from pre-commit.com hooks

b1b1ab9

for more information, see https://pre-commit.ci

aash1999 added 2 commits October 24, 2023 19:50

Update general.py

2ee59f6

refactoring to meet inline comment rules Signed-off-by: Aakash Singh <[email protected]>

Update common.py

79112df

added few comments Signed-off-by: Aakash Singh <[email protected]>

aash1999 added 4 commits October 24, 2023 20:30

adding hyp and model files as mentioned in paper

3d46323

Merge pull request #4 from aash1999/cbam-imp

f0b2ffc

adding hyp and model files as mentioned in paper

Delete models/yolo5m-cbam-involution.yaml

1204c74

Signed-off-by: Aakash Singh <[email protected]>

Update general.py

947266a

removing trailing white space Signed-off-by: Aakash Singh <[email protected]>

Update yolov5s-cbam-involution.yaml

a56bf81

Typo correction Signed-off-by: Aakash Singh <[email protected]>

Update CITATION.cff

5208303

Signed-off-by: Aakash Singh <[email protected]>

aash1999 added 3 commits October 26, 2023 02:31

removed trailing spaces in general.py

ccf2664

yapf formatting

16ed93a

yapf formatting

11ddc58

aash1999 closed this Oct 25, 2023

aash1999 reopened this Oct 25, 2023

aash1999 added 3 commits October 26, 2023 03:06

Delete CITATION.cff

02bf256

Signed-off-by: Aakash Singh <[email protected]>

reverting the files to commit 4d687c8

1f85ade

yapf reformat

8738c27

aash1999 added 2 commits October 26, 2023 03:50

movig files to where they belong

0fd8fe3

typo correction

2fc73ca

Merge branch 'master' into master

ad78882

Signed-off-by: Aakash Singh <[email protected]>
