"how to track objects with SORT tracker" notebook: colab.research.google.com/github/robof...
"how to track objects with SORT tracker" notebook: colab.research.google.com/github/robof...
combo object detectors from top model libraries with multi-object tracker of your choice
for now we support SORT and DeepSORT; more trackers coming soon
link: github.com/roboflow/tra...
combo object detectors from top model libraries with multi-object tracker of your choice
for now we support SORT and DeepSORT; more trackers coming soon
link: github.com/roboflow/tra...
each subset contains images and annotations.jsonl file where each line of the file is a valid JSON object; each JSON object has three keys: image, prefix, and suffix
each subset contains images and annotations.jsonl file where each line of the file is a valid JSON object; each JSON object has three keys: image, prefix, and suffix
to pick the right variant, you need to take into account the vision-language task you are solving, available hardware, amount of data, inference speed
to pick the right variant, you need to take into account the vision-language task you are solving, available hardware, amount of data, inference speed
all I learned in a single blog
- PaliGemma 2 architecture
- dataset annotation and structure
- picking the right checkpoint
- memory optimization
- hyperparameters tuning
link: blog.roboflow.com/fine-tune-pa...
all I learned in a single blog
- PaliGemma 2 architecture
- dataset annotation and structure
- picking the right checkpoint
- memory optimization
- hyperparameters tuning
link: blog.roboflow.com/fine-tune-pa...
we can see that PaliGemma2's object detection performance depends more on input resolution than model size. 3B 448 seems like a sweet spot.
we can see that PaliGemma2's object detection performance depends more on input resolution than model size. 3B 448 seems like a sweet spot.
compared to PG1, it performs much better; datasets with a large number of classes were hard to fine-tune with previous version
compared to PG1, it performs much better; datasets with a large number of classes were hard to fine-tune with previous version
- used google/paligemma2-3b-pt-448 checkpoint
- trained on A100 with 40GB VRAM
- 1h of training
- 0.62 mAP on the validation set
colab with complete fine-tuning code: colab.research.google.com/github/robof...
- used google/paligemma2-3b-pt-448 checkpoint
- trained on A100 with 40GB VRAM
- 1h of training
- 0.62 mAP on the validation set
colab with complete fine-tuning code: colab.research.google.com/github/robof...
it looks like OCR-related metrics ST-VQA, TallyQA, and TextCaps... benefit more from increased resolution than model size. that's why I went from 224 to 336.
it looks like OCR-related metrics ST-VQA, TallyQA, and TextCaps... benefit more from increased resolution than model size. that's why I went from 224 to 336.
- used google/paligemma2-3b-pt-336 checkpoint; I tried to make it happen with 224, but 336 performed a lot better
- trained on A100 with 40GB VRAM
- trained with LoRA
colab with complete fine-tuning code: colab.research.google.com/github/robof...
- used google/paligemma2-3b-pt-336 checkpoint; I tried to make it happen with 224, but 336 performed a lot better
- trained on A100 with 40GB VRAM
- trained with LoRA
colab with complete fine-tuning code: colab.research.google.com/github/robof...
- never allow github actions from first-time contributors.
- always require review for new contributors.
- never run important actions automatically via bots.
- protect release actions with unique cases and selected actors.
- never allow github actions from first-time contributors.
- always require review for new contributors.
- never run important actions automatically via bots.
- protect release actions with unique cases and selected actors.
malicious code was injected into the pypi deployment workflow (github action).
the source code itself wasn't infected. however, the resulting tar/wheel files were corrupted during the build process.
malicious code was injected into the pypi deployment workflow (github action).
the source code itself wasn't infected. however, the resulting tar/wheel files were corrupted during the build process.
a crypto miner was injected into versions 8.3.41 and 8.3.42.
link: github.com/ultralytics/...
a crypto miner was injected into versions 8.3.41 and 8.3.42.
link: github.com/ultralytics/...
link to my original line counting tutorial: www.youtube.com/watch?v=OS5q...
link to my original line counting tutorial: www.youtube.com/watch?v=OS5q...
expensive, slow, censors results, and refuses to read plates 20-30% of the time.
open-source models like florence2 are more reliable.
expensive, slow, censors results, and refuses to read plates 20-30% of the time.
open-source models like florence2 are more reliable.
- license plate detection and ocr
- object tracking with bytetrack.
- counting cars entering and leaving the lot.
- real-time alerts via telegram.
- license plate detection and ocr
- object tracking with bytetrack.
- counting cars entering and leaving the lot.
- real-time alerts via telegram.