feat: add object detector feature in modmesh #619

rockleona · 2025-10-29T15:44:30Z

I started to have a research on computer vision, as the first step, this PR introduce Ultralytics YOLO as the object detection tool, and import yolov11 model as the base model.

Usage is really easy, just load an image, enable the detection, then you will see the result on the Console Widget. You can check the screenshot as below:

cpp/modmesh/pilot/RVisionDockWidget.cpp

cpp/modmesh/pilot/RVisionDockWidget.hpp

cpp/modmesh/pilot/wrap_pilot.cpp

tigercosmos · 2025-10-29T15:58:10Z

modmesh/pilot/_vision.py

+            logger.setLevel(logging.DEBUG)
+
+    if 'model' not in globals():
+        model = YOLO('./modmesh/pilot/yolo11n.pt')  # Please check the model path


How do you get the model? Maybe have a runtime download logic?

Yes, it will check the path if the model is exist during runtime, nor it will download it directly to the specified path.

I just found there is a directory called thirdparty, maybe I should specify the path overthere instead?

thirdparty is for the 3rd libraries. In this case, I think you can put the model file at the same directory of pilot runtime. Btw, it seems that the download logic is not implemented yet, right?

ultralytics library had already done the download logic, no need to do it agin, perhaps they will find the model name is in their file server or not, then download it when trigger class YOLO initialization.

cpp/modmesh/pilot/RVisionDockWidget.cpp

tigercosmos · 2025-10-29T16:01:22Z

modmesh/pilot/_vision.py

@@ -0,0 +1,99 @@
+"""


Is it possible to write tests to validate the implementation?

How would you recommend to place the tests, put them in the tests/test_pilot.py perhaps?

I think you can put it at tests/test_vision.py?

yungyuc

Please make sure CI passes before requesting for review.

Correct copyright headers.
Remove unnecessary code like WrapRVisionDockWidget, which looks like a placeholder.
The pimpl class RVisionDockWidget::Impl is not necessary. Do not use pimpl.
Do not create a symbol named modmesh::BoundingBox.
Always add an end marker to classes and namespaces.

I see you are using pybind11 to call back into Python to use YOLO. Why don't you just write PySide6 to do it?

cpp/modmesh/pilot/RVisionDockWidget.hpp

cpp/modmesh/pilot/RVisionDockWidget.cpp

yungyuc · 2025-10-30T14:51:44Z

cpp/modmesh/pilot/RVisionDockWidget.cpp

+    const uchar *data = rgbImg.bits();
+    py::array_t<uint8_t> np_img({height, width, channels}, data);
+    py::object vision_mod = py::module_::import("modmesh.pilot._vision");
+    py::object yolo_func = vision_mod.attr("yolo_detect");


If you call YOLO from Python, why not use PySide?

rockleona · 2025-11-17T03:47:21Z

Please make sure CI passes before requesting for review.

Correct copyright headers.

Remove unnecessary code like WrapRVisionDockWidget, which looks like a placeholder.

The pimpl class RVisionDockWidget::Impl is not necessary. Do not use pimpl.

Do not create a symbol named modmesh::BoundingBox.

Always add an end marker to classes and namespaces.

I see you are using pybind11 to call back into Python to use YOLO. Why don't you just write PySide6 to do it?

I thought it was a must to write all the GUI component with qt, I will change it to PySide6 since these functions were executed only from Python

yungyuc · 2025-11-17T11:31:45Z

@rockleona The code base has changed a lot. Please rebase to refresh the CI status.

rockleona · 2025-11-24T14:08:10Z

I've made a lots of changes, please find the items below:

Change all GUI components with PySide6
Write unit test cases for _yolo_detector

The latest GUI will be look like this, I didn't change the layout but slightly different on the detail like model status and logging message in pycon widget

rockleona · 2025-11-24T14:09:10Z

Please make sure CI passes before requesting for review.

I cannot find a button to run the CI process, maybe it should be executed by a repo maintainer?

yungyuc · 2025-11-29T02:30:49Z

Please make sure CI passes before requesting for review.

I cannot find a button to run the CI process, maybe it should be executed by a repo maintainer?

You should use your own fork to test for CI.

The latest GUI will be look like this, I didn't change the layout but slightly different on the detail like model status and logging message in pycon widget

Can you move the image preview away from the widget window (lower left) to the central sub-window, like other windows for 2D and 3D plots?

yungyuc

There will be some discussions before we can merge any code about YOLO.

Include upsplash license text.
Clean up image source link.
Evaluate to use Qt instead of PIL.
Discuss why including ultralytics.

yungyuc · 2025-11-29T02:32:19Z

tests/data/jpg/COPYING

+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+    https://unsplash.com/license


Make a copy of the license text here in this file.

yungyuc · 2025-11-29T02:32:53Z

tests/data/jpg/README.rst

+Test Files
+==========
+
+- cat.jpg (original source: https://unsplash.com/photos/orange-and-white-cat-on-yellow-surface-sR0cTmQHPug?utm_source=unsplash&utm_medium=referral&utm_content=creditShareLink)


Clean up link.

yungyuc · 2025-11-29T02:34:53Z

tests/test_pilot_vision.py

+
+import numpy as np
+import requests
+from PIL import Image


If possible, I do not want to have PIL. modmesh is already using Qt which should include all image handling features that PIL provides. Please evaluate if you can simply use Qt/PySide for processing images.

yungyuc · 2025-11-29T02:37:51Z

tests/test_pilot_vision.py

+import numpy as np
+import requests
+from PIL import Image
+from ultralytics import YOLO


I feel including a huge thirdparty like ultralytics defeats the principle of "doing it ourselves" in modmesh. @rockleona please elaborate why you include ultralytics.

tigercosmos reviewed Oct 29, 2025

View reviewed changes

yungyuc assigned rockleona Oct 30, 2025

yungyuc added the pilot GUI and visualization label Oct 30, 2025

yungyuc marked this pull request as draft October 30, 2025 14:42

yungyuc requested changes Oct 30, 2025

View reviewed changes

rockleona force-pushed the feat/yolov11 branch from 3a678c2 to 79ad5a8 Compare November 18, 2025 13:43

rockleona added 2 commits November 24, 2025 21:55

feat: add object detector feature in modmesh

9a7c2e3

test: adding test files for vision

c069cd7

rockleona force-pushed the feat/yolov11 branch from 79ad5a8 to c069cd7 Compare November 24, 2025 13:55

docs: adding license file for test jpg file

069a67f

rockleona marked this pull request as ready for review November 24, 2025 14:01

rockleona requested review from tigercosmos and yungyuc November 29, 2025 01:50

yungyuc requested changes Nov 29, 2025

View reviewed changes

feat: add object detector feature in modmesh #619

Are you sure you want to change the base?

feat: add object detector feature in modmesh #619

Uh oh!

Conversation

rockleona commented Oct 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rockleona Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yungyuc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rockleona commented Nov 17, 2025

Uh oh!

yungyuc commented Nov 17, 2025

Uh oh!

rockleona commented Nov 24, 2025

Uh oh!

rockleona commented Nov 24, 2025

Uh oh!

yungyuc commented Nov 29, 2025

Uh oh!

yungyuc left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rockleona Oct 30, 2025 •

edited

Loading