drivers/gps: prioritize non-blocking reads over injection #25535

dagar · 2025-09-08T21:52:19Z

No description provided.

dakejahl

Definitely an improvmement. The handleInjectDataTopic function can still block for a while though. If it loops the max (8) times and injects 300 bytes each time at 115200 baud that results in ~200ms blocking. At 921600 it's only 26ms and is probably goes unnoticed.

jeremyzff · 2025-09-09T18:55:21Z

Not sure this fixes the situation as much as necessary- As jake pointed out, it would definitely make sure that on entry, IF GPS bytes are available it will service them first, but I think the parsing happens after this call, so we'd still get blocked if the inject takes a long time. I think the fundamental problem is that in the duration of time it takes to read in 8 rtcm messages, the 600 byte hardware buffer (v5x) could have overflowed several times, so next loop even with data available in the buffer, its not the continuation of the last time we read it because the receive buffer overflowed.

jeremyzff · 2025-09-09T18:56:13Z

I think the correct fix to this problem is to fully separate the read and write threads so that the read task will NEVER block, and the write task will write as much data as possible.

dakejahl · 2025-09-09T19:27:11Z

Maybe we can break off the RTCM injection into a SubscriptionCallback work item. Is the driver doing any other writes aside from RTCM injection during normal run time?

alexcekay · 2025-09-10T17:01:24Z

@dakejahl,
the driver is only writing during config or when being told to reset the device.

If we really do not want to be blocked by the write() anymore, we either have to change the write in SerialImpl to be truly non-blocking, so no fsync (and handle the EAGAIN and non complete write correctly), or spin up two different tasks, which is kind of heavy (regarding performance overhead and additional flash) for just having a non blocking write.

dakejahl · 2025-09-10T18:50:30Z

Another thread is probably the easiest way to accomplish the job but not strictly necessary. The other option is to interleave reads/writes and only write small chunks at a time. We also would need to buffer the RTCM uORB data so that we can chunk it out at our leisure.

jeremyzff · 2025-09-11T15:49:22Z

I'm not 100% sure how the EKF handles the GPS timing, but I believe there is a parameter that adjusts the expected delay of the GPS data vs the EKF time horizon. If we are varying the timing baesd on injection and blocking reads while we do, the GPS data may come in at variable timing which is probably subpar for the EKF.

jeremyzff · 2025-09-11T15:50:53Z

It seems silly that we have a full duplex serial port but can't actually use it that way, So I'd support something that properly parallelizes the process. Perhaps the GPS reading thread can operate as normal, and the writing can happen on a work queue (so it isn't a full extra thread?)

dakejahl · 2025-09-11T18:35:39Z

The EKF time horizon is ~100ms so GPS read jitter shouldn't have an impact as long as we stay well below that. A work queue is still a full thread but we might be able to get away with putting it on the hp_default, but I would probably be safe and just give it it's own thread. @dagar what do you think? Also, who wants to fix this? I'm pretty busy at the moment but am interested in seeing this solved quickly. @jeremyzff want to take a stab at it?

jeremyzff · 2025-09-11T23:42:50Z

the work queue is a full thread that is more or less already "paid for" right? so adding it to a work queue adds minimal overhead was my thought. But a separate thread might be ok too. I'm sensitive to the RAM usage because our drones are starting to get pretty full...

jeremyzff · 2025-09-11T23:45:10Z

regarding read jitter- a burst of 8 messages of full 180 bytes at 115200 baud GPS rate is 125ms, which means that is how much variation I might have in when I read the GPS data between when there is no injection or max injection... Am I thinking about this right? so if it already takes 100ms, it could sometimes take 100ms, and other times take 225ms?

dakejahl · 2025-09-12T00:36:45Z

A work queue is a thread that can be shared by multiple work items. But these work items run sequentially in the thread context. So all items in the queue are blocked by the slowest running item they share it with.

If we don't use another thread and instead interleave non-blocking read/writes there won't really be any jitter since the reads/write are non-blocking. We just need to make sure we're checking the return value of read/write to ensure we don't quietly drop data. The output data (RTCM) needs to be buffered so that if we try to write 200 bytes but instead only write 50, we still have those 150 waiting to be written.

jeremyzff · 2025-09-15T17:09:06Z

we've tested this, and its part of the overall solution for us. By itself, this didn't fix our problem because the injection still took same amount of time and the RX buffer would overflow. But with the reversion of the fsync->tcdrain patch also which made writes nonblocking again, this still makes sure that we process a lot of the GPS buffer if its got backed up while we were away.

drivers/gps: prioritize non-blocking reads over injection

caefe14

dakejahl approved these changes Sep 9, 2025

View reviewed changes

This was referenced Sep 9, 2025

serial: change fsync to tcdrain #23686

Merged

serial: nuttx: revert tcdrain back to fsync #25538

Merged

dakejahl requested a review from alexcekay September 9, 2025 19:27

alexcekay approved these changes Sep 10, 2025

View reviewed changes

jeremyzff approved these changes Sep 15, 2025

View reviewed changes

dakejahl merged commit 41d3403 into main Sep 15, 2025
68 of 71 checks passed

dakejahl deleted the pr-drivers_gps_read_immediately branch September 15, 2025 23:18

dakejahl added the Needs Backport [1.16] label Oct 1, 2025

dakejahl pushed a commit that referenced this pull request Oct 1, 2025

drivers/gps: prioritize non-blocking reads over injection (#25535)

6e8f4c6

dakejahl mentioned this pull request Oct 1, 2025

[BACKPORT 1.16] drivers/gps: prioritize non-blocking reads over injection #25697

Closed

dagar added a commit that referenced this pull request Nov 10, 2025

drivers/gps: prioritize non-blocking reads over injection (#25535)

216fd85

mrpollo pushed a commit that referenced this pull request Nov 24, 2025

drivers/gps: prioritize non-blocking reads over injection (#25535)

acb6f48

dakejahl mentioned this pull request Dec 5, 2025

gps: fix RTCM injection for Moving Base #26042

Closed

alexcekay mentioned this pull request Dec 10, 2025

[GPS]: Use polling to read/write to GPS modules in parallel #26075

Draft

kmk142789 approved these changes Dec 10, 2025

View reviewed changes

dakejahl mentioned this pull request Dec 12, 2025

gps: fix RTCM injection and enable MSM7 for PPK #26095

Open

drivers/gps: prioritize non-blocking reads over injection #25535

drivers/gps: prioritize non-blocking reads over injection #25535

Uh oh!

Conversation

dagar commented Sep 8, 2025

Uh oh!

dakejahl left a comment

Choose a reason for hiding this comment

Uh oh!

jeremyzff commented Sep 9, 2025

Uh oh!

jeremyzff commented Sep 9, 2025

Uh oh!

dakejahl commented Sep 9, 2025

Uh oh!

alexcekay commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dakejahl commented Sep 10, 2025

Uh oh!

jeremyzff commented Sep 11, 2025

Uh oh!

jeremyzff commented Sep 11, 2025

Uh oh!

dakejahl commented Sep 11, 2025

Uh oh!

jeremyzff commented Sep 11, 2025

Uh oh!

jeremyzff commented Sep 11, 2025

Uh oh!

dakejahl commented Sep 12, 2025

Uh oh!

jeremyzff commented Sep 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

alexcekay commented Sep 10, 2025 •

edited

Loading