Fix: read error leads to message skip for blocking API #292

kusstas · 2024-12-19T23:13:44Z

ISSUE: Read error leads to message skip. It happens due consuming STX byte of message, even if message parsing isn't completed yet.

GOAL: I have custom readers that could return WouldBlock IO error at any time and code shall interrupt message parsing and return to this later.

pv42 · 2024-12-21T21:54:05Z

This seems to be an issue worth fixing. But I think this does not seem to compleatly fix the issue at least as I understand it.
I wrote the following test that uses a reader that alternates beween returning would block and reading a single byte:

    struct BlockyReader {
        block_next_read: bool,
        index: usize,
    }
    impl Read for BlockyReader {
        fn read(&mut self, buf: &mut [u8]) -> Result<usize> {
            if self.block_next_read {
                self.block_next_read = false;
                Result::Err(Error::new(ErrorKind::WouldBlock, "Test Block"))
            } else {
                let read = HEARTBEAT_V2.get(self.index).ok_or(Error::new(
                    ErrorKind::UnexpectedEof,
                    "EOF",
                ));
                buf[0] = *read?;
                self.index += 1;
                self.block_next_read = true;
                Ok(1)
            }
        }
    }

    #[test]
    fn test_read_error() {
        let mut reader = PeekReader::new(BlockyReader {
            block_next_read: true,
            index: 0,
        });
        loop {
            match read_v2_msg::<mavlink::common::MavMessage, _>(&mut reader) {
                Ok((header, _)) => {
                    assert_eq!(header, crate::test_shared::COMMON_MSG_HEADER);
                    break;
                },
                Err(MessageReadError::Io(err)) if err.kind() == ErrorKind::WouldBlock => (),
                Err(err) => panic!("{err}"),
            }
        }
    }

which fails. The problem seems to be that PeekReader::fetch calls io::Read::read_exact() which then calls read() multiple times discarding all previously read data in case of an error.
This PR still fixes the issue when the block occures directly after the STX byte, as in this reader:

impl Read for BlockyReader {
        fn read(&mut self, buf: &mut [u8]) -> Result<usize> {
            if self.block_next_read {
                self.block_next_read = false;
                Result::Err(Error::new(ErrorKind::WouldBlock, "Test Block"))
            } else {
                let read = HEARTBEAT_V2.get(self.index).ok_or(Error::new(
                    ErrorKind::UnexpectedEof,
                    "EOF",
                ));
                buf[0] = *read?;
                self.index += 1;
                if self.index <= 1 {
                    self.block_next_read = true;
                }
                Ok(1)
            }
        }
    }

but if at any other position the message still gets trown out.
Regardless if this is the intended fix or not there should probably be a test case for it.

kusstas · 2024-12-22T15:19:05Z

Hi, thank you for response. Yeah, I've not tested case where reader can read less data than expect. I've modified fetch function to use read instead of read_exact to store all data that read to buffer and added test cases for each version of mavlink read.

pv42 · 2024-12-22T17:26:45Z

f801bea seems to have broken the process_log_files test, it does not terminate anymore (at least for me).
This seems to be happening because std::io::default_read_exact() that performs the work for read_exact() has a special abort condition where it returns an error when 0 bytes where read:

while !buf.is_empty() {
    match this.read(buf) {
        Ok(0) => break,
        ...
    }
}
if !buf.is_empty() { Err(Error::READ_EXACT_EOF) } else { Ok(()) }

kusstas · 2024-12-22T17:45:59Z

Yeah, I've added case for zero check on read, and now seems tests with features listed in workflow work well.

Fix read error leads to skip message for blocking API

f96dacd

kusstas changed the title ~~Fix read error leads to message skip for blocking API~~ Fix: read error leads to message skip for blocking API Dec 19, 2024

feat: implement non-blocking read handling in PeekReader and add tests

f801bea

kusstas force-pushed the master branch from 93a7cb7 to 92cf19b Compare December 22, 2024 17:47

fix: handle unexpected EOF in PeekReader in case zero bytes has read

771564f

kusstas force-pushed the master branch from 92cf19b to 771564f Compare December 22, 2024 17:48

kusstas added 2 commits December 23, 2024 21:42

fix: change reader_ref method to return an immutable reference

b18d43c

feat: add read method to Read trait for embedded traits

e481084

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: read error leads to message skip for blocking API #292

Fix: read error leads to message skip for blocking API #292

kusstas commented Dec 19, 2024

pv42 commented Dec 21, 2024

kusstas commented Dec 22, 2024 •

edited

Loading

pv42 commented Dec 22, 2024

kusstas commented Dec 22, 2024

Fix: read error leads to message skip for blocking API #292

Are you sure you want to change the base?

Fix: read error leads to message skip for blocking API #292

Conversation

kusstas commented Dec 19, 2024

pv42 commented Dec 21, 2024

kusstas commented Dec 22, 2024 • edited Loading

pv42 commented Dec 22, 2024

kusstas commented Dec 22, 2024

kusstas commented Dec 22, 2024 •

edited

Loading