Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

poll for TIOCINQ and TIOCOUTQ instead of POLLIN, bump repeats #51

Merged
merged 60 commits into from
Oct 28, 2024

Conversation

jackyzha0
Copy link
Member

@jackyzha0 jackyzha0 commented Oct 24, 2024

we noticed a few cases where if a program exited really quickly but dumped a lot of output (e.g. env in the replit shell) we could sometimes miss output

we used to poll the controller side fd for POLLIN but that actually isn't fully correct.

the tty pipes look something like:

controlling program <> controller fd <> user fd <> user program
|---user space------||--------kernel space------||-user space-|

we can sometimes enter cases where the controller side thinks it has no more data to give to the controlling program (nodejs when using @replit/ruspty) but the kernel has decided to block on passing some user fd data to the controller fd (on linux for example, the pipe in the user fd -> controller fd direction has about 4kb of capacity)

for example, if node isnt processing data events quickly enough, the controller-side queue can fill up and the user fd write will block until it has space again, we could rarely enter a race if nodejs decides to read an entire 4kb block completely emptying the controller fd queue and then the POLLIN logic could return with no more data left (even though the user side still has a pending write!)

this would drop data :(

a few changes:

  • rust-side
    • poll TIOCINQ and TIOCOUTQ on controller and user instead of just POLLIN on controller
  • wrapper level
    • trigger 'end' event on the read stream on EIO instead of just calling the cb (that way, other things with handles to the socket can also handle end appropriately)
    • exit condition is now
      • fd closed && fd fully read && program actually exited
  • test level
    • bump repeats from 50 -> 500
    • rewrite it to a more standard async format so errors during the test actually are associated with that test
    • added a test for fast output (>4kb output and fast exit)

@jackyzha0 jackyzha0 changed the title bump repeats, add debug proper canonical mode flags, bump repeats, add debug Oct 24, 2024
@jackyzha0 jackyzha0 changed the title bump repeats, add debug poll for TIOCINQ and TIOCOUTQ instead of POLLIN, bump repeats Oct 28, 2024
Copy link

@blast-hardcheese blast-hardcheese left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

v. nice

wrapper.ts Outdated Show resolved Hide resolved
src/lib.rs Outdated Show resolved Hide resolved
tests/index.test.ts Show resolved Hide resolved
@jackyzha0 jackyzha0 merged commit b81def4 into main Oct 28, 2024
4 checks passed
@jackyzha0 jackyzha0 deleted the jackyzha0/test-robustness branch October 28, 2024 20:51
@lhchavez
Copy link
Contributor

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants