Inspiration
Nobody likes being tagged in slack for random test failures. What if, instead of waking up to a bug, you can wake up to a bug and an AI-reviewed fix PR ready to merge?
This was our inspiration for building Reflex. The sponsors of this hackathon fit this usecase very well and it sounded like a very fun project to build!
What it does
- Whenever sentry detects an error, it fires a webhook to my server
- My backend deploys a daytona sandbox that mimics the development env in <100ms using a snapshot
- We then use Gemini to recursively fix the bug while testing each change locally in the sandbox
- When the tests pass, a PR is created in the project repo
- This PR is automatically reviewed by code-rabbit
Ultimately - You can spot a bug, go out for a coffee, come back to a fix just sitting there to be merged
Challenges we ran into
- We initially wanted to use the gemini CLI along with the coderabbit CLI to review my code before I make a commit. This idea was proposed by Hendrik from code rabbit and sounded insanely cool.
- We had to drop this idea since coderabbit does not allow you to authenticate using a token as a cli arg
- Networking from a daytona sandbox was a roadblock, but Vedran helped me out!
Accomplishments that we're proud of
Being able to connect 3 products together end to end with a valid usecase in under 6 hours!
What we learned
- How daytona works, how fast it is and how incredibly convenient it is to test AI generated code.
- How the bug tracking scene has evolved with AI. Sentry is Awesome
- Coderabbit's integration is so easy that a 3 year old can figure it out. Very cool product
What's next for Reflex
- Reflex can grow up from here to write it's own unit tests and E2E test cases.
- Leveraging Daytona's computer use capabilities, we can agentically test full UI workflows
- Parallalizing and collaborating between multiple agents, to fix bugs in parallel!
Log in or sign up for Devpost to join the conversation.