Research Engineer focused on AI agents, SWE-bench evaluation, LLM verification, and scalable ML systems.
This is a page not in the menu. You can use markdown in this page.