Research Engineer focused on AI agents, SWE-bench evaluation, LLM verification, and scalable ML systems.
Short description of portfolio item number 1
Short description of portfolio item number 2