Previously, I earned my M.Sc. in Advanced Computer Science from Oxford, supported by a
Google DeepMind Scholarship.
Before that, I completed my Bachelor's at Karlsruhe Institute of Technology,
where I worked with Luca Torresi and Pascal Friederich on chemical property prediction.
I also spent an exciting summer at Carnegie Mellon University as
part of the Robotics Institute Summer Scholars program,
exploring efficient architectures for videos with Jonathon Luiten and Deva Ramanan.
Besides research, I enjoy theatre, board games, and baking 🧁
(I am always happy to trade baking recipes and am on the hunt for somebody to play Set with :D )
I'm interested in computer vision, natural language processing, and how we can combine both to build
more intelligent systems that can reason about and navigate the world in multiple modalities.
Can models reason better with intermediate visualizations, akin to human mental imagery?
MentisOculi evaluates this across frontier models and finds that visual thoughts do not yet improve
reasoning.