“Techno-Cautious-Optimist”
Professional Experience
Independent AI Research
Independent Research Project “TraDe” (September 2023 - Oct 2024)
Built an automated interpretability technique called Transformer Decompiler (TraDe) from the ground up.
Funded by a Research Grant from Open Philanthropy / Hasler Foundation
In Collaboration with the Pattern Recognition Research Group (University of Bern)
Employed as a Researcher (Jan-May 2024) at the University of Bern
Paper was presented at ANNPR 2024
Collaboration with Jeremy Scheurer of Appollo Research (November 2023 - July 2024)
Generated a dataset of interpretability testbeds.
Created an evaluation tool for LLMs that investigates their ability to code in a novel programming language.
Paper was presented at the ICML - Mechanistic interpretability workshop 2024
Bühler
Quarterly AI Consulting (February 2024 - Present)
Advised on the long-term strategy in and application of AI
Zetcom
AI-Intern (June - September 2022)
Set up an evaluation framework for deep learning APIs in NLP and computer vision.
Integrated the APIs in the existing code base.
Consulted on the future use of various AI-based technology.
Filmmaking
Self-employed at Lyght (2019-2023)
Produced videos for big and small companies
Advised on advertisement and communication strategy
Project manager at Video Factory (2017 - 2019)
Oversaw and implemented the production of Video Advertisements
Played a pivotal role in shaping the company's overall strategic direction.
Swiss Military
Mandatory Service (2018)
Completed the 18-week Swiss military school as a reconnaissance specialist.
Projects
AI safety fundamentals
Founded the organisation “Bern AI Safety” BAIS to facilitate six-week courses on the fundamentals of AI safety with a focus on the alignment problem.
GPT-2 Interpretability
Took part in an Appart Research Interpretability-Hackathon and won 2nd prize for examining GPT-2 to localize the circuits responsible for its ability to handle brackets. (Project Report)
Evolutionary training and hyperparameter search
Created a custom RNN from the ground up in Python and performed hyperparameter optimisation through evolutionary training (bachelor thesis).
Reddit-like student platform
Won first prize in the University of Bern software development competition with a typescript-based forum platform for students.
Adversarial Interpretability Experiments
Conducted a series of experiments which showed that transformers could learn to be more resistant to SAE-based interpretability methods. Included large hyperparameter sweep and adversarial training setups.
AutoBench
Implemented a framework to quickly and automatically produce and apply model evaluations that probe an LLMs values along an axis of the users choosing and immediately compile and visualize the results. (Blog Post)
Education
BSc in Computer Science at University of Bern September 2020 - August 2023
Minor in mathematics, philosophy and information systems.
Teaching assistant in “programming for scientists”
ARENA - Alignment Research Engineer Accelerator - September 2024
Skills & Hobbies
Programming Languages: Python, Java, Javascript/Typescript, Dart
Frameworks: TensorFlow / Keras, PyTorch, Flutter, Angular
Languages: English, German (both equally fluent)
Hobbies: Programming, Fitness, Landscape Photography, Paragliding
Interests: Philosophy, Politics, History of Technology, Data Visualisations
Why AI safety?
A few yers ago AI safety seemed like a topic only relevant in sci-fi movies. I still have a lot of sympathy for this position. Read more about it and why i now care so deeply about AI safety in this blog post i wrote in 2022: