I am a Staff Research Scientist in the Google DeepMind team in Paris.
Between 2019 and 2022, I led fundamental research on distributional RL, imitation learning, offline RL, and RL from human feedback. I am very proud to be a contributor of the Acme library.
In late 22/early 23, I built the RLHF layer of the first version of Bard (now "Gemini App").
Since late 23, I am leading post-training of Gemma models.
contact: robert (dot) dadashi (at) gmail (dot) com