Andrew M. Bean
I am a DPhil candidate at Oxford researching the evaluation of advanced AI systems. My research mixes technical elements in the building and evaluation of AI agents with experimental methods in online user studies.
I have written papers about evaluations, reasoning, alignment and human data in venues such as NeurIPS (1x Best Paper, 2x Oral presentations), EMNLP, and GoodIT. I have been an invited speaker at Meta, the MIT Media Lab, MilaNLP and Ofcom. My research has been covered in the media by publications such as Financial Times, The Guardian, and the MIT Technology Review.
news
| Dec 16, 2024 | NeurIPS Best Paper for PRISM!Our project PRISM has won the Best Paper Award for the 2024 NeurIPS Datasets and Benchmarks track! PRISM was selected out of more than 1800 submissions, and had some of the best reviews ever given at NeurIPS (and LingOly was close behind!). |
|---|---|
| Dec 12, 2024 | NeurIPS Oral Presentation for LingOly!Our project LingOly was selected for an Oral presentation at NeurIPS 2024! Out of more than 1800 submissions, only the top 11 (0.6%) were chosen for oral presentation, and the reviews included a recommendation for an award! |
| Nov 4, 2024 | Press Coverage for Measuring what Matters!Measuring what Matters was covered in The Guardian, Gizmodo, and NBC News! This paper was a collaboration with 43 authors from top institutions around the world, and will be presented at NeurIPS 2025. |
selected publications
-
- Nature MedicineUnder Review