Home
Contact
Contact
Contact
Contact
HRK
Hello World
I’m Hannah Rose Kirk.

keywords = {
Large Language Models
;
Online Safety
;
Bias Mitigation
;
Statistics
;
China AI
;
Large Language Models
;
Oxford Internet Institute
;
NYU
;
Oxford AI Society
;
Cambridge University
;
Peking University
;
Oxford Internet Institute
;
Sci-Fi Books
;
Sushi
;
Documentaries
;
Emoji 😺😸
;
Cycling
;
Sci-Fi Books
;
}
LEARN MORE

print MySummary

I currently research largeĀ language models @ the University of Oxford. In the short term, I'm a visiting academic @ New York University.

My current research centres on human-and-model-in-the-loop learning and data-centric alignment of AI. I am passionate about the societal impact of AI systems, focusing on the role that dataset generation, curation and labelling has on value alignment in large-scale foundation models.

My body of published work spans computational linguistics, economics, ethics and sociology, addressing a broad range of issues such as alignment, bias, fairness and hate speech from a multidisciplinary perspective. Alongside academia, I collaborate often with industry. Previously, I worked at The AlanĀ Turing Institute on online safety and assisted product development at a start-up.

Education

.class GetDegrees

2021 - 2024

Oxford Internet Institute, University of Oxford

DPhil in Social Data Science
Fully-funded scholarship
Supervised by Dr Scott A. Hale & Dr Bertie Vidgen
2020 - 2021

Oxford Internet Institute, University of Oxford

MSc in Social Data Science
Distinction, 77%
Awarded the Oxford Internet Institute Thesis Prize for best graduate dissertation
2018-2020

Yenching Academy, Peking University

MA in China Studies and Economics
GPA: 3.99, Rank: 2/99
2015 - 2018

Trinity College, University of Cambridge

BA in Economics
Double First Class Honours
Awarded the Roger Dennis Prize for best undergraduate dissertation

Positions

.class AddExperience

Sept 2023 - Present

New York University

Visiting Academic in Data Science
Collaborating on human-AI coordination and LLMĀ alignment with Professor He & Professor Bowman
February 2023 - Present

Google

External Student Researcher
Co-hosting an adversarial challenge to identify unsafe failure modes in text2image models
August 2023 - Present

OpenAI

Red-Teamer + Consultant
Improving the safety of OpenAI models (DALL-E & GPT-4)
Sept 2021 - Sept 2023

The Alan Turing Institute

Data Scientist in Online Safety
Monitoring and detecting harmful language
Sept 2021 - July 2023

Rewire Online

Research Scientist
Implementing NLP solutions for online safety
Oct 2020 - Present

Oxford Artificial Intelligence Society

Research Labs Manager
Leading student research projects on AIĀ bias
Sept 2019 - Sept 2020

The Berggruen Institute, China Center

Research Scholar
Linking Chinese philosophy to AI and privacy

Grants

.class Find$$$

2023-2024

Microsoft Accelerating Foundation Models Research Programme

Project title: ā€œPERDI: Personalised and Diverse feedback for humans-and-models-in-the-loop"
2022-2024

MetaAI Dynabench Grant

Project title: ā€œOptimizing feedback between humans-and-model-in-the-loop
2020-2024

Economic and Social Science Research Council

PhD scholarship, Digitial Social Science Pathway

Publications

return Output

03
/
07
/
23

SemEval-2023 Task 10: Explainable Detection of Online Sexism

SemEval @ ACL 2023
Hannah Rose Kirk, Wenjie Yin, Bertie Vidgen, Paul Rƶttger
11
/
16
/
22

Handling and Presenting Harmful Text in NLP Research

EMNLP 2022
Hannah Rose Kirk, Abeba Birhane, Bertie Vidgen, Leon Derczynski
09
/
23
/
22

A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning

AACL 2022
Hugo Berg, Siobhan Mackenzie Hall, Yash Bhalgat, Wonsuk Yang, Hannah Rose Kirk, Aleksandar Shtedritski, Max Bain
09
/
06
/
22

Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

NAACL 2022
Hannah Rose Kirk, Bertram Vidgen, Paul Rƶttger, Tristan Thrush & Scott A. Hale
08
/
02
/
22

Tracking abuse on Twitter against football players in the 2021-22 Premier League season

Policy Report
Bertie Vidgen, Yi-Ling Chung, Pica Johansson, Hannah Rose Kirk, Angus Williams, Scott A. Hale, Helen Margetts, Paul Rƶttger, Laila Sprejer
05
/
23
/
22

Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements

GeBNLP @ NAACL 2022
Conrad Borchers, Dalia Sara Gala, Benjamin Gilburt, Eduard Oravkin, Wilfried Bounsi, Yuki M. Asano, Hannah Rose Kirk
12
/
01
/
21

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models

NeurIPS 2021
Hannah Rose Kirk, Yennie Jun, Haider Iqbal, Elias Benussi, Filippo Volpin, Frederic A. Dreyer, Aleksandar Shtedritski & Yuki M. Asano
08
/
01
/
21

Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset

WOAH @ ACL 2021
Hannah Rose Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, & Yuki M Asano
08
/
19
/
20

The Nuances of Confucianism in Technology Policy: an Inquiry into the Interaction Between Cultural and Political Systems in Chinese Digital Ethics

International Journal of Politics, Culture, and Society
Hannah Rose Kirk, Kangkyu Lee & Carlisle Micallef