Resources

On this page, we share seminal, influential, and innovative works that have made waves in the RAI space, each with a short TL;DR on its impact and potential applications.

Surveys

Benchmarks

Methodologies

Testing and red-teaming

Guardrails

Alignment

Interpretability

Repositories

  • Awesome-LM-SSP - Reading list for safety, security, and privacy in large models; maintained by researchers from Tsinghua University, HKSU, and Xi'an Jiaotong University
  • Awesome-LLM-Judges - Research on using LLM judges for automated evaluation; maintained by Haize Labs