Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Aligning LLMs to User Preference

8 minute read

Published:

Table of Contents:

  • Why LLM Alignment
  • Reinforcement learning from human feedback
  • Direct Preference Optimization
  • Kahneman-Tversky Optimization
  • Survey studies
  • Miscellaneous

portfolio

publications

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

works