Direct Preference Optimization Beyond Chatbots — Blankdot