SynthLabs.ai
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
08 Jan 2025

The paper introduces Meta Chain-of-Thought (Meta-CoT), which extends traditional Chain-of-Thought by modeling the underlying reasoning process required for complex problem-solving.

PERSONA: A Reproducible Testbed for Pluralistic Alignment
24 Jul 2024

This paper introduces PERSONA, a reproducible testbed for evaluating pluralistic and personalized language model alignment using a novel approach of procedurally generated synthetic personas and an "LM-as-a-judge" framework. The work validated that frontier LMs can effectively role-play diverse personas and provided empirical insights into effective personalization strategies, demonstrating the importance of persona summarization over Chain of Thought, and highlighting performance differences across models for complex alignment tasks.

Suppressing Pink Elephants with Direct Principle Feedback
13 Feb 2024

Researchers developed Direct Principle Feedback (DPF), a simplified AI feedback method, to enable large language models to reliably avoid specific "forbidden" topics when prompted. This approach, trained on a synthetically generated dataset, achieved avoidance rates comparable to GPT-4 while maintaining general model performance, addressing a common failure mode where LLMs paradoxically mention what they are told to avoid.
