Ask or search anything...

History

Events

Watch Recordings

AI for Law01/09 · Joel Niklaus · Hugging Face

Papers Benchmarks

Hot

SynthLabs.ai

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

08 Jan 2025

Alon Albalak

Kanishk Veerdhaval Gandhi

Anikait Singh

UC Berkeley Stanford University logo

Stanford University

The paper introduces Meta Chain-of-Thought (Meta-CoT), which extends traditional Chain-of-Thought by modeling the underlying reasoning process required for complex problem-solving.

View blog

#computer-science #artificial-intelligence #computation-and-language

Resources

6,612

PERSONA: A Reproducible Testbed for Pluralistic Alignment

24 Jul 2024

Nathan Lile

Stanford University SynthLabs.ai

This paper introduces PERSONA, a reproducible testbed for evaluating pluralistic and personalized language model alignment using a novel approach of procedurally generated synthetic personas and an "LM-as-a-judge" framework. The work validated that frontier LMs can effectively role-play diverse personas and provided empirical insights into effective personalization strategies, demonstrating the importance of persona summarization over Chain of Thought, and highlighting performance differences across models for complex alignment tasks.

View blog

#computer-science #conversational-ai #computation-and-language

Resources

344

Suppressing Pink Elephants with Direct Principle Feedback

13 Feb 2024

Nathan Lile

Brown University SynthLabs.ai

Researchers developed Direct Principle Feedback (DPF), a simplified AI feedback method, to enable large language models to reliably avoid specific "forbidden" topics when prompted. This approach, trained on a synthetically generated dataset, achieved avoidance rates comparable to GPT-4 while maintaining general model performance, addressing a common failure mode where LLMs paradoxically mention what they are told to avoid.

View blog

#computer-science #computation-and-language #reinforcement-learning

Resources

142

There are no more papers matching your filters at the moment.

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

Dark mode

Ask or search anything...

Events