alphaXiv

History

Papers Benchmarks

Tehran Polytechnic

4,922

26 May 2023

computer-science artificial-intelligence computation-and-language

Self-Instruct: Aligning Language Models with Self-Generated Instructions

University of Washington

Allen Institute for AI

Johns Hopkins University

Arizona State University Tehran Polytechnic

Researchers from the University of Washington and the Allen Institute for AI developed SELF-INSTRUCT, a framework that enables language models to generate their own instruction-following training data. This approach allows a GPT-3 model to achieve a 33.1% absolute improvement in ROUGE-L on the SUPER-NATURALINSTRUCTIONS benchmark and perform comparably to InstructGPT-001, demonstrating that self-generated data can effectively align language models with instructions with minimal human annotation.

4,495

566

24 Oct 2022

computer-science artificial-intelligence computation-and-language

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

UC Berkeley

Microsoft

Columbia University

Allen Institute for AI National Univ. of Singapore TCS Research Tehran Polytechnic Univ. of Washington Stanford Univ.Factored AI Arizona State Univ.Univ of Amsterdam Sharif Univ. of Tech.PSG College of Tech.Indian Institute of Tech.Government Polytechnic College Zycus Infotech National Institute of Tech. Karnataka Univ. of Massachusetts, Amherst

Mirali Purohit

Researchers at the Allen Institute for AI and the University of Washington created SUPER-NATURALINSTRUCTIONS, a public meta-dataset of 1,616 diverse NLP tasks, to advance instruction-following capabilities. Their Tk-INSTRUCT model, trained on this benchmark, outperformed the 175B-parameter InstructGPT by 9.9 ROUGE-L points on unseen English tasks and by 13.3 ROUGE-L points on unseen non-English tasks.

08 Dec 2024

adversarial-attacks adversarial-robustness computer-science

Revisiting DeepFool: generalization and improvement

Imperial College London Optum Labs Tehran Polytechnic

Deep neural networks have been known to be vulnerable to adversarial examples, which are inputs that are modified slightly to fool the network into making incorrect predictions. This has led to a significant amount of research on evaluating the robustness of these networks against such perturbations. One particularly important robustness metric is the robustness to minimal

\ell_2

adversarial perturbations. However, existing methods for evaluating this robustness metric are either computationally expensive or not very accurate. In this paper, we introduce a new family of adversarial attacks that strike a balance between effectiveness and computational efficiency. Our proposed attacks are generalizations of the well-known DeepFool (DF) attack, while they remain simple to understand and implement. We demonstrate that our attacks outperform existing methods in terms of both effectiveness and computational efficiency. Our proposed attacks are also suitable for evaluating the robustness of large models and can be used to perform adversarial training (AT) to achieve state-of-the-art robustness to minimal

\ell_2

adversarial perturbations.

23 Feb 2022

computer-science artificial-intelligence computation-and-language

UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

University of Washington

Allen Institute for AI Tehran Polytechnic

We present UnifiedQA-v2, a QA model built with the same process as UnifiedQA, except that it utilizes more supervision -- roughly 3x the number of datasets used for UnifiedQA. This generally leads to better in-domain and cross-domain results.

There are no more papers matching your filters at the moment.