Researchers from National Taiwan Normal University and National Sun Yat-sen University developed GazeNLQ, a framework that integrates estimated gaze information as a third modality into the egocentric video Natural Language Queries (NLQ) task. The framework achieves competitive performance on the Ego4D NLQ dataset, demonstrating the utility of gaze for enhancing temporal localization in first-person videos.
The convergence of quantum-inspired neural networks and deep reinforcement learning offers a promising avenue for financial trading. We implemented a trading agent for USD/TWD by integrating Quantum Long Short-Term Memory (QLSTM) for short-term trend prediction with Quantum Asynchronous Advantage Actor-Critic (QA3C), a quantum-enhanced variant of the classical A3C. Trained on data from 2000-01-01 to 2025-04-30 (80% training, 20% testing), the long-only agent achieves an 11.87% return over roughly 5 years with a 0.92% maximum drawdown, outperforming several currency ETFs. We detail the state design (QLSTM features and technical indicators), a reward function geared toward trend-following and risk control, and multi-core training. Results show that hybrid models yield competitive FX trading performance. Implications include QLSTM's effectiveness for small-profit trades with tight risk limits, along with avenues for future enhancement. Key hyperparameters: QLSTM sequence length = 4, QA3C workers = 8. Limitations: classical simulation of the quantum circuits and a simplified trading strategy. (Disclaimer: The views expressed in this article are those of the authors and do not represent the views of Wells Fargo. This article is for informational purposes only. Nothing contained in this article should be construed as investment advice. Wells Fargo makes no express or implied warranties and expressly disclaims all legal, tax, and accounting implications related to this article.)
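The two headline performance metrics reported above, total return and maximum drawdown, can be computed from an agent's equity curve with a few lines of code. A minimal sketch follows; the sample equity values are illustrative and not taken from the paper:

```python
# Sketch: computing total return and maximum drawdown for an equity
# curve (the two headline metrics reported for the long-only agent).
# The sample curve below is made up for illustration.

def total_return(equity):
    """Fractional return from first to last equity value."""
    return equity[-1] / equity[0] - 1.0

def max_drawdown(equity):
    """Largest peak-to-trough decline, as a fraction of the peak."""
    peak, worst = equity[0], 0.0
    for value in equity:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst

curve = [100.0, 103.0, 101.5, 106.0, 104.9, 111.0]
print(f"return: {total_return(curve):.4f}")        # 0.1100
print(f"max drawdown: {max_drawdown(curve):.4f}")  # 0.0146
```

Drawdown is tracked against the running peak rather than the starting value, which is why the 101.5 dip is measured against the 103.0 high that preceded it.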
A comprehensive review systematically categorizes and analyzes Uncertainty Quantification (UQ) techniques in Artificial Intelligence, distinguishing between aleatoric and epistemic uncertainties and exploring their applications in high-risk domains like healthcare and autonomous driving to foster trustworthy AI.
We report the detection of HCN (J = 3−2) rotational emission from comet 3I/ATLAS at a heliocentric distance of 2.13 AU with the James Clerk Maxwell Telescope (JCMT). Observations were conducted from 07 August 2025 (UT) using the ʻŪʻū heterodyne receiver and ACSIS spectroscopic backend. The HCN line was detected at >5σ on 14 Sep 2025 (UT), and a production rate of Q(HCN) = (4.0 ± 1.7) × 10²⁵ s⁻¹ was derived by non-LTE radiative transfer modelling. Preliminary estimates of the HCN/H₂O and CN/HCN abundance ratios suggest values similar to Solar System comets.
Voice activity detection (VAD) is essential for speech-driven applications, but remains far from perfect in noisy and resource-limited environments. Existing methods often lack robustness to noise, and their frame-wise classification losses are only loosely coupled with the evaluation metric of VAD. To address these challenges, we propose SincQDR-VAD, a compact and robust framework that combines a Sinc-extractor front-end with a novel quadratic disparity ranking loss. The Sinc-extractor uses learnable bandpass filters to capture noise-resistant spectral features, while the ranking loss optimizes the pairwise score order between speech and non-speech frames to improve the area under the receiver operating characteristic curve (AUROC). A series of experiments conducted on representative benchmark datasets shows that our framework considerably improves both AUROC and F2-Score while using only 69% of the parameters of prior models, confirming its efficiency and practical viability.
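The key idea of optimizing pairwise score order for AUROC can be sketched as a pairwise squared-hinge surrogate. The exact form of the paper's quadratic disparity ranking loss is not given in the abstract, so the function below is an illustrative variant, not the authors' implementation:

```python
# Illustrative pairwise quadratic ranking surrogate for AUROC, in the
# spirit of a disparity ranking loss. A pair contributes zero loss once
# the speech frame outscores the non-speech frame by at least `margin`;
# violations are penalized quadratically.

def quadratic_ranking_loss(speech_scores, nonspeech_scores, margin=1.0):
    total, pairs = 0.0, 0
    for s in speech_scores:
        for n in nonspeech_scores:
            total += max(0.0, margin - (s - n)) ** 2
            pairs += 1
    return total / pairs

# Well-separated scores: every pair satisfies the margin, loss is 0.
print(quadratic_ranking_loss([2.0, 3.0], [0.0]))  # 0.0
# Overlapping scores: the violation (0.5 below margin) is squared.
print(quadratic_ranking_loss([0.5], [0.0]))       # 0.25
```

Because AUROC is exactly the fraction of correctly ordered speech/non-speech pairs, a loss over the same pairs is far more tightly coupled to the metric than a frame-wise cross-entropy.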
We report the discovery of a dense molecular ring-like structure in a dense (10⁵ cm⁻³), cold (pc-scale CO depletion by a factor of 5), and young (10⁴ yr) star-forming region, G34.74-0.12, revealed by C¹⁸O (2-1), HNC (1-0), and N₂H⁺ (1-0) observations with the Atacama Large Millimeter/submillimeter Array (ALMA). The ring-like structure is redshifted with respect to the clump, spanning from V_sys,lsr + 0.9 to V_sys,lsr + 2.9 km s⁻¹, with a total mass of 109 M☉. It is spatially coincident with 1.3 mm and 3.0 mm dust continuum emission from cores, and with several protostellar outflows. However, no free-free emission or H II region is detected in association with this structure. With a slow expansion speed indicated by the position-velocity diagram, this ring structure differs from rings previously identified in more evolved star-forming regions. Possible explanations for the ring-like structure include a relic wind-blown bubble produced by a deeply embedded young stellar object, a hollow cavity formed by cloud-cloud interactions, a gas ring resulting from a temperature gradient, or a line-of-sight superposition of multiple outflows or dense clouds. This discovery offers a rare observational glimpse into the earliest dynamical processes involved in massive star formation.
With the increasing application of large language models (LLMs) in the medical domain, evaluating these models' performance using benchmark datasets has become crucial. This paper presents a comprehensive survey of various benchmark datasets employed in medical LLM tasks. These datasets span multiple modalities including text, image, and multimodal benchmarks, focusing on different aspects of medical knowledge such as electronic health records (EHRs), doctor-patient dialogues, medical question-answering, and medical image captioning. The survey categorizes the datasets by modality, discussing their significance, data structure, and impact on the development of LLMs for clinical tasks such as diagnosis, report generation, and predictive decision support. Key benchmarks include MIMIC-III, MIMIC-IV, BioASQ, PubMedQA, and CheXpert, which have facilitated advancements in tasks like medical report generation, clinical summarization, and synthetic data generation. The paper summarizes the challenges and opportunities in leveraging these benchmarks for advancing multimodal medical intelligence, emphasizing the need for datasets with a greater degree of language diversity, structured omics data, and innovative approaches to synthesis. This work also provides a foundation for future research in the application of LLMs in medicine, contributing to the evolving field of medical artificial intelligence.
The rise of Multimodal Large Language Models (MLLMs) has become a transformative force in the field of artificial intelligence, enabling machines to process and generate content across multiple modalities, such as text, images, audio, and video. These models represent a significant advancement over traditional unimodal systems, opening new frontiers in diverse applications ranging from autonomous agents to medical diagnostics. By integrating multiple modalities, MLLMs achieve a more holistic understanding of information, closely mimicking human perception. As the capabilities of MLLMs expand, the need for comprehensive and accurate performance evaluation has become increasingly critical. This survey aims to provide a systematic review of benchmark tests and evaluation methods for MLLMs, covering key topics such as foundational concepts, applications, evaluation methodologies, ethical concerns, security, efficiency, and domain-specific applications. Through the classification and analysis of existing literature, we summarize the main contributions and methodologies of various surveys, conduct a detailed comparative analysis, and examine their impact within the academic community. Additionally, we identify emerging trends and underexplored areas in MLLM research, proposing potential directions for future studies. This survey is intended to offer researchers and practitioners a comprehensive understanding of the current state of MLLM evaluation, thereby facilitating further progress in this rapidly evolving field.
This comprehensive review explores the intersection of Large Language Models (LLMs) and cognitive science, examining similarities and differences between LLMs and human cognitive processes. We analyze methods for evaluating LLMs cognitive abilities and discuss their potential as cognitive models. The review covers applications of LLMs in various cognitive fields, highlighting insights gained for cognitive science research. We assess cognitive biases and limitations of LLMs, along with proposed methods for improving their performance. The integration of LLMs with cognitive architectures is examined, revealing promising avenues for enhancing artificial intelligence (AI) capabilities. Key challenges and future research directions are identified, emphasizing the need for continued refinement of LLMs to better align with human cognition. This review provides a balanced perspective on the current state and future potential of LLMs in advancing our understanding of both artificial and human intelligence.
The ever-increasing number of detections of gravitational waves (GWs) from compact binaries by the Advanced LIGO and Advanced Virgo detectors allows us to perform ever-more sensitive tests of general relativity (GR) in the dynamical and strong-field regime of gravity. We perform a suite of tests of GR using the compact binary signals observed during the second half of the third observing run of those detectors. We restrict our analysis to the 15 confident signals that have false alarm rates ≤ 10⁻³ yr⁻¹. In addition to signals consistent with binary black hole (BH) mergers, the new events include GW200115_042309, a signal consistent with a neutron star–BH merger. We find the residual power, after subtracting the best-fit waveform from the data for each event, to be consistent with the detector noise. Additionally, we find all the post-Newtonian deformation coefficients to be consistent with the predictions of GR, with an improvement by a factor of ~2 in the −1PN parameter. We also find that the spin-induced quadrupole moments of the binary BH constituents are consistent with those of Kerr BHs in GR. We find no evidence for dispersion of GWs, non-GR modes of polarization, or post-merger echoes in the events that were analyzed. We update the bound on the mass of the graviton, at 90% credibility, to m_g ≤ 2.42 × 10⁻²³ eV/c². The final mass and final spin as inferred from the pre-merger and post-merger parts of the waveform are consistent with each other. The studies of the properties of the remnant BHs, including deviations of the quasi-normal-mode frequencies and damping times, show consistency with the predictions of GR. In addition to considering signals individually, we also combine results from the catalog of GW signals to calculate more precise population constraints. We find no evidence in support of physics beyond GR.
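The graviton-mass bound quoted above rests on the standard massive-graviton dispersion relation, under which the GW group velocity becomes frequency dependent and lower-frequency components arrive slightly later; the relevant relations are:

```latex
% Massive-graviton dispersion relation and the resulting
% (subluminal, frequency-dependent) group velocity:
E^2 = p^2 c^2 + m_g^2 c^4
\quad\Longrightarrow\quad
\frac{v_g^2}{c^2} = 1 - \frac{m_g^2 c^4}{E^2},
% The accumulated dephasing over the propagation distance is
% governed by the graviton Compton wavelength:
\lambda_g = \frac{h}{m_g c}.
```

A nonzero m_g would thus imprint a characteristic frequency-dependent phase shift on the inspiral signal; its absence across the catalog is what tightens the bound.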
Large Language Models (LLMs) have rapidly evolved from text-based systems to multimodal platforms, significantly impacting various sectors including healthcare. This comprehensive review explores the progression of LLMs to Multimodal Large Language Models (MLLMs) and their growing influence in medical practice. We examine the current landscape of MLLMs in healthcare, analyzing their applications across clinical decision support, medical imaging, patient engagement, and research. The review highlights the unique capabilities of MLLMs in integrating diverse data types, such as text, images, and audio, to provide more comprehensive insights into patient health. We also address the challenges facing MLLM implementation, including data limitations, technical hurdles, and ethical considerations. By identifying key research gaps, this paper aims to guide future investigations in areas such as dataset development, modality alignment methods, and the establishment of ethical guidelines. As MLLMs continue to shape the future of healthcare, understanding their potential and limitations is crucial for their responsible and effective integration into medical practice.
While pre-trained automatic speech recognition (ASR) systems demonstrate impressive performance on matched domains, their performance often degrades when confronted with channel mismatch stemming from unseen recording environments and conditions. To mitigate this issue, we propose a novel channel-aware data simulation method for robust ASR training. Our method harnesses the synergistic power of channel-extractive techniques and generative adversarial networks (GANs). We first train a channel encoder capable of extracting embeddings from arbitrary audio. On top of this, channel embeddings are extracted using a minimal amount of target-domain data and used to guide a GAN-based speech synthesizer. This synthesizer generates speech that faithfully preserves the phonetic content of the input while mimicking the channel characteristics of the target domain. We evaluate our method on the challenging Hakka Across Taiwan (HAT) and Taiwanese Across Taiwan (TAT) corpora, achieving relative character error rate (CER) reductions of 20.02% and 9.64%, respectively, compared to the baselines. These results highlight the efficacy of our channel-aware data simulation method for bridging the gap between source- and target-domain acoustics.
Observations with the Atacama Large Millimeter/submillimeter Array (ALMA) and the Jansky Very Large Array (JVLA) have revealed many dust rings in protoplanetary disks, often interpreted as dust traps at gas pressure bumps. Previous studies have typically modeled these rings by assuming a single dust species in drift-diffusion equilibrium, neglecting dust size evolution resulting from coagulation and fragmentation. In this work, we perform numerical simulations that incorporate both dust-gas dynamics (drift and diffusion) and dust size evolution. Our results show that the radial distributions of different dust species (up to the fragmentation limit) are nearly identical in the dust ring, as dust growth dominates over drift and diffusion (e.g., with a typical dust-to-gas ratio of ε ∼ 10⁻²). Building on this finding, we develop a comprehensive, self-consistent analytical theory that describes the dust ring structure while explicitly accounting for size evolution effects. Our model provides a unified framework for interpreting multi-wavelength observations by linking the physical dust distribution to the observed ring properties, thus laying the foundation for future observational modeling.
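The fragmentation limit mentioned above is commonly estimated by equating the turbulence-driven collision velocity with the fragmentation threshold; a widely used order-of-magnitude form (the order-unity prefactor varies between treatments, and the paper may use a different one) is:

```latex
% Fragmentation-limited maximum Stokes number of the dust:
% grains stop growing when turbulent collision speeds reach v_frag.
\mathrm{St}_{\rm frag} \simeq \frac{1}{3\alpha}\,\frac{v_{\rm frag}^2}{c_s^2},
% with \alpha the turbulence parameter, v_{\rm frag} the material
% fragmentation threshold velocity, and c_s the local sound speed.
```

Inside a pressure bump, where drift stalls and densities are high, growth proceeds quickly up to this limit, which is why species below it share nearly identical radial distributions.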
This research proposes that Extreme Mass Ratio Binary (EMRB) black hole systems can serve as unique astrophysical laboratories for definitively probing the composition of relativistic jets. The work models multi-wavelength emissions from episodic jet-disk collisions, demonstrating that the ratio of gamma-ray-to-UV emission provides a distinct signature capable of differentiating between leptonic and baryonic jet compositions, with potential for detectable neutrino fluxes.
After AlphaFold's developers won the Nobel Prize, protein structure prediction with deep learning once again became a hot topic. This work comprehensively explores advanced deep learning methods applied to protein structure prediction and design. It begins by examining recent innovations in prediction architectures, with detailed discussions of improvements such as diffusion-based frameworks and novel pairwise attention modules. It then analyses key components, including structure generation, evaluation metrics, multiple sequence alignment processing, and network architecture, illustrating the current state of the art in computational protein modelling. Subsequent chapters focus on practical applications, presenting case studies that range from individual protein predictions to complex biomolecular interactions. Strategies for enhancing prediction accuracy and for integrating deep learning techniques with experimental validation are thoroughly explored. The later sections review the industry landscape of protein design, highlighting the transformative role of artificial intelligence in biotechnology and discussing emerging market trends and future challenges. Supplementary appendices provide essential resources such as databases and open-source tools, making this volume a valuable reference for researchers and students.
Evaluating audio generation systems, including text-to-music (TTM), text-to-speech (TTS), and text-to-audio (TTA), remains challenging due to the subjective and multi-dimensional nature of human perception. Existing methods treat mean opinion score (MOS) prediction as a regression problem, but standard regression losses overlook the relativity of perceptual judgments. To address this limitation, we introduce QAMRO, a novel Quality-aware Adaptive Margin Ranking Optimization framework that seamlessly integrates regression objectives from different perspectives, aiming to highlight perceptual differences and prioritize accurate ratings. Our framework leverages pre-trained audio-text models such as CLAP and Audiobox-Aesthetics, and is trained exclusively on the official AudioMOS Challenge 2025 dataset. It demonstrates superior alignment with human evaluations across all dimensions, significantly outperforming robust baseline models.
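The core idea of a quality-aware adaptive margin, requiring a larger gap between predicted scores when the ground-truth MOS gap is larger, can be sketched in a few lines. The abstract does not specify QAMRO's exact formulation, so the scaling scheme and the `alpha` hyperparameter below are assumptions for illustration:

```python
# Illustrative ranking loss with a quality-aware adaptive margin, in the
# spirit of QAMRO. For each pair where item i has a truly higher MOS than
# item j, the required margin between predictions grows in proportion to
# the ground-truth gap (alpha is an assumed scaling hyperparameter).

def adaptive_margin_ranking_loss(pred, target, alpha=0.5):
    total, pairs = 0.0, 0
    for i in range(len(pred)):
        for j in range(len(pred)):
            gap = target[i] - target[j]
            if gap <= 0:  # only rank pairs where item i is truly better
                continue
            margin = alpha * gap
            total += max(0.0, margin - (pred[i] - pred[j]))
            pairs += 1
    return total / pairs if pairs else 0.0

# Predictions ordered correctly but too close for the 1.5 margin: loss 0.5.
print(adaptive_margin_ranking_loss([4.0, 3.0], [5.0, 2.0]))  # 0.5
# Predictions separated by more than the required margin: loss 0.
print(adaptive_margin_ranking_loss([5.0, 1.0], [5.0, 2.0]))  # 0.0
```

Unlike a plain regression loss, this objective is unaffected by a constant offset in the predictions and penalizes near-ties on pairs that humans rated very differently, which matches the relativity of perceptual judgments the abstract highlights.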
Large Language Models (LLMs) have transformed artificial intelligence by advancing natural language understanding and generation, enabling applications across fields such as healthcare, software engineering, and conversational systems. Despite these advancements in the past few years, LLMs have shown considerable vulnerabilities, particularly to prompt injection and jailbreaking attacks. This review analyzes the state of research on these vulnerabilities and presents available defense strategies. We broadly categorize attack approaches into prompt-based, model-based, multimodal, and multilingual, covering techniques such as adversarial prompting, backdoor injections, and cross-modality exploits. We also review various defense mechanisms, including prompt filtering, transformation, alignment techniques, multi-agent defenses, and self-regulation, evaluating their strengths and shortcomings. We also discuss key metrics and benchmarks used to assess LLM safety and robustness, noting challenges like the quantification of attack success in interactive contexts and biases in existing datasets. Identifying current research gaps, we suggest future directions for resilient alignment strategies, advanced defenses against evolving attacks, automation of jailbreak detection, and consideration of ethical and societal impacts. This review emphasizes the need for continued research and cooperation within the AI community to enhance LLM security and ensure their safe deployment.
This research from National Taiwan Normal University introduces an Adaptive Learning Path Navigation (ALPN) system for e-learning, which employs Attentive Knowledge Tracing (AKT) to model student knowledge and Entropy-enhanced Proximal Policy Optimization (EPPO) for dynamic content recommendations. The system improved students' final learning outcomes by 8.2% compared to existing methods and achieved higher learning path diversity.
Deep learning has transformed AI applications but faces critical security challenges, including adversarial attacks, data poisoning, model theft, and privacy leakage. This survey examines these vulnerabilities, detailing their mechanisms and impact on model integrity and confidentiality. Practical implementations, including adversarial examples, label flipping, and backdoor attacks, are explored alongside defenses such as adversarial training, differential privacy, and federated learning, highlighting their strengths and limitations. Advanced methods like contrastive and self-supervised learning are presented for enhancing robustness. The survey concludes with future directions, emphasizing automated defenses, zero-trust architectures, and the security challenges of large AI models. A balanced approach to performance and security is essential for developing reliable deep learning systems.