Kent State University
This survey characterizes and compares four nascent agent interoperability protocols (MCP, ACP, A2A, ANP) developed by major tech companies and open-source initiatives. It details their architectures, communication patterns, and security mechanisms, culminating in a proposed phased adoption roadmap for building secure and scalable LLM-powered agent ecosystems.
The AMP4EC framework introduces an adaptive solution for efficient deep learning inference in dynamic mobile edge computing environments by adaptively partitioning models and scheduling tasks based on real-time resource availability. It achieved a 78.35% reduction in inference latency and a 414.73% increase in throughput over monolithic deployments.
Large Language Models (LLMs) are increasingly deployed in roles requiring nuanced psychological understanding, such as emotional support agents, counselors, and decision-making assistants. However, their ability to interpret human personality traits, a critical aspect of such applications, remains unexplored, particularly in ecologically valid conversational settings. While prior work has simulated LLM "personas" using discrete Big Five labels on social media data, the alignment of LLMs with continuous, ground-truth personality assessments derived from natural interactions is largely unexamined. To address this gap, we introduce a novel benchmark comprising semi-structured interview transcripts paired with validated continuous Big Five trait scores. Using this dataset, we systematically evaluate LLM performance across three paradigms: (1) zero-shot and chain-of-thought prompting with GPT-4.1 Mini, (2) LoRA-based fine-tuning applied to both RoBERTa and Meta-LLaMA architectures, and (3) regression using static embeddings from pretrained BERT and OpenAI's text-embedding-3-small. Our results reveal that all Pearson correlations between model predictions and ground-truth personality traits remain below 0.26, highlighting the limited alignment of current LLMs with validated psychological constructs. Chain-of-thought prompting offers minimal gains over zero-shot, suggesting that personality inference relies more on latent semantic representation than explicit reasoning. These findings underscore the challenges of aligning LLMs with complex human attributes and motivate future work on trait-specific prompting, context-aware modeling, and alignment-oriented fine-tuning.
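The embedding-regression paradigm (3) can be sketched in a few lines: fit a closed-form ridge regressor from fixed-dimensional text embeddings to a continuous trait score, then report the held-out Pearson correlation. The data below is synthetic and the dimensions are arbitrary, standing in for transcript embeddings and Big Five scores rather than the paper's actual pipeline:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins: 200 "transcripts" embedded into 64-dim static vectors
# (the paper uses BERT / text-embedding-3-small; sizes here are arbitrary).
X = rng.normal(size=(200, 64))
y = X @ rng.normal(size=64) * 0.1 + rng.normal(size=200)  # one trait score

def ridge_fit(X, y, lam=1.0):
    """Closed-form ridge regression: w = (X^T X + lam I)^-1 X^T y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def pearson(a, b):
    a, b = a - a.mean(), b - b.mean()
    return float(a @ b / np.sqrt((a @ a) * (b @ b)))

w = ridge_fit(X[:150], y[:150])          # fit on a training split
r = pearson(X[150:] @ w, y[150:])        # correlate predictions on held-out data
print(f"held-out Pearson r = {r:.3f}")
```

The reported sub-0.26 correlations correspond to `r` in this setup, computed against validated trait scores instead of synthetic targets.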
Object detection is a fundamental task in computer vision and image understanding, with the goal of identifying and localizing objects of interest within an image while assigning them corresponding class labels. Traditional methods, which relied on handcrafted features and shallow models, struggled with complex visual data and showed limited performance. These methods combined low-level features with contextual information and lacked the ability to capture high-level semantics. Deep learning, especially Convolutional Neural Networks (CNNs), addressed these limitations by automatically learning rich, hierarchical features directly from data. These features include both semantic and high-level representations essential for accurate object detection. This paper reviews object detection frameworks, starting with classical computer vision methods. We categorize object detection approaches into two groups: (1) classical computer vision techniques and (2) CNN-based detectors. We compare major CNN models, discussing their strengths and limitations. In conclusion, this review highlights the significant advancements in object detection through deep learning and identifies key areas for further research to improve performance.
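Both classical and CNN-based detectors are scored by how well predicted boxes localize objects, most commonly via Intersection-over-Union (IoU); a minimal sketch:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)   # overlap area (0 if disjoint)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # 25/175, a partial overlap
```

A detection typically counts as correct when its IoU with a ground-truth box exceeds a threshold such as 0.5.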
An empirical study evaluates Large Language Models' (LLMs) comprehension of compiler Intermediate Representations (IRs), identifying limitations in their ability to accurately reconstruct control flow graphs, perform precise instruction-level decompilation, and simulate program execution. The research demonstrates that current LLMs often rely on high-level heuristics from source code pre-training, struggling with the detailed, low-level semantics of IRs.
Mental health disorders remain among the leading causes of disability worldwide, yet conditions such as depression, anxiety, and Post-Traumatic Stress Disorder (PTSD) are frequently underdiagnosed or misdiagnosed due to subjective assessments, limited clinical resources, stigma, and low awareness. In primary care settings, studies show that providers misidentify depression or anxiety in over 60% of cases, highlighting the urgent need for scalable, accessible, and context-aware diagnostic tools that can support early detection and intervention. In this study, we evaluate the effectiveness of machine learning models for mental health screening using a unique dataset of 553 real-world, semi-structured interviews, each paired with ground-truth diagnoses for major depressive episodes (MDE), anxiety disorders, and PTSD. We benchmark multiple model classes, including zero-shot prompting with GPT-4.1 Mini and Meta-LLaMA, as well as fine-tuned RoBERTa models using Low-Rank Adaptation (LoRA). Our models achieve over 80% accuracy across diagnostic categories, with especially strong performance on PTSD (up to 89% accuracy and 98% recall). We also find that using shorter, focused context segments improves recall, suggesting that focused narrative cues enhance detection sensitivity. LoRA fine-tuning proves both efficient and effective, with lower-rank configurations (e.g., rank 8 and 16) maintaining competitive performance across evaluation metrics. Our results demonstrate that LLM-based models can offer substantial improvements over traditional self-report screening tools, providing a path toward low-barrier, AI-powered early diagnosis. This work lays the groundwork for integrating machine learning into real-world clinical workflows, particularly in low-resource or high-stigma environments where access to timely mental health care is most limited.
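The accuracy and recall figures cited above are standard binary-screening metrics; a small self-contained sketch (the labels are toy values, not the study's data):

```python
def accuracy_recall(y_true, y_pred):
    """Binary screening metrics: accuracy and recall (sensitivity)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true), tp / (tp + fn) if tp + fn else 0.0

# Toy screening labels: one false positive, no missed cases, so recall is perfect
# even though accuracy is not. In screening, high recall is usually the priority.
acc, rec = accuracy_recall([1, 1, 0, 0, 1], [1, 1, 1, 0, 1])
print(acc, rec)
```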
19 Sep 2025
One can estimate the change of the Perron and Fiedler values for a connected network when the weight of an edge is perturbed by analyzing the relevant entries of the Perron and Fiedler vectors. This is helpful for identifying edges whose weight perturbation causes the largest change in the Perron and Fiedler values. It is also important to investigate the sensitivity of the Perron and Fiedler vectors to perturbations. Applications of the perturbation analysis include the identification of edges that are critical for the structural robustness of the network.
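For a symmetric weighted adjacency matrix with unit-norm Perron vector v, standard first-order perturbation theory gives the Perron-value change under an ε-perturbation of edge (i, j) as approximately 2εv_i v_j. The sketch below checks this numerically on a small graph (the graph and weights are illustrative, not from the paper):

```python
import numpy as np

# Weighted adjacency matrix of a small connected graph (a path 0-1-2-3).
A = np.array([[0., 2., 0., 0.],
              [2., 0., 1., 0.],
              [0., 1., 0., 3.],
              [0., 0., 3., 0.]])

vals, vecs = np.linalg.eigh(A)
perron_val, v = vals[-1], vecs[:, -1]   # largest eigenvalue and its unit eigenvector

i, j, eps = 1, 2, 1e-3
estimate = 2 * eps * v[i] * v[j]        # first-order change: v^T (dA) v

A_pert = A.copy()
A_pert[i, j] += eps
A_pert[j, i] += eps
actual = np.linalg.eigh(A_pert)[0][-1] - perron_val
print(estimate, actual)                 # agree up to O(eps^2)
```

The product v_i v_j is positive for every edge (Perron-Frobenius), so the estimate directly ranks edges by how strongly their weight influences the Perron value; the Fiedler-value analysis proceeds analogously with the Laplacian and its second-smallest eigenpair.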
This study explores the effectiveness of AI tools in enhancing student learning, specifically in improving study habits, time management, and feedback mechanisms. The research focuses on how AI tools can support personalized learning, adaptive test adjustments, and provide real-time classroom analysis. Student feedback revealed strong support for these features, and the study found a significant reduction in study hours alongside an increase in GPA, suggesting positive academic outcomes. Despite these benefits, challenges such as over-reliance on AI and difficulties in integrating AI with traditional teaching methods were also identified, emphasizing the need for AI tools to complement conventional educational strategies rather than replace them. Data were collected through a survey with a Likert scale and follow-up interviews, providing both quantitative and qualitative insights. The analysis involved descriptive statistics to summarize demographic data, AI usage patterns, and perceived effectiveness, as well as inferential statistics (T-tests, ANOVA) to examine the impact of demographic factors on AI adoption. Regression analysis identified predictors of AI adoption, and qualitative responses were thematically analyzed to understand students' perspectives on the future of AI in education. This mixed-methods approach provided a comprehensive view of AI's role in education and highlighted the importance of privacy, transparency, and continuous refinement of AI features to maximize their educational benefits.
A multi-agent framework enables efficient code translation by orchestrating collaboration between specialized compact LLMs, achieving comparable performance to GPT-4 while running on common hardware through a combination of director-guided translation, NLI-based verification, and iterative refinement techniques.
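The director-guided translate-verify-refine loop can be sketched as follows, with the translator and NLI-based verifier replaced by trivial stubs (the real system would call the specialized compact LLMs; all function names here are hypothetical):

```python
def translate(src, feedback=None):
    """Stub for the director-guided translator model."""
    return "fn add(a: i32, b: i32) -> i32 { a + b }"

def verify(src, candidate):
    """Stub for the NLI-based verifier: returns (passed, feedback)."""
    return ("add" in candidate, "translation is missing function 'add'")

def translate_with_refinement(src, max_rounds=3):
    """Iteratively translate, verify, and feed failures back to the translator."""
    feedback = None
    candidate = ""
    for _ in range(max_rounds):
        candidate = translate(src, feedback)
        ok, feedback = verify(src, candidate)
        if ok:
            return candidate
    return candidate  # best effort after the round budget is spent
```

With real models, `feedback` would carry the verifier's entailment failures so each refinement round targets the discrepancies it found.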
Large Language Models (LLMs) are demonstrating remarkable human-like capabilities across diverse domains, including psychological assessment. This study evaluates whether LLMs, specifically GPT-4o and GPT-4o mini, can infer Big Five personality traits and generate Big Five Inventory-10 (BFI-10) item scores from user conversations under zero-shot prompting conditions. Our findings reveal that incorporating an intermediate step, prompting for BFI-10 item scores before calculating traits, enhances accuracy and aligns more closely with the gold standard than direct trait inference. This structured approach underscores the importance of leveraging psychological frameworks to improve predictive precision. Additionally, a group comparison based on the presence of depressive symptoms revealed differential model performance. Participants were categorized into two groups: those experiencing at least one depressive symptom and those without symptoms. GPT-4o mini demonstrated heightened sensitivity to depression-related shifts in traits such as Neuroticism and Conscientiousness within the symptom-present group, whereas GPT-4o exhibited strengths in nuanced interpretation across groups. These findings underscore the potential of LLMs to analyze real-world psychological data effectively, offering a valuable foundation for interdisciplinary research at the intersection of artificial intelligence and psychology.
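The intermediate scoring step, aggregating item-level answers into traits, can be illustrated as below. BFI-10 traits are each the mean of one positively keyed and one reverse-keyed item on a 1-5 scale; the item-to-trait keying shown is illustrative and should be checked against the published instrument before use:

```python
def reverse(score):
    """Reverse-key a 1-5 Likert score."""
    return 6 - score

def trait_scores(items):
    """items: dict mapping BFI-10 item number (1-10) to a 1-5 score.
    Keying below (positively keyed item, reverse-keyed item) is illustrative."""
    pairs = {
        "Extraversion":      (6, 1),
        "Agreeableness":     (2, 7),
        "Conscientiousness": (8, 3),
        "Neuroticism":       (9, 4),
        "Openness":          (10, 5),
    }
    return {trait: (items[pos] + reverse(neg_item := items[neg])) / 2
            if False else (items[pos] + reverse(items[neg])) / 2
            for trait, (pos, neg) in pairs.items()}
```

Prompting the model for the ten item scores and aggregating them this way is the structured alternative the study found more accurate than asking for trait values directly.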
Researchers developed MindGuide, an AI-powered chatbot leveraging LangChain and GPT-4 to provide accessible, empathetic mental health guidance for anxiety, depression, and suicidal thoughts. The system establishes a therapeutic persona and engages in context-aware conversations, demonstrating a functional architecture for digital mental health support.
Klein et al. investigated how confinement influences active nematics, revealing that specific boundary conditions lead to the spontaneous emergence of periodic, optimally mixing defect motions. Their work identified novel optimal braiding patterns for three and four defects, demonstrating a fundamental connection between boundary topology, active forces, and fluid mixing efficiency.
As a promising paradigm for collaboratively training models with decentralized data, Federated Learning (FL) can be exploited to fine-tune Large Language Models (LLMs). However, because LLMs are huge and the training data scales accordingly, fine-tuning incurs tremendous computation and communication costs. The training data is generally non-Independent and Identically Distributed (non-IID), which requires adaptive data processing within each device. Although Low-Rank Adaptation (LoRA) can significantly reduce the number of parameters to update during fine-tuning, transferring the low-rank parameters of all the layers in an LLM still takes unaffordable time. In this paper, we propose a Fisher Information-based Efficient Curriculum Federated Learning framework (FibecFed) with two novel methods, i.e., adaptive federated curriculum learning and efficient sparse parameter update. First, we propose a Fisher information-based method to adaptively sample data within each device to improve the effectiveness of the FL fine-tuning process. Second, we dynamically select the proper layers for global aggregation and sparse parameters for local update with LoRA so as to improve the efficiency of the FL fine-tuning process. Extensive experimental results on 10 datasets demonstrate that FibecFed yields excellent performance (up to 45.35% higher accuracy) and superb fine-tuning speed (up to 98.61% faster) compared with 17 baseline approaches.
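The parameter savings LoRA provides, and which FibecFed further sparsifies, come from replacing a full weight update with two trainable low-rank factors; a minimal numpy sketch (layer dimensions and rank are assumed for illustration, not taken from the paper):

```python
import numpy as np

d_in, d_out, r, alpha = 1024, 1024, 8, 16
rng = np.random.default_rng(0)

W = rng.normal(size=(d_out, d_in))       # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01    # trainable down-projection
B = np.zeros((d_out, r))                 # trainable up-projection, zero at init

def lora_forward(x):
    """W x plus the scaled low-rank correction (alpha / r) B A x."""
    return W @ x + (alpha / r) * (B @ (A @ x))

full = d_out * d_in                      # parameters in a full update
lora = r * (d_in + d_out)                # parameters in the LoRA update
print(f"trainable params: {lora:,} vs {full:,} ({100 * lora / full:.2f}%)")
```

Only `A` and `B` are communicated in FL fine-tuning, which is why further restricting which layers are aggregated, as FibecFed does, directly cuts transfer time.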
The nucleon exhibits a rich internal structure governed by Quantum Chromodynamics (QCD), where its electric charge arises from valence quarks, while its spin and mass emerge from complex interactions among valence quarks, sea (anti-)quarks, and gluons. At the advent of QCD, an alternative hypothesis emerged suggesting that, at high energies, the transport of a nucleon's baryon number could be traced by a non-perturbative configuration of gluon fields connecting its three valence quarks, forming a Y-shaped topology known as the gluon junction. Recent measurements by the STAR experiment are compatible with this scenario. In light of these measurements, this study aims to explore the mechanisms of baryon transport in high-energy nuclear collisions using the PYTHIA-8 framework, which incorporates a state-of-the-art hadronization model with advanced Color Flow (CF) and Color Reconnection (CR) mechanisms that mimic signatures of a baryon junction. Within this model setup, we investigate (i) the rapidity slope of the net-baryon distributions in photon-induced processes (γ+p) and (ii) baryon over charge transport in isobaric (Ru+Ru and Zr+Zr) collisions. Our study highlights the importance of the CF and CR mechanisms in PYTHIA-8, which play a crucial role in baryon transport. The results show that the CF and CR schemes significantly affect the isobaric baryon-to-charge ratio, leading to different predictions for baryon stopping and underscoring the need to account for CF and CR effects in comparisons with experimental measurements.
Accurate prediction of protein active site structures remains a central challenge in structural biology, particularly for short and flexible peptide fragments where conventional methods often fail. Here, we present a quantum computing framework specifically developed for utility-level quantum processors to address this problem. Starting from an amino acid sequence, we formulate the structure prediction task as a ground-state energy minimization problem using the Variational Quantum Eigensolver (VQE). Amino acid connectivity is encoded on a tetrahedral lattice model, and structural constraints, including steric, geometric, and chirality terms, are mapped into a problem-specific Hamiltonian expressed as sparse Pauli operators. The optimization is executed via a two-stage architecture separating energy estimation and measurement decoding, allowing noise mitigation under realistic quantum device conditions. We evaluate the framework on 23 randomly selected real protein fragments from the PDBbind dataset, as well as 7 real fragments from proteins with therapeutic potential, and run the experiments on the IBM-Cleveland Clinic quantum processor. Structural predictions are benchmarked against AlphaFold3 (AF3) using identical postprocessing and docking procedures. Our quantum method outperformed AF3 in both RMSD (Root-Mean-Square Deviation) and docking efficacy. This work demonstrates, for the first time, a complete end-to-end pipeline for biologically relevant structure prediction on real quantum hardware, highlighting its engineering feasibility and practical advantage over existing classical and deep learning approaches.
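VQE searches for the minimum of ⟨ψ(θ)|H|ψ(θ)⟩ over a parametrized circuit. For intuition, a toy Hamiltonian built from Pauli-Z terms (a stand-in for the paper's constraint Hamiltonian, not its actual lattice encoding) can be diagonalized exactly to obtain the ground-state energy a VQE run would target:

```python
import numpy as np
from functools import reduce

I = np.eye(2)
Z = np.diag([1.0, -1.0])

def pauli_z_term(n, qubits, coeff):
    """coeff * (Z on the listed qubits, identity elsewhere), as an n-qubit matrix."""
    ops = [Z if q in qubits else I for q in range(n)]
    return coeff * reduce(np.kron, ops)

# Toy 3-qubit Hamiltonian: a single-qubit penalty plus a two-qubit coupling,
# mimicking how steric/geometric constraints appear as weighted Pauli terms.
H = pauli_z_term(3, {0}, 0.5) + pauli_z_term(3, {1, 2}, 1.0)
ground_energy = np.linalg.eigvalsh(H).min()
print(ground_energy)  # -1.5 for this toy Hamiltonian
```

Exact diagonalization scales exponentially in qubit count, which is precisely why the paper resorts to VQE on quantum hardware for realistic fragment sizes.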
The construction industry increasingly relies on visual data to support Artificial Intelligence (AI) and Machine Learning (ML) applications for site monitoring. High-quality, domain-specific datasets, comprising images, videos, and point clouds, capture site geometry and spatiotemporal dynamics, including the location and interaction of objects, workers, and materials. However, despite growing interest in leveraging visual datasets, existing resources vary widely in sizes, data modalities, annotation quality, and representativeness of real-world construction conditions. A systematic review to categorize their data characteristics and application contexts is still lacking, limiting the community's ability to fully understand the dataset landscape, identify critical gaps, and guide future directions toward more effective, reliable, and scalable AI applications in construction. To address this gap, this study conducts an extensive search of academic databases and open-data platforms, yielding 51 publicly available visual datasets that span the 2005-2024 period. These datasets are categorized using a structured data schema covering (i) data fundamentals (e.g., size and license), (ii) data modalities (e.g., RGB and point cloud), (iii) annotation frameworks (e.g., bounding boxes), and (iv) downstream application domains (e.g., progress tracking). This study synthesizes these findings into an open-source catalog, OpenConstruction, supporting data-driven method development. Furthermore, the study discusses several critical limitations in the existing construction dataset landscape and presents a roadmap for future data infrastructure anchored in the Findability, Accessibility, Interoperability, and Reusability (FAIR) principles. By reviewing the current landscape and outlining strategic priorities, this study supports the advancement of data-centric solutions in the construction sector.
This paper investigates the use of Large Language Models (LLMs) to synthesize public opinion data, addressing challenges in traditional survey methods like declining response rates and non-response bias. We introduce a novel technique: role creation based on knowledge injection, a form of in-context learning that leverages retrieval-augmented generation (RAG) together with personality profiles specified via the HEXACO model and demographic information, and uses them to dynamically generate prompts. This method allows LLMs to simulate diverse opinions more accurately than existing prompt engineering approaches. We compare our results with pre-trained models using standard few-shot prompts. Experiments using questions from the Cooperative Election Study (CES) demonstrate that our role-creation approach significantly improves the alignment of LLM-generated opinions with real-world human survey responses, increasing answer adherence. In addition, we discuss challenges, limitations, and future research directions.
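A role-creation prompt of the kind described, combining a HEXACO profile, demographics, and retrieved context, might be assembled as follows (field names, wording, and example values are illustrative, not the paper's templates):

```python
def build_role_prompt(hexaco, demographics, retrieved_passages, question):
    """Assemble a role-creation prompt from a HEXACO profile, demographics,
    and RAG-retrieved context. All wording here is an illustrative sketch."""
    profile = ", ".join(f"{trait}: {score}/5" for trait, score in hexaco.items())
    context = "\n".join(f"- {p}" for p in retrieved_passages)
    return (
        f"You are a survey respondent. Personality (HEXACO): {profile}.\n"
        f"Demographics: {demographics}.\n"
        f"Relevant background:\n{context}\n"
        f"Answer the following survey question in character:\n{question}"
    )

prompt = build_role_prompt(
    {"Honesty-Humility": 4, "Emotionality": 2},
    "age 34, rural Ohio, college-educated",
    ["Local unemployment rose 1.2% last year."],
    "Do you approve of the current economic policy?",
)
print(prompt)
```

The dynamically generated prompt is then sent to the LLM once per simulated respondent, with the profile and retrieved knowledge varied across the synthetic sample.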
The Automatic Multi-step Distillation (AMD) method effectively compresses large-scale vision transformers by up to 10x, yielding a 1.34-15.89% accuracy improvement over baselines on CIFAR datasets and training approximately 10 times faster than existing multi-step distillation techniques.
In this paper, we perform a systematic examination of recommendation losses, including listwise (softmax), pairwise (BPR), and pointwise (mean-squared error, MSE, and Cosine Contrastive Loss, CCL) losses, through the lens of contrastive learning. We introduce and study both the debiased InfoNCE loss and the mutual information neural estimator (MINE), for the first time, under the recommendation setting. We also relate and differentiate these two losses from the BPR loss through a lower-bound analysis. Furthermore, we present debiased pointwise losses (for both MSE and CCL) and theoretically certify that iALS and EASE, two of the most popular linear models, are inherently debiased. The empirical results demonstrate that the debiased losses and the newly introduced mutual-information losses outperform the existing (biased) ones.
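One common form of the debiased InfoNCE correction reweights the negative term to account for positives hidden among sampled "negatives"; a sketch (this is a generic debiased-contrastive formulation under an assumed positive-class prior, not necessarily the paper's exact estimator):

```python
import numpy as np

def debiased_infonce(s_pos, s_negs, tau_plus=0.1, t=1.0):
    """Debiased InfoNCE for one anchor.
    s_pos: similarity score of the positive item; s_negs: scores of sampled
    negatives; tau_plus: assumed prior probability that a sampled "negative"
    is actually a positive; t: temperature."""
    pos = np.exp(s_pos / t)
    negs = np.exp(np.asarray(s_negs) / t)
    n = len(negs)
    # Debiased estimate of the true-negative term, clamped at its theoretical floor.
    g = np.maximum((negs.mean() - tau_plus * pos) / (1 - tau_plus), np.exp(-1 / t))
    return float(-np.log(pos / (pos + n * g)))
```

Setting `tau_plus=0` recovers the standard (biased) InfoNCE; a positive prior shrinks the negative term, so the debiased loss is never larger than the biased one for the same scores.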
In this work, we define an attack vector for networks utilizing Internet of Medical Things (IoMT) devices and compute the probability distribution of IoMT security threats based on a Markov chain and the Common Vulnerability Scoring System (CVSS). IoMT is an emerging technology that improves patients' quality of life by permitting personalized e-health services without restrictions on time and place. The IoMT consists of embedded objects, sensors, and actuators that transmit and receive medical data. These medical devices are vulnerable to different types of security threats and thus pose a significant risk to patients' privacy and safety. Because security is a critical factor for successfully merging IoMT into pervasive healthcare systems, there is an urgent need for new security mechanisms to prevent threats at the IoMT edge network. Toward this direction, the first step is defining an attack vector that an attacker or unauthorized user can take advantage of to penetrate and tamper with medical data. In this article, we specify a threat model for the IoMT edge network. We identify vulnerabilities or weaknesses within the IoMT network that allow unauthorized privileges, as well as threats that can exploit these weaknesses to compromise the IoMT edge network. Finally, we compute the probability distribution of IoMT threats based on the Markov transition probability matrix.
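The final step, propagating a threat distribution through a Markov transition matrix, can be sketched as below; the states and transition probabilities are assumed placeholders, not the CVSS-derived values from the article:

```python
import numpy as np

# Illustrative 3-state threat model: Secure, Compromised-Edge, Data-Breach.
# Each row gives the transition probabilities out of one state and sums to 1;
# in the article these would be derived from CVSS scores.
P = np.array([[0.90, 0.08, 0.02],
              [0.30, 0.50, 0.20],
              [0.00, 0.10, 0.90]])

# Distribution after n steps from a fully secure start: pi_n = pi_0 P^n.
pi = np.array([1.0, 0.0, 0.0])
for _ in range(50):
    pi = pi @ P
print(pi)  # long-run probability of being in each threat state
```

Ranking the states by their long-run probabilities indicates which parts of the IoMT edge network most urgently need hardening.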