alphaXiv

History

Papers Benchmarks

Luoyang Institute for Robot and Intelligent Equipment

138

12 Sep 2025

computer-science computer-vision-and-pattern-recognition efficient-transformers

Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization

Meituan CASIA UCAS GigaAI Luoyang Institute for Robot and Intelligent Equipment

Researchers from CASIA, Meituan, GigaAI, and other institutions developed FullVQ (FVQ), a scalable training method for vector-quantized networks that consistently achieves 100% codebook utilization by introducing a novel VQBridge projector. FVQ sets a new state-of-the-art for discrete tokenizers with an rFID of 0.88 and enables autoregressive models to surpass advanced diffusion models in image generation quality without incurring inference overhead.

648

03 Jun 2025

computer-science computer-vision-and-pattern-recognition

Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection

Chinese Academy of Sciences

Tsinghua University HDU Luoyang Institute for Robot and Intelligent Equipment Casivision

Qiyu Chen

Bayesian Prompt Flow Learning (Bayes-PFL) models the text prompt space as a learnable probability distribution using normalizing flows to enhance Zero-Shot Anomaly Detection (ZSAD) with Vision-Language Models. This method achieves state-of-the-art performance across 15 industrial and medical datasets, demonstrating substantial gains, such as a 3.8% improvement in pixel-level AUROC on the ISIC medical dataset.

08 Mar 2025

attention-mechanisms autonomous-vehicles computer-science

Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection

Chinese Academy of Sciences UCAS PhiGent Robotics Luoyang Institute for Robot and Intelligent Equipment

Addresses flawed ground truth generation and enhances geometric relationship utilization in sparse-point 3D lane detection. This improves F1-scores on state-of-the-art models across challenging datasets like OpenLane and ApolloSim.

There are no more papers matching your filters at the moment.

Events

Personalize Your Feed

Install Browser Extension

We're hiring

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Dark mode

Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization

Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection

Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection

Events

AI for Law

Personalize Your Feed