A deterministic discrete denoising algorithm, employing a time-dependent herding method, significantly improves the efficiency and sample quality of discrete-state diffusion models. This "drop-in replacement" achieves up to a 10x improvement in perplexity for text generation and better FID/IS for images, often with fewer inference steps and without requiring model retraining.
View blogAn architecture for measurement-based fault-tolerant quantum computation is introduced, designed for high-connectivity devices to enable megaquop to gigaquop scale operations using moderate physical qubit counts. It leverages Knill's error-correcting teleportation with self-dual CSS codes, offering a resource-efficient path to early fault-tolerant quantum computing.
View blogA unified approach from Osaka University enables accurate reconstruction of both near and far objects in unbounded 3D scenes by representing Gaussian splatting in homogeneous coordinates, achieving state-of-the-art novel view synthesis while maintaining real-time rendering capabilities without requiring scene segmentation or pre-processing steps.
View blogThe Wavy Transformer introduces a novel architecture that re-conceptualizes transformer attention dynamics as second-order wave propagation, moving away from dissipative diffusion to mitigate over-smoothing. This approach enhances model performance across diverse tasks, achieving a +1.99 macro average score gain on GLUE benchmarks and a +0.92 Top-1 accuracy gain for DeiT-Ti models on ImageNet, while preserving feature diversity in deeper layers.
View blogResearchers developed a method for efficient magic state distillation, replacing complex code transformations in Magic State Cultivation (MSC) with lattice surgery. This approach reduces spacetime overhead by over 50% while maintaining comparable logical error rates, and introduces a lookup table for a further 15% reduction through early rejection.
View blogThis research introduces KnowMT-Bench, a new benchmark for evaluating large language models in multi-turn, knowledge-intensive long-form question answering across medical, financial, and legal domains. It quantitatively demonstrates a decline in model factuality and efficiency caused by self-generated conversational noise and shows that Retrieval-Augmented Generation (RAG) effectively reverses this performance degradation.
View blogThis work introduces a decentralized multi-agent world model that integrates Collective Predictive Coding with temporal dynamics, enabling two agents to develop shared symbolic communication for coordination in partially observable environments. The research demonstrates that decentralized learning, without direct access to other agents' internal states, can lead to the emergence of more meaningful, environment-reflective symbolic representations that facilitate improved coordination.
View blogKeisuke Fujii introduces Out-of-Time-Order Correlator (OTOC) spectroscopy, an algorithmic interpretation of higher-order OTOCs using Quantum Signal Processing (QSP). This framework establishes OTOCs as measurements of specific Fourier components of singular value distributions, enabling frequency-selective probing of quantum scrambling and diverse many-body dynamics.
View blogAnimalClue is introduced as the first large-scale, multi-modal dataset specifically designed for automated species identification from indirect biological evidence. It comprises 159,605 bounding boxes and 141,314 segmentation masks for 968 species across five trace types (footprints, feces, eggs, bones, feathers), and includes 22 fine-grained ecological trait annotations, setting new benchmarks for wildlife monitoring.
View blog