Keio University
Keio University researchers developed Deneb, an automatic evaluation metric for image captioning that significantly enhances robustness against hallucinations. It achieves state-of-the-art performance among LLM-free metrics across hallucination detection and human correlation benchmarks, while maintaining efficient inference times.
2
We theoretically investigate the single-particle excitation spectra of a one-dimensional Hubbard model at half filling using an infinite matrix-product state and elucidate the discretized energy spectra emerging under the influence of a dc electric field. In a weak electric-field regime, we observe two kinds of spectral structures in the density of states. With increasing the electric-field strength, the discretized spectra, the period of which is proportional to the strength, become dominant, and the density of states exhibits the Wannier-Stark ladder in their spectra. In addition, we also simulate time- and angle-resolved photoemission spectroscopy using an ultrashort terahertz pump pulse that approximates a dc electric field. Our results represent a significant step forward in understanding the states in strongly correlated electron systems driven by a static electric field.
·
A new benchmark, WMDP, quantifies hazardous knowledge in large language models across biosecurity, cybersecurity, and chemical security, designed to be a public and automatic evaluation tool. The paper also introduces RMU, an unlearning method that effectively reduces this hazardous knowledge in models to near-random levels while largely preserving general model capabilities and enhancing resistance to adversarial attacks.
·
Constrained combinatorial optimization problems (CCOPs) are challenging to solve due to the exponential growth of the solution space. When tackled with Ising machines, constraints are typically enforced by the penalty function method, whose coefficients must be carefully tuned to balance feasibility and objective quality. Variable-reduction techniques such as sample persistence variable reduction (SPVAR) can mitigate hardware limitations of Ising machines, yet their behavior on CCOPs remains insufficiently understood. Building on our prior proposal, we extend and comprehensively evaluate multi-penalty SPVAR (MP-SPVAR), which fixes variables using solution persistence aggregated across multiple penalty coefficients. Experiments on benchmark problems, including the quadratic assignment problem and the quadratic knapsack problem, demonstrate that MP-SPVAR attains higher feasible-solution ratios while matching or improving approximation ratios relative to the conventional SPVAR algorithm. An examination of low-energy states under small penalties clarifies when feasibility degrades and how encoding choices affect the trade-off between solution quality and feasibility. These results position MP-SPVAR as a practical variable-reduction strategy for CCOPs and lay a foundation for systematic penalty tuning, broader problem classes, and integration with quantum-inspired optimization hardware as well as quantum algorithms.
Log-Normal Multiplicative Dynamics (LMD), drawing inspiration from biological synapses, enables the first stable training-from-scratch of large networks like Vision Transformers and GPT-2 using multiplicative updates. This approach maintains full model performance with low-precision MXFP6 and MXFP4 forward passes, effectively addressing previous instability and excessive weight growth issues.
1
A theoretical framework demonstrates how the quantum metric, a core concept in quantum geometry, modifies Liouville's theorem and the dynamics of chiral kinetic theory, expanding its implications across various physical systems.
Researchers at Google DeepMind and the University of Cambridge developed a psychometrically valid methodology to measure and shape personality traits in large language models. Their work demonstrated that larger, instruction-tuned LLMs exhibit reliable and valid synthetic personality profiles that can be purposefully controlled and influence their behavior in generative tasks.
3
Researchers from The University of Tokyo and Keio University developed Spatial-CLAP, a stereo audio-text embedding model that unifies content and spatial information into a fixed-dimensional representation for multi-source audio. The model achieves state-of-the-art performance in audio-text retrieval (e.g., 20.79% A2T Recall@1 in 2-src conditions) and correctly assigns content to space (81.69% accuracy in 2-src), while also generalizing to unobserved 3-source scenarios.
7
Density matrix exponentiation (DME) is a general procedure that converts an unknown quantum state into the Hamiltonian evolution. This enables state-dependent operations and can reveal nontrivial properties of the state, among other applications, without full tomography. However, it has been proven that for any physical process, the DME requires Θ(1/ε)\Theta(1/\varepsilon) state copies in error ε\varepsilon. In this work, we go beyond the lower bound and propose a procedure called the virtual DME that achieves O(log(1/ε))\mathcal{O}(\log(1/\varepsilon)) or O(1)\mathcal{O}(1) state copies, by using non-physical processes. Using the virtual DME in place of its conventional counterpart realizes a general-purpose quantum algorithm for property estimation, that achieves exponential circuit-depth reductions over existing protocols across tasks including quantum principal component analysis, quantum emulator, calculation of nonlinear functions such as entropy, and linear system solver with quantum precomputation. In such quantum algorithms, the non-physical process for virtual DME can be effectively simulated via simple classical post-processing while retaining a near-unity measurement overhead. We numerically verify this small constant overhead together with the exponential reduction of copy count in the quantum principal component analysis task. The number of state copies used in our algorithm essentially saturates the theoretical lower bound we proved.
Researchers at the Center for AI Safety empirically evaluate whether current AI safety benchmarks truly measure safety progress or merely reflect general model capabilities and training compute. Their meta-analysis reveals that approximately half of these benchmarks are highly correlated with capabilities, suggesting a phenomenon termed 'safetywashing' where capability gains are misrepresented as safety improvements.
The EMAGE framework unifies full-body co-speech gesture generation, encompassing facial expressions, local body movements, hand gestures, and global translations, by integrating audio and customizable masked gesture priors. It introduces BEAT2, a large-scale, mesh-level dataset, and achieves state-of-the-art quantitative performance and user preference for realism and expressiveness.
Recent advances in combining Clifford circuits with tensor network (TN) states have shown that classically simulable disentanglers can significantly reduce entanglement, mitigating the bond-dimension bottleneck in TN simulations. In this work, we develop a variational TN framework based on Grassmann tensor networks, which natively encode fermionic statistics while preserving locality. By incorporating locally defined Clifford circuits within the fermionic formalism, we simulate benchmark models including the tight-binding and tt-VV models. Our results show that Clifford disentangling removes the classically simulable component of entanglement, leading to a reduced bond dimension and improved accuracy in ground-state energy estimates. Interestingly, imposing the natural Grassmann-evenness constraint on the Clifford circuits significantly reduces the number of disentangling gates, from 720 to just 32, yielding a far more efficient implementation. These findings highlight the potential of Clifford-augmented Grassmann TNs as a scalable and accurate tool for studying strongly correlated fermionic systems, particularly in higher dimensions.
Researchers from The University of Tokyo, Microsoft, and Keio University developed BigCodec, a neural speech codec that scales up model size to achieve high-quality speech coding at an ultra-low bitrate of 1.04 kbps. This system outperforms existing low-bitrate codecs and demonstrates objective performance comparable to codecs operating at significantly higher bitrates.
196
Regression discontinuity design (RDD) is widely adopted for causal inference under intervention determined by a continuous variable. While one is interested in treatment effect heterogeneity by subgroups in many applications, RDD typically suffers from small subgroup-wise sample sizes, which makes the estimation results highly instable. To solve this issue, we introduce hierarchical RDD (HRDD), a hierarchical Bayes approach for pursuing treatment effect heterogeneity in RDD. A key feature of HRDD is to employ a pseudo-model based on a loss function to estimate subgroup-level parameters of treatment effects under RDD, and assign a hierarchical prior distribution to ''borrow strength'' from other subgroups. The posterior computation can be easily done by a simple Gibbs sampling, and the optimal bandwidth can be automatically selected by the Hyv\"{a}rinen scores for unnormalized models. We demonstrate the proposed HRDD through simulation and real data analysis, and show that HRDD provides much more stable point and interval estimation than separately applying the standard RDD method to each subgroup.
We reveal the power of Grover's algorithm from thermodynamic and geometric perspectives by showing that it is a product formula approximation of imaginary-time evolution (ITE), a Riemannian gradient flow on the special unitary group. This viewpoint uncovers three key insights. First, we show that the ITE dynamics trace the shortest path between the initial and the solution states in complex projective space. Second, we prove that the geodesic length of ITE determines the query complexity of Grover's algorithm. This complexity notably aligns with the known optimal scaling for unstructured search. Lastly, utilizing the geodesic structure of ITE, we construct a quantum signal processing formulation for ITE without post-selection, and derive a new set of angles for the fixed-point search. These results collectively establish a deeper understanding of Grover's algorithm and suggest a potential role for thermodynamics and geometry in quantum algorithm design.
Domain wall (DW) networks may have formed in the early universe following the spontaneous breaking of a discrete symmetry. Notably, several particle physics models predict the existence of current-carrying DWs, which can capture and store particles as zero modes on it. In this study, we demonstrate that gravitational waves (GWs) generated by current-carrying DWs with fermionic zeromodes exhibit a novel feature: an additional peak in the GW spectrum resembling mountains, arising from metastable topological remnants, which we term ``spherons.'' This distinct signature could be detectable in upcoming GW observatories such as LISA and ET. The results suggest that DW networks in beyond Standard Model scenarios could emit GW signals that are significantly stronger and with greater detectability than previously expected.
MaskDiffusion, developed by Kawano and Aoki from Keio University, performs open-vocabulary and unsupervised semantic segmentation by leveraging the internal feature maps and cross-attention mechanisms of pre-trained Stable Diffusion models without additional training. The method achieved improvements of +10.5 mIoU on Potsdam and +14.8 mIoU on COCO-Stuff compared to existing open-vocabulary approaches, demonstrating its flexibility in segmenting objects based on arbitrary text descriptions.
28
Solving partial differential equations for extremely large-scale systems within a feasible computation time serves in accelerating engineering developments. Quantum computing algorithms, particularly the Hamiltonian simulations, present a potential and promising approach to achieve this purpose. Actually, there are several oracle-based Hamiltonian simulations with potential quantum speedup, but their detailed implementations and accordingly the detailed computational complexities are all unclear. This paper presents a method that enables us to explicitly implement the quantum circuit for Hamiltonian simulation; the key technique is the explicit gate construction of differential operators contained in the target partial differential equation discretized by the finite difference method. Moreover, we show that the space and time complexities of the constructed circuit are exponentially smaller than those of conventional classical algorithms. We also provide numerical experiments and an experiment on a real device for the wave equation to demonstrate the validity of our proposed method.
Efficiently preparing approximate ground-states of large, strongly correlated systems on quantum hardware is challenging and yet nature is innately adept at this. This has motivated the study of thermodynamically inspired approaches to ground-state preparation that aim to replicate cooling processes via imaginary-time evolution. However, synthesizing quantum circuits that efficiently implement imaginary-time evolution is itself difficult, with prior proposals generally adopting heuristic variational approaches or using deep block encodings. Here, we use the insight that quantum imaginary-time evolution is a solution of Brockett's double-bracket flow and synthesize circuits that implement double-bracket flows coherently on the quantum computer. We prove that our Double-Bracket Quantum Imaginary-Time Evolution (DB-QITE) algorithm inherits the cooling guarantees of imaginary-time evolution. Concretely, each step is guaranteed to i) decrease the energy of an initial approximate ground-state by an amount proportion to the energy fluctuations of the initial state and ii) increase the fidelity with the ground-state. We provide gate counts for DB-QITE through numerical simulations in Qrisp which demonstrate scenarios where DB-QITE outperforms quantum phase estimation. Thus DB-QITE provides a means to systematically improve the approximation of a ground-state using shallow circuits.
ETH Zurich logoETH ZurichUniversity of Washington logoUniversity of WashingtonCNRS logoCNRSUniversity of Pittsburgh logoUniversity of PittsburghUniversity of Cambridge logoUniversity of CambridgeUniversity of FreiburgHeidelberg UniversityLeibniz University HannoverNortheastern University logoNortheastern UniversityUCLA logoUCLAImperial College London logoImperial College LondonUniversity of Manchester logoUniversity of ManchesterUniversity of ZurichNew York University logoNew York UniversityUniversity of BernUniversity of StuttgartUC Berkeley logoUC BerkeleyUniversity College London logoUniversity College LondonFudan University logoFudan UniversityGeorgia Institute of Technology logoGeorgia Institute of TechnologyNational Taiwan Universitythe University of Tokyo logothe University of TokyoUniversity of California, Irvine logoUniversity of California, IrvineUniversity of BonnTechnical University of BerlinUniversity of Bristol logoUniversity of BristolUniversity of Michigan logoUniversity of MichiganUniversity of EdinburghUniversity of Hong KongUniversity of Alabama at BirminghamNorthwestern University logoNorthwestern UniversityUniversity of BambergUniversity of Florida logoUniversity of FloridaEmory University logoEmory UniversityUniversity of CologneHarvard Medical SchoolUniversity of Pennsylvania logoUniversity of PennsylvaniaUniversity of Southampton logoUniversity of SouthamptonFlorida State UniversityEPFL logoEPFLUniversity of Wisconsin-Madison logoUniversity of Wisconsin-MadisonMassachusetts General HospitalChongqing UniversityKeio UniversityUniversity of Alberta logoUniversity of AlbertaKing’s College London logoKing’s College LondonFriedrich-Alexander-Universität Erlangen-NürnbergUniversity of LuxembourgTechnical University of Munich logoTechnical University of MunichUniversity of Duisburg-EssenSapienza University of RomeUniversity of HeidelbergUniversity of SheffieldHKUST logoHKUSTUniversity of GenevaWashington University in St. LouisTU BerlinUniversity of GlasgowUniversity of SiegenUniversity of PotsdamUniversidade Estadual de CampinasUniversity of OldenburgThe Ohio State University logoThe Ohio State UniversityUniversity of LeicesterGerman Cancer Research Center (DKFZ)University of BremenUniversity of ToulouseUniversity of MiamiKarlsruhe Institute of Technology logoKarlsruhe Institute of TechnologyPeking Union Medical CollegeUniversity of OuluUniversity of HamburgUniversity of RegensburgUniversity of BirminghamUniversity of LeedsChinese Academy of Medical SciencesINSERMUniversity of Basel logoUniversity of BaselPeking Union Medical College HospitalUniversity of LausanneUniversity of LilleUniversity of PoitiersUniversity of PassauUniversity of LübeckKing Fahd University of Petroleum and MineralsUniversity of LondonUniversity of NottinghamUniversity of Erlangen-NurembergUniversity of BielefeldSorbonne UniversityUniversity of South FloridaWake Forest UniversityUniversity of CalgaryUniversity of Picardie Jules VerneIBMUniversity of Göttingen logoUniversity of GöttingenUniversity of BordeauxUniversity of MannheimUniversity of California San FranciscoNIHUniversity of KonstanzUniversity of Electro-CommunicationsUniversity of WuppertalUniversity of ReunionUNICAMPUniversity of TrierHasso Plattner InstituteUniversity of BayreuthHeidelberg University HospitalUniversity of StrasbourgDKFZUniversity of LorraineInselspital, Bern University Hospital, University of BernUniversity of WürzburgUniversity of La RochelleUniversity of LyonUniversity of HohenheimUniversity Medical Center Hamburg-EppendorfUniversity of UlmUniversity Hospital ZurichUniversity of TuebingenUniversity of KaiserslauternUniversity of NantesUniversity of MainzUniversity of PaderbornUniversity of KielMedical University of South CarolinaUniversity of RostockThe University of Texas MD Anderson Cancer CenterNational Research Council (CNR)Hannover Medical SchoolItalian National Research CouncilUniversity of MuensterUniversity of MontpellierUniversity of LeipzigUniversity of GreifswaldUniversity Hospital BernSiemens HealthineersThe University of Alabama at BirminghamNational Institutes of HealthUniversity of MarburgUniversity of Paris-SaclayUniversity of LimogesUniversity of Clermont AuvergneUniversity of DortmundUniversity of GiessenKITUniversity of ToulonChildren’s Hospital of PhiladelphiaUniversity of JenaNational Taiwan University HospitalUniversity of SaarlandUniversity of ErlangenNational Cancer InstituteUniversity Hospital HeidelbergSwiss Federal Institute of Technology LausanneUniversity of Texas Health Science Center at HoustonNational Institute of Biomedical Imaging and BioengineeringUniversity of New CaledoniaUniversity of Koblenz-LandauParis Diderot UniversityUniversity of ParisInselspital, Bern University HospitalUniversity of Grenoble AlpesUniversity Hospital BaselMD Anderson Cancer CenterUniversity of AngersUniversity of French PolynesiaUniversity of MagdeburgUniversity of Geneva, SwitzerlandOulu University HospitalUniversity of ToursFriedrich-Alexander-University Erlangen-NurnbergUniversity of Rennes 1Wake Forest School of MedicineNIH Clinical CenterParis Descartes UniversityUniversity of Rouen NormandieUniversity of Aix-MarseilleUniversity of Perpignan Via DomitiaUniversity of Caen NormandieUniversity of FrankfurtUniversity of BochumUniversity of Bourgogne-Franche-ComtéUniversity of Corsica Pasquale PaoliNational Institute of Neurological Disorders and StrokeUniversity of HannoverRoche DiagnosticsUniversity of South BrittanyUniversity of DüsseldorfUniversity of Reims Champagne-ArdenneUniversity of HalleIRCCS Fondazione Santa LuciaUniversity of Applied Sciences TrierUniversity of Southampton, UKUniversity of Nice–Sophia AntipolisUniversit de LorraineUniversité Paris-Saclay["École Polytechnique Fédérale de Lausanne"]RWTH Aachen UniversityUniversity of Bern, Institute for Advanced Study in Biomedical InnovationCRIBIS University of AlbertaThe Cancer Imaging Archive (TCIA)Fraunhofer Institute for Medical Image Computing MEVISMedical School of HannoverIstituto di Ricovero e Cura a Carattere Scientifico NeuromedFondazione Santa Lucia IRCCSCEA, LIST, Laboratory of Image and Biomedical SystemsUniversity of Alberta, CanadaHeidelberg University Hospital, Department of NeuroradiologyUniversity of Bern, SwitzerlandUniversity of DresdenUniversity of SpeyerUniversity of Trier, GermanyUniversity of Lorraine, FranceUniversity of Le Havre NormandieUniversity of Bretagne OccidentaleUniversity of French GuianaUniversity of the AntillesUniversity of Bern, Institute of Surgical Technology and BiomechanicsUniversity of Bern, ARTORG Center for Biomedical Engineering ResearchUniversity of Geneva, Department of RadiologyUniversity of Zürich, Department of NeuroradiologyRuhr-University-Bochum
·
Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles disseminated across multi-parametric magnetic resonance imaging (mpMRI) scans, reflecting varying biological properties. Their heterogeneous shape, extent, and location are some of the factors that make these tumors difficult to resect, and in some cases inoperable. The amount of resected tumor is a factor also considered in longitudinal scans, when evaluating the apparent tumor for potential diagnosis of progression. Furthermore, there is mounting evidence that accurate segmentation of the various tumor sub-regions can offer the basis for quantitative image analysis towards prediction of patient overall survival. This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018. Specifically, we focus on i) evaluating segmentations of the various glioma sub-regions in pre-operative mpMRI scans, ii) assessing potential tumor progression by virtue of longitudinal growth of tumor sub-regions, beyond use of the RECIST/RANO criteria, and iii) predicting the overall survival from pre-operative mpMRI scans of patients that underwent gross total resection. Finally, we investigate the challenge of identifying the best ML algorithms for each of these tasks, considering that apart from being diverse on each instance of the challenge, the multi-institutional mpMRI BraTS dataset has also been a continuously evolving/growing dataset.
There are no more papers matching your filters at the moment.