University of Oldenburg
This research reveals high token redundancy in 3D point cloud transformer architectures, demonstrating that up to 90-95% of tokens can be merged with minimal performance loss. It introduces `gitmerge3D`, a globally-informed graph token merging strategy tailored for 3D data, which reduces FLOPs by 5.3x and memory usage by 6.4x for semantic segmentation on ScanNet while maintaining accuracy.
ETH Zurich logoETH ZurichUniversity of Washington logoUniversity of WashingtonCNRS logoCNRSUniversity of Pittsburgh logoUniversity of PittsburghUniversity of Cambridge logoUniversity of CambridgeUniversity of FreiburgHeidelberg UniversityLeibniz University HannoverNortheastern University logoNortheastern UniversityUCLA logoUCLAImperial College London logoImperial College LondonUniversity of Manchester logoUniversity of ManchesterUniversity of ZurichNew York University logoNew York UniversityUniversity of BernUniversity of StuttgartUC Berkeley logoUC BerkeleyUniversity College London logoUniversity College LondonFudan University logoFudan UniversityGeorgia Institute of Technology logoGeorgia Institute of TechnologyNational Taiwan Universitythe University of Tokyo logothe University of TokyoUniversity of California, Irvine logoUniversity of California, IrvineUniversity of BonnTechnical University of BerlinUniversity of Bristol logoUniversity of BristolUniversity of Michigan logoUniversity of MichiganUniversity of EdinburghUniversity of Hong KongUniversity of Alabama at BirminghamNorthwestern University logoNorthwestern UniversityUniversity of BambergUniversity of Florida logoUniversity of FloridaEmory University logoEmory UniversityUniversity of CologneHarvard Medical SchoolUniversity of Pennsylvania logoUniversity of PennsylvaniaUniversity of Southampton logoUniversity of SouthamptonFlorida State UniversityEPFL logoEPFLUniversity of Wisconsin-Madison logoUniversity of Wisconsin-MadisonMassachusetts General HospitalChongqing UniversityKeio UniversityUniversity of Alberta logoUniversity of AlbertaKing’s College London logoKing’s College LondonFriedrich-Alexander-Universität Erlangen-NürnbergUniversity of LuxembourgTechnical University of Munich logoTechnical University of MunichUniversity of Duisburg-EssenSapienza University of RomeUniversity of HeidelbergUniversity of SheffieldHKUST logoHKUSTUniversity of GenevaWashington University in St. LouisTU BerlinUniversity of GlasgowUniversity of SiegenUniversity of PotsdamUniversidade Estadual de CampinasUniversity of OldenburgThe Ohio State University logoThe Ohio State UniversityUniversity of LeicesterGerman Cancer Research Center (DKFZ)University of BremenUniversity of ToulouseUniversity of MiamiKarlsruhe Institute of Technology logoKarlsruhe Institute of TechnologyPeking Union Medical CollegeUniversity of OuluUniversity of HamburgUniversity of RegensburgUniversity of BirminghamUniversity of LeedsChinese Academy of Medical SciencesINSERMUniversity of Basel logoUniversity of BaselPeking Union Medical College HospitalUniversity of LausanneUniversity of LilleUniversity of PoitiersUniversity of PassauUniversity of LübeckKing Fahd University of Petroleum and MineralsUniversity of LondonUniversity of NottinghamUniversity of Erlangen-NurembergUniversity of BielefeldSorbonne UniversityUniversity of South FloridaWake Forest UniversityUniversity of CalgaryUniversity of Picardie Jules VerneIBMUniversity of Göttingen logoUniversity of GöttingenUniversity of BordeauxUniversity of MannheimUniversity of California San FranciscoNIHUniversity of KonstanzUniversity of Electro-CommunicationsUniversity of WuppertalUniversity of ReunionUNICAMPUniversity of TrierHasso Plattner InstituteUniversity of BayreuthHeidelberg University HospitalUniversity of StrasbourgDKFZUniversity of LorraineInselspital, Bern University Hospital, University of BernUniversity of WürzburgUniversity of La RochelleUniversity of LyonUniversity of HohenheimUniversity Medical Center Hamburg-EppendorfUniversity of UlmUniversity Hospital ZurichUniversity of TuebingenUniversity of KaiserslauternUniversity of NantesUniversity of MainzUniversity of PaderbornUniversity of KielMedical University of South CarolinaUniversity of RostockThe University of Texas MD Anderson Cancer CenterNational Research Council (CNR)Hannover Medical SchoolItalian National Research CouncilUniversity of MuensterUniversity of MontpellierUniversity of LeipzigUniversity of GreifswaldUniversity Hospital BernSiemens HealthineersThe University of Alabama at BirminghamNational Institutes of HealthUniversity of MarburgUniversity of Paris-SaclayUniversity of LimogesUniversity of Clermont AuvergneUniversity of DortmundUniversity of GiessenKITUniversity of ToulonChildren’s Hospital of PhiladelphiaUniversity of JenaNational Taiwan University HospitalUniversity of SaarlandUniversity of ErlangenNational Cancer InstituteUniversity Hospital HeidelbergSwiss Federal Institute of Technology LausanneUniversity of Texas Health Science Center at HoustonNational Institute of Biomedical Imaging and BioengineeringUniversity of New CaledoniaUniversity of Koblenz-LandauParis Diderot UniversityUniversity of ParisInselspital, Bern University HospitalUniversity of Grenoble AlpesUniversity Hospital BaselMD Anderson Cancer CenterUniversity of AngersUniversity of French PolynesiaUniversity of MagdeburgUniversity of Geneva, SwitzerlandOulu University HospitalUniversity of ToursFriedrich-Alexander-University Erlangen-NurnbergUniversity of Rennes 1Wake Forest School of MedicineNIH Clinical CenterParis Descartes UniversityUniversity of Rouen NormandieUniversity of Aix-MarseilleUniversity of Perpignan Via DomitiaUniversity of Caen NormandieUniversity of FrankfurtUniversity of BochumUniversity of Bourgogne-Franche-ComtéUniversity of Corsica Pasquale PaoliNational Institute of Neurological Disorders and StrokeUniversity of HannoverRoche DiagnosticsUniversity of South BrittanyUniversity of DüsseldorfUniversity of Reims Champagne-ArdenneUniversity of HalleIRCCS Fondazione Santa LuciaUniversity of Applied Sciences TrierUniversity of Southampton, UKUniversity of Nice–Sophia AntipolisUniversit de LorraineUniversité Paris-Saclay["École Polytechnique Fédérale de Lausanne"]RWTH Aachen UniversityUniversity of Bern, Institute for Advanced Study in Biomedical InnovationCRIBIS University of AlbertaThe Cancer Imaging Archive (TCIA)Fraunhofer Institute for Medical Image Computing MEVISMedical School of HannoverIstituto di Ricovero e Cura a Carattere Scientifico NeuromedFondazione Santa Lucia IRCCSCEA, LIST, Laboratory of Image and Biomedical SystemsUniversity of Alberta, CanadaHeidelberg University Hospital, Department of NeuroradiologyUniversity of Bern, SwitzerlandUniversity of DresdenUniversity of SpeyerUniversity of Trier, GermanyUniversity of Lorraine, FranceUniversity of Le Havre NormandieUniversity of Bretagne OccidentaleUniversity of French GuianaUniversity of the AntillesUniversity of Bern, Institute of Surgical Technology and BiomechanicsUniversity of Bern, ARTORG Center for Biomedical Engineering ResearchUniversity of Geneva, Department of RadiologyUniversity of Zürich, Department of NeuroradiologyRuhr-University-Bochum
·
Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles disseminated across multi-parametric magnetic resonance imaging (mpMRI) scans, reflecting varying biological properties. Their heterogeneous shape, extent, and location are some of the factors that make these tumors difficult to resect, and in some cases inoperable. The amount of resected tumor is a factor also considered in longitudinal scans, when evaluating the apparent tumor for potential diagnosis of progression. Furthermore, there is mounting evidence that accurate segmentation of the various tumor sub-regions can offer the basis for quantitative image analysis towards prediction of patient overall survival. This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018. Specifically, we focus on i) evaluating segmentations of the various glioma sub-regions in pre-operative mpMRI scans, ii) assessing potential tumor progression by virtue of longitudinal growth of tumor sub-regions, beyond use of the RECIST/RANO criteria, and iii) predicting the overall survival from pre-operative mpMRI scans of patients that underwent gross total resection. Finally, we investigate the challenge of identifying the best ML algorithms for each of these tasks, considering that apart from being diverse on each instance of the challenge, the multi-institutional mpMRI BraTS dataset has also been a continuously evolving/growing dataset.
The SSSD model, developed at Oldenburg University, integrates conditional diffusion models with structured state space (S4) layers for robust time series imputation and forecasting. It achieves state-of-the-art performance, notably reducing MAE by over 50% in blackout missing ECG data and generating qualitatively meaningful signal reconstructions compared to prior methods.
301
We present a universal deep-learning method that reconstructs super-resolved images of quantum emitters from a single camera frame measurement. Trained on physics-based synthetic data spanning diverse point-spread functions, aberrations, and noise, the network generalizes across experimental conditions without system-specific retraining. We validate the approach on low- and high-density In(Ga)As quantum dots and strain-induced dots in 2D monolayer WSe2_2, resolving overlapping emitters even under low signal-to-noise and inhomogeneous backgrounds. By eliminating calibration and iterative acquisitions, this single-shot strategy enables rapid, robust super-resolution for nanoscale characterization and quantum photonic device fabrication.
The steered response power (SRP) method is one of the most popular approaches for acoustic source localization with microphone arrays. It is often based on simplifying acoustic assumptions, such as an omnidirectional sound source in the far field of the microphone array(s), free field propagation, and spatially uncorrelated noise. In reality, however, there are many acoustic scenarios where such assumptions are violated. This paper proposes a generalization of the conventional SRP method that allows to apply generic acoustic models for localization with arbitrary microphone constellations. These models may consider, for instance, level differences in distributed microphones, the directivity of sources and receivers, or acoustic shadowing effects. Moreover, also measured acoustic transfer functions may be applied as acoustic model. We show that the delay-and-sum beamforming of the conventional SRP is not optimal for localization with generic acoustic models. To this end, we propose a generalized SRP beamforming criterion that considers generic acoustic models and spatially correlated noise, and derive an optimal SRP beamformer. Furthermore, we propose and analyze appropriate frequency weightings. Unlike the conventional SRP, the proposed method can jointly exploit observed level and time differences between the microphone signals to infer the source location. Realistic simulations of three different microphone setups with speech under various noise conditions indicate that the proposed method can significantly reduce the mean localization error compared to the conventional SRP and, in particular, a reduction of more than 60% can be archived in noisy conditions.
Whole slide pathology image classification presents challenges due to gigapixel image sizes and limited annotation labels, hindering model generalization. This paper introduces a prompt learning method to adapt large vision-language models for few-shot pathology classification. We first extend the Prov-GigaPath vision foundation model, pre-trained on 1.3 billion pathology image tiles, into a vision-language model by adding adaptors and aligning it with medical text encoders via contrastive learning on 923K image-text pairs. The model is then used to extract visual features and text embeddings from few-shot annotations and fine-tunes with learnable prompt embeddings. Unlike prior methods that combine prompts with frozen features using prefix embeddings or self-attention, we propose multi-granular attention that compares interactions between learnable prompts with individual image patches and groups of them. This approach improves the model's ability to capture both fine-grained details and broader context, enhancing its recognition of complex patterns across sub-regions. To further improve accuracy, we leverage (unbalanced) optimal transport-based visual-text distance to secure model robustness by mitigating perturbations that might occur during the data augmentation process. Empirical experiments on lung, kidney, and breast pathology modalities validate the effectiveness of our approach; thereby, we surpass several of the latest competitors and consistently improve performance across diverse architectures, including CLIP, PLIP, and Prov-GigaPath integrated PLIP.
12
Lead halide perovskites have catalyzed the rise of main-group metal halide materials as promising candidates for next-generation optoelectronics, including solar cells, light-emitting diodes, lasers, sensors, and photocatalysts. Among these, effi-cient light-emission arises from self-trapped excitons, wherein excited states induce transient lattice distortions that localize excitons. However, the complex interplay of factors, such as lattice distortions, lattice softness, and electron-phonon cou-pling dynamics, obscures the direct structure-property relationships complicating the targeted material design. In this study, we advance the understanding of self-trapped exciton (STE)-based emission in hybrid antimony and bismuth halides, em-phasizing the interplay of structural and electronic factors that enhance white-light emission. We systematically vary com-position, anion dimensionality, connectivity, and the organic cation and find that the presence of Bi/Sb and Cl in edge-sharing anion motifs promotes white-light emission and optimal electron-phonon coupling. Chlorides outperform bromides, and organic cations, such as CMA and BZA, only subtly influence optical behavior by altering lattice dynamics and rigidity, resulting in tunable emission characteristics without compromising STEs. This work deepens the understanding of the emis-sion mechanisms in hybrid halide perovskites and establishes guiding principles for tailoring optoelectronic properties, paving the way for advanced materials with enhanced white-light emission for next-generation optoelectronic applications.
Separating competing speech in reverberant environments requires models that preserve spatial cues while maintaining separation efficiency. We present a Phase-aware Ear-conditioned speaker Separation network using eight microphones (PEASE-8) that consumes complex STFTs and directly introduces a raw-STFT input to the early decoder layer, bypassing the entire encoder pathway to improve reconstruction. The model is trained end-to-end with an SI-SDR-based objective against direct-path ear targets, jointly performing separation and dereverberation for two speakers in a fixed azimuth, eliminating the need for permutation invariant training. On spatialized two-speaker mixtures spanning anechoic, reverberant, and noisy conditions, PEASE-8 delivers strong separation and intelligibility. In reverberant environments, it achieves 12.37 dB SI-SDR, 0.87 STOI, and 1.86 PESQ at T60 = 0.6 s, while remaining competitive under anechoic conditions.
We construct wormholes in Einstein-scalar-Gauss-Bonnet theories with a potential for the scalar field that includes a mass term and self-interaction terms. By varying the Gauss-Bonnet coupling constant we delimit the domain of existence of wormholes in these theories. The presence of the self-interaction enlarges the domain of existence significantly. There arise wormholes with a single throat and wormholes with an equator and a double throat. We determine the physical properties of these wormholes including their mass, their size and their geometry.
A framework integrates Concept Bottleneck Models (CBMs) with a multi-agent Retrieval-Augmented Generation (RAG) system to generate interpretable radiology reports from Chest X-ray images. The system achieves 81% classification accuracy on the COVID-QU dataset and generates clinically relevant reports that significantly outperform GPT-4 and single-agent RAGs in accuracy and usefulness, as assessed by LLM judges.
5
In a generic theory of gravity coupled to matter fields, the Smarr formula for black holes does not work properly if the contributions of the coupling constants defining the theory are not incorporated. However, these couplings, such as the cosmological constant or the dimensionful parameters that appear in the Lagrangian, are fixed parameters defining the theory, and they cannot be varied. Here, we present a robust method, applicable to any covariant Lagrangian, that upgrades the role of the couplings from being constants in the theory to being free parameters of the solutions. To this end, for each one of the couplings in a theory, a pair of auxiliary scalar and gauge fields is introduced. The couplings are shown to be conserved charges of the global part of the implemented gauge symmetry. Besides, their conjugate chemical potentials are defined as the electric potential of the corresponding gauge fields on the black hole horizon. Using this method, we systematically extend the first law and the Smarr formula by coupling conserved charges and their conjugate potentials. The thermodynamics of a black hole solution in a quadratic gravity theory is given as an example.
Quasinormal modes of rapidly rotating black holes are crucial in understanding the ringdown phase after a merger. While for Kerr black holes these modes have been known for a long time, their calculation has remained a challenge in alternative theories of gravity. We obtain the spectrum of quasinormal modes of rapidly rotating black holes in Einstein-Gauss-Bonnet-dilaton theory without resorting to perturbation theory in the coupling constant. Our approach is based on a spectral decomposition of the linear perturbations of the metric and the scalar field. The quasinormal modes agree excellently with the perturbatively known slow rotation and weak coupling limits. For large coupling, though, the spectrum changes significantly.
Gravitational waves emitted by distorted black holes---such as those arising from the coalescence of two neutron stars or black holes---carry not only information about the corresponding spacetime but also about the underlying theory of gravity. Although general relativity remains the simplest, most elegant and viable theory of gravitation, there are generic and robust arguments indicating that it is not the ultimate description of the gravitational universe. Here we focus on a particularly appealing extension of general relativity, which corrects Einstein's theory through the addition of terms which are second order in curvature: the topological Gauss-Bonnet invariant coupled to a dilaton. We study gravitational-wave emission from black holes in this theory, and {\bf(i)} find strong evidence that black holes are linearly (mode) stable against both axial and polar perturbations; {\bf(ii)} discuss how the quasinormal modes of black holes can be excited during collisions involving black holes, and finally {\bf(iii)} show that future ringdown detections with large signal-to-noise ratio would improve current constraints on the coupling parameter of the theory.
The development of electronic stethoscopes and wearable recording sensors opened the door to the automated analysis of bowel sound (BS) signals. This enables a data-driven analysis of bowel sound patterns, their interrelations, and their correlation to different pathologies. This work leverages a BS dataset collected from 16 healthy subjects that was annotated according to four established BS patterns. This dataset is used to evaluate the performance of machine learning models to detect and/or classify BS patterns. The selection of considered models covers models using tabular features, convolutional neural networks based on spectrograms and models pre-trained on large audio datasets. The results highlight the clear superiority of pre-trained models, particularly in detecting classes with few samples, achieving an AUC of 0.89 in distinguishing BS from non-BS using a HuBERT model and an AUC of 0.89 in differentiating bowel sound patterns using a Wav2Vec 2.0 model. These results pave the way for an improved understanding of bowel sounds in general and future machine-learning-driven diagnostic applications for gastrointestinal examinations
Advancements in generative Artificial Intelligence (AI) hold great promise for automating radiology workflows, yet challenges in interpretability and reliability hinder clinical adoption. This paper presents an automated radiology report generation framework that combines Concept Bottleneck Models (CBMs) with a Multi-Agent Retrieval-Augmented Generation (RAG) system to bridge AI performance with clinical explainability. CBMs map chest X-ray features to human-understandable clinical concepts, enabling transparent disease classification. Meanwhile, the RAG system integrates multi-agent collaboration and external knowledge to produce contextually rich, evidence-based reports. Our demonstration showcases the system's ability to deliver interpretable predictions, mitigate hallucinations, and generate high-quality, tailored reports with an interactive interface addressing accuracy, trust, and usability challenges. This framework provides a pathway to improving diagnostic consistency and empowering radiologists with actionable insights.
6
Clustering is a fundamental data processing task used for grouping records based on one or more features. In the vertically partitioned setting, data is distributed among entities, with each holding only a subset of those features. A key challenge in this scenario is that computing distances between records requires access to all distributed features, which may be privacy-sensitive and cannot be directly shared with other parties. The goal is to compute the joint clusters while preserving the privacy of each entity's dataset. Existing solutions using secret sharing or garbled circuits implement privacy-preserving variants of Lloyd's algorithm but incur high communication costs, scaling as O(nkt), where n is the number of data points, k the number of clusters, and t the number of rounds. These methods become impractical for large datasets or several parties, limiting their use to LAN settings only. On the other hand, a different line of solutions rely on differential privacy (DP) to outsource the local features of the parties to a central server. However, they often significantly degrade the utility of the clustering outcome due to excessive noise. In this work, we propose a novel solution based on homomorphic encryption and DP, reducing communication complexity to O(n+kt). In our method, parties securely outsource their features once, allowing a computing party to perform clustering operations under encryption. DP is applied only to the clusters' centroids, ensuring privacy with minimal impact on utility. Our solution clusters 100,000 two-dimensional points into five clusters using only 73MB of communication, compared to 101GB for existing works, and completes in just under 3 minutes on a 100Mbps network, whereas existing works take over 1 day. This makes our solution practical even for WAN deployments, all while maintaining accuracy comparable to plaintext k-means algorithms.
3
This paper presents an introduction to the stochastic concepts of \emph{coupling} and \emph{copula}. Coupling means the construction of a joint distribution of two or more random variables that need not be defined on one and the same probability space, whereas a copula is a function that joins a multivariate distribution to its one-dimensional margins. Their role in stochastic modeling is illustrated by examples from multisensory perception. Pointers to more advanced and recent treatments are provided.
The visibility of two-photon interference is limited by the indistinguishability of the photons. In the cascaded emission of a three-level system, such as a single quantum dot, the indistinguishability of each photon in the pair is primarily affected by two main factors: the temporal correlation between paired photons and dephasing. Investigating the individual effects of these factors on photon indistinguishability is challenging, as both factors affect it simultaneously. In this study, we investigate the temperature-dependent two-photon interference visibility of the biexciton and exciton photons emitted from a single quantum dot under two-photon resonant excitation, while keeping temporal correlation between the paired photons intact. Finally, we simultaneously extract the coherence times of the biexciton and exciton photons as a function of temperature.
Monitoring urban structure and development requires high-quality data at high spatiotemporal resolution. While traditional censuses have provided foundational insights into demographic and socioeconomic aspects of urban life, their pace may not always align with the pace of urban development. To complement these traditional methods, we explore the potential of analyzing alternative big-data sources, such as human mobility data. However, these often noisy and unstructured big data pose new challenges. Here we propose a method to extract meaningful explanatory variables and classifications from such data. Using movement data from Beijing, which are produced as a byproduct of mobile communication, we show that meaningful features can be extracted, revealing, for example, the emergence and absorption of subcentres. This method allows the analysis of urban dynamics at a high spatial resolution (here, 500m) and near real-time frequency, and high computational efficiency, which is especially suitable for tracing event-driven mobility changes and their impact on urban structures.
This study develops a capacity expansion model for a fully decarbonized European electricity system using an Adaptive Robust Optimization (ARO) framework. The model endogenously identifies the worst regional Dunkelflaute events, prolonged periods of low wind and solar availability, and incorporates multiple extreme weather realizations within a single optimization run. Results show that system costs rise nonlinearly with the geographic extent of these events: a single worst-case regional disruption increases costs by 9%, but broader disruptions across multiple regions lead to much sharper increases, up to 51%. As Dunkelflaute conditions extend across most of Europe, additional cost impacts level off, with a maximum increase of 71%. The optimal technology mix evolves with the severity of weather stress: while renewables, batteries, and interregional transmission are sufficient to manage localized events, large-scale disruptions require long-term hydrogen storage and load shedding to maintain system resilience. Central European regions, especially Germany and France, emerge as systemic bottlenecks, while peripheral regions bear the cost of compensatory overbuilding. These findings underscore the need for a coordinated European policy strategy that goes beyond national planning to support cross-border infrastructure investment, scale up flexible technologies such as long-duration storage, and promote a geographically balanced deployment of renewables to mitigate systemic risks associated with Dunkelflaute events.
There are no more papers matching your filters at the moment.