Institute of PhysicsYerevan State University
Exploring the Fundamental Properties of Neutrino from Oscillation Experiments
As physicists pursue precision neutrino measurements, complementary experiments covering varied oscillation landscapes have become essential for resolving current tensions in global fits. This thesis presents projected sensitivities and forecasted performance of two next-generation long-baseline experiments: DUNE and T2HK, through detailed simulations addressing fundamental questions including neutrino mass ordering, leptonic CP violation, and the octant of θ23\theta_{23}. We demonstrate through simulated analyses that while each experiment alone faces inherent degeneracies, their complementary features enable breakthrough projected sensitivities in both standard oscillation parameter measurements and forecasted searches for new physics beyond the Standard Model. The combined simulation results reveal that DUNE-T2HK synergy will be crucial for achieving a comprehensive understanding of neutrino properties in the coming decade.
View blog
Resources
BARTSmiles: Generative Masked Language Models for Molecular Representations
We discover a robust self-supervised strategy tailored towards molecular representations for generative masked language models through a series of tailored, in-depth ablations. Using this pre-training strategy, we train BARTSmiles, a BART-like model with an order of magnitude more compute than previous self-supervised molecular representations. In-depth evaluations show that BARTSmiles consistently outperforms other self-supervised representations across classification, regression, and generation tasks setting a new state-of-the-art on 11 tasks. We then quantitatively show that when applied to the molecular domain, the BART objective learns representations that implicitly encode our downstream tasks of interest. For example, by selecting seven neurons from a frozen BARTSmiles, we can obtain a model having performance within two percentage points of the full fine-tuned model on task Clintox. Lastly, we show that standard attribution interpretability methods, when applied to BARTSmiles, highlight certain substructures that chemists use to explain specific properties of molecules. The code and the pretrained model are publicly available.
View blog
Resources32
Small Molecule Optimization with Large Language Models
Recent advancements in large language models have opened new possibilities for generative molecular drug design. We present Chemlactica and Chemma, two language models fine-tuned on a novel corpus of 110M molecules with computed properties, totaling 40B tokens. These models demonstrate strong performance in generating molecules with specified properties and predicting new molecular characteristics from limited samples. We introduce a novel optimization algorithm that leverages our language models to optimize molecules for arbitrary properties given limited access to a black box oracle. Our approach combines ideas from genetic algorithms, rejection sampling, and prompt optimization. It achieves state-of-the-art performance on multiple molecular optimization benchmarks, including an 8% improvement on Practical Molecular Optimization compared to previous methods. We publicly release the training corpus, the language models and the optimization algorithm.
View blog
Resources
Deconfined Quantum Critical Point in Quantum Hall Bilayers
Deconfined quantum critical points (DQCPs) represent an unconventional class of quantum criticality beyond the Landau-Ginzburg-Wilson-Fisher paradigm. Nevertheless, both their theoretical identification and experimental realization remain challenging. Here we report compelling evidence of a DQCP in quantum Hall bilayers with half-filled n=2n=2 Landau levels in each layer, based on large-scale variational uniform matrix product state (VUMPS) simulations and exact diagonalization (ED). By systematically analyzing the ground-state fidelity, low-lying energy spectra, exciton superfluid and stripe order parameters, and ground-state energy derivatives, we identify a direct and continuous quantum phase transition between two distinct symmetry-breaking phases by tuning the layer separation: an exciton superfluid phase with spontaneous U(1)U(1) symmetry breaking at small separation, and a unidirectional charge density wave with broken translational symmetry at large separation. Our results highlight quantum Hall bilayers as an ideal platform for realizing and experimentally probing DQCPs under precisely tunable interactions.
View blog
Resources
Towards Timelike Singularity via AdS Dual
It is well known that Kasner geometry with space-like singularity can be extended to bulk AdS-like geometry, furthermore one can study field theory on this Kasner space via its gravity dual. In this paper, we show that there exists a Kasner-like geometry with timelike singularity for which one can construct a dual gravity description. We then study various extremal surfaces including space-like geodesics in the dual gravity description. Finally, we compute correlators of highly massive operators in the boundary field theory with a geodesic approximation.
View blog
Resources
Microquasars as the major contributors to Galactic cosmic rays around the "knee"
Recently, LHAASO detected a gamma-ray emission extending beyond 100TeV100\,\rm{TeV} from 4 sources associated to powerful microquasars. We propose that such sources are the main Galactic PeVatrons and investigate their contribution to the proton and gamma-ray fluxes by modeling their entire population. We find that the presence of only 10\sim10 active powerful microquasars in the Galaxy at any given time is sufficient to account for the proton flux around the knee and to provide a very good explanation of cosmic-ray and gamma-ray data in a self-consistent picture. The 10TeV10\,\rm{TeV} bump and the 300TeV300\,\rm{TeV} hardening in the cosmic-ray spectrum naturally appear, and the diffuse background measured by LHAASO above a few tens of TeV\rm{TeV} is accounted for. This supports the paradigm in which cosmic rays around the knee are predominantly accelerated in a very limited number of powerful microquasars.
View blog
Resources
XDXD: End-to-end crystal structure determination with low resolution X-ray diffraction
Determining crystal structures from X-ray diffraction data is fundamental across diverse scientific fields, yet remains a significant challenge when data is limited to low resolution. While recent deep learning models have made breakthroughs in solving the crystallographic phase problem, the resulting low-resolution electron density maps are often ambiguous and difficult to interpret. To overcome this critical bottleneck, we introduce XDXD, to our knowledge, the first end-to-end deep learning framework to determine a complete atomic model directly from low-resolution single-crystal X-ray diffraction data. Our diffusion-based generative model bypasses the need for manual map interpretation, producing chemically plausible crystal structures conditioned on the diffraction pattern. We demonstrate that XDXD achieves a 70.4\% match rate for structures with data limited to 2.0~Å resolution, with a root-mean-square error (RMSE) below 0.05. Evaluated on a benchmark of 24,000 experimental structures, our model proves to be robust and accurate. Furthermore, a case study on small peptides highlights the model's potential for extension to more complex systems, paving the way for automated structure solution in previously intractable cases.
View blog
Resources
Entanglement Growth from Entangled States: A Unified Perspective on Entanglement Generation and Transport
Studies of entanglement dynamics in quantum many-body systems have focused largely on initial product states. Here, we investigate the far richer dynamics from initial entangled states, uncovering universal patterns across diverse systems ranging from many-body localization (MBL) to random quantum circuits. Our central finding is that the growth of entanglement entropy can exhibit a non-monotonic dependence on the initial entanglement in many non-ergodic systems, peaking for moderately entangled initial states. To understand this phenomenon, we introduce a conceptual framework that decomposes entanglement growth into two mechanisms: ``build'' and ``move''. The ``build'' mechanism creates new entanglement, while the ``move'' mechanism redistributes pre-existing entanglement throughout the system. We model a pure ``move'' dynamics with a random SWAP circuit, showing it uniformly distributes entanglement across all bipartitions. We find that MBL dynamics are ``move-dominated'', which naturally explains the observed non-monotonicity of the entanglement growth. This ``build-move'' framework offers a unified perspective for classifying diverse physical dynamics, deepening our understanding of entanglement propagation and information processing in quantum many-body systems.
View blog
Resources
The first proper motion measurement of the acceleration regions in the large-scale jets of SS 433 powering the W50 nebula

Naomi Tsuji et al. conducted the first direct proper motion measurements of X-ray knots in the large-scale jets of the SS 433 microquasar, finding these acceleration regions to be largely stationary. This observational constraint supports the presence of standing recollimation shocks and implies highly efficient particle acceleration to PeV energies.

View blog
Resources
WARP: Word-level Adversarial ReProgramming
Transfer learning from pretrained language models recently became the dominant approach for solving many NLP tasks. A common approach to transfer learning for multiple tasks that maximize parameter sharing trains one or more task-specific layers on top of the language model. In this paper, we present an alternative approach based on adversarial reprogramming, which extends earlier work on automatic prompt generation. Adversarial reprogramming attempts to learn task-specific word embeddings that, when concatenated to the input text, instruct the language model to solve the specified task. Using up to 25K trainable parameters per task, this approach outperforms all existing methods with up to 25M trainable parameters on the public leaderboard of the GLUE benchmark. Our method, initialized with task-specific human-readable prompts, also works in a few-shot setting, outperforming GPT-3 on two SuperGLUE tasks with just 32 training samples.
View blog
Resources83
Enhancing Galaxy Classification with U-Net Variational Autoencoders for Image Denoising
AI-enhanced approaches are becoming common in astronomical data analysis, including in the galaxy morphological classification. In this study we develop an approach that enhances galaxy classification by incorporating an image denoising pre-processing step, utilizing the U-Net Variational Autoencoder (VAE) architecture and effectively mitigating noise in galaxy images and leading to improved classification performance. Our methodology involves training U-Net VAEs on the EFIGI dataset. To simulate realistic observational conditions, we introduce artifacts such as projected stars, satellite trails, and diffraction patterns into clean galaxy images. The denoised images generated are evaluated using Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM), to quantify the quality improvements. We utilize the denoised images for galaxy classification tasks using models such as DenseNet-201, ResNet50, VGG16 and GCNN. Simulations do reveal that, the models trained on denoised images consistently outperform those trained on noisy images, thus demonstrating the efficiency of the used denoising procedure. The developed approach can be used for other astronomical datasets, via refining the VAE architecture and integrating additional pre-processing strategies, e.g. in revealing of gravitational lenses, cosmic web structures.
View blog
Resources
Noncompact Symmetries in String Theory
Noncompact groups, similar to those that appeared in various supergravity theories in the 1970's, have been turning up in recent studies of string theory. First it was discovered that moduli spaces of toroidal compactification are given by noncompact groups modded out by their maximal compact subgroups and discrete duality groups. Then it was found that many other moduli spaces have analogous descriptions. More recently, noncompact group symmetries have turned up in effective actions used to study string cosmology and other classical configurations. This paper explores these noncompact groups in the case of toroidal compactification both from the viewpoint of low-energy effective field theory, using the method of dimensional reduction, and from the viewpoint of the string theory world sheet. The conclusion is that all these symmetries are intimately related. In particular, we find that Chern--Simons terms in the three-form field strength HμνρH_{\mu\nu\rho} play a crucial role.
View blog
Resources
Zero resistance when metals mixed with insulators
A false zero resistance behavior was observed during our study on the search of superconductivity in Ge-doped GaNb4Se8. This zero resistance was proved to be caused by open-circuit in multi-phase samples comprised of metals and insulators by measuring with four-probe method. The evidence strongly suggests that the reported superconductivity in hydrides should be carefully re-checked.
View blog
Resources
GradSkip: Communication-Accelerated Local Gradient Methods with Better Computational Complexity
09 Jun 2025
We study a class of distributed optimization algorithms that aim to alleviate high communication costs by allowing clients to perform multiple local gradient-type training steps before communication. In a recent breakthrough, Mishchenko et al. (2022) proved that local training, when properly executed, leads to provable communication acceleration, and this holds in the strongly convex regime without relying on any data similarity assumptions. However, their ProxSkip method requires all clients to take the same number of local training steps in each communication round. We propose a redesign of the ProxSkip method, allowing clients with ``less important'' data to get away with fewer local training steps without impacting the overall communication complexity of the method. In particular, we prove that our modified method, GradSkip, converges linearly under the same assumptions and has the same accelerated communication complexity, while the number of local gradient steps can be reduced relative to a local condition number. We further generalize our method by extending the randomness of probabilistic alternations to arbitrary unbiased compression operators and by considering a generic proximable regularizer. This generalization, which we call GradSkip+, recovers several related methods in the literature as special cases. Finally, we present an empirical study on carefully designed toy problems that confirm our theoretical claims.
View blog
Resources
On the Fractional Electric Charge of a Magnetic Monopole at Nonzero Temperature
We extend recent discussions about the effect of nonzero temperature on the induced electric charge, due to CP violation, of a Dirac or an 't Hooft-Polyakov monopole. In particular, we determine the fractional electric charge of a very small 't Hooft-Polyakov monopole coupled to light fermions at nonzero temperature. If dyons with fractional electric charge exist in the Weinberg-Salam model, as recently suggested in the literature, then their charge too should be temperature dependent.
View blog
Resources
Probing the features of electron dispersion by tunneling between slightly twisted bilayer graphene sheets
Tunneling conductance between two bilayer graphene (BLG) sheets separated by 2 nm-thick insulating barrier was measured in two devices with the twist angles between BLGs less than 1°. At small bias voltages, the tunneling occurs with conservation of energy and momentum at the points of intersection between two relatively shifted Fermi circles. Here, we experimentally found and theoretically described signatures of electron-hole asymmetric band structure of BLG: since holes are heavier, the tunneling conductance is enhanced at the hole doping due to the higher density of states. Another key feature of BLG that we explore is gap opening in a vertical electric field with a strong polarization of electron wave function at van Hove singularities near the gap edges. This polarization, by shifting electron wave function in one BLG closer to or father from the other BLG, gives rise to asymmetric tunneling resonances in the conductance around charge neutrality points, which result in strong sensitivity of the tunneling current to minor changes of the gate voltages. The observed phenomena are reproduced by our theoretical model taking into account electrostatics of the dual-gated structure, quantum capacitance effects, and self-consistent gap openings in both BLGs.
View blog
Resources
Quantum theory phase space foundations

The paper comprehensively reviews the phase space foundations of quantum theory, detailing the interrelations of Wigner, Husimi, and Glauber-Sudarshan quasi-probability distributions. It then applies this framework to analytically determine the Husimi quasi-probability function for the output state of a linear quantum amplifier, precisely accounting for operator ordering.

View blog
Resources
Convergence of Physics-Informed Neural Networks for Fully Nonlinear PDE's
The present work is focused on exploring convergence of Physics-informed Neural Networks (PINNs) when applied to a specific class of second-order fully nonlinear Partial Differential Equations (PDEs). It is well-known that as the number of data grows, PINNs generate a sequence of minimizers which correspond to a sequence of neural networks. We show that such sequence converges to a unique viscosity solution of a certain class of second-order fully nonlinear PDE's, provided the latter satisfies the comparison principle in the viscosity sense.
View blog
Resources
High-harmonic spectroscopy of mobility edges in one-dimensional quasicrystals
Quasicrystals occupy a unique position between periodic and disordered systems, where localization phenomena such as Anderson transitions and mobility edges can emerge even in the absence of disorder. This distinctive behavior motivates the development of robust, all-optical diagnostic tools capable of probing the structural, topological, and dynamical properties of such systems. In this work, focusing on generalized Aubry-André-Harper models and on an incommensurate potential in the continuum limit, we demonstrate that high-harmonic generation phenomenon serves as a powerful probe of localization transitions and mobility edges in quasicrystals. We introduce a new parameter--dipole mobility--which captures the impact of intraband dipole transitions and enables classification of nonlinear optical regimes, where excitation and high-harmonic generation yield can differ by orders of magnitude. We show that the cutoff frequency of harmonics is strongly influenced by the position of the mobility edge, providing a robust and experimentally accessible signature of localization transitions in quasicrystals.
View blog
Resources
Analyzing Local Representations of Self-supervised Vision Transformers
In this paper, we present a comparative analysis of various self-supervised Vision Transformers (ViTs), focusing on their local representative power. Inspired by large language models, we examine the abilities of ViTs to perform various computer vision tasks with little to no fine-tuning. We design evaluation framework to analyze the quality of local, i.e.\ patch-level, representations in the context of few-shot semantic segmentation, instance identification, object retrieval and tracking. We discover that contrastive learning based methods like DINO produce more universal patch representations that can be immediately applied for downstream tasks with no parameter tuning, compared to masked image modeling. The embeddings learned using the latter approach, e.g. in masked autoencoders, have high variance features that harm distance-based algorithms, such as k-NN, and do not contain useful information for most downstream tasks. Furthermore, we demonstrate that removing these high-variance features enhances k-NN for MAE, as well as for its recent extension Scale-MAE. Finally, we find an object instance retrieval setting where DINOv2, a model pretrained on two orders of magnitude more data, falls short of its less compute intensive counterpart DINO.
View blog
Resources
There are no more papers matching your filters at the moment.