Tunghai University
Accurate and efficient question-answering systems are essential for delivering high-quality patient care in the medical field. While Large Language Models (LLMs) have made remarkable strides across various domains, they continue to face significant challenges in medical question answering, particularly in understanding domain-specific terminologies and performing complex reasoning. These limitations undermine their effectiveness in critical medical applications. To address these issues, we propose a novel approach incorporating similar case generation within a multi-agent medical question-answering (MedQA) system. Specifically, we leverage the Llama3.1:70B model, a state-of-the-art LLM, in a multi-agent architecture to enhance performance on the MedQA dataset using zero-shot learning. Our method capitalizes on the model's inherent medical knowledge and reasoning capabilities, eliminating the need for additional training data. Experimental results show substantial performance gains over existing benchmark models, with improvements of 7% in both accuracy and F1-score across various medical QA tasks. Furthermore, we examine the model's interpretability and reliability in addressing complex medical queries. This research not only offers a robust solution for medical question answering but also establishes a foundation for broader applications of LLMs in the medical domain.
The wide spread of rumors on social media has caused a negative impact on people's daily life, leading to potential panic, fear, and mental health problems for the public. How to debunk rumors as early as possible remains a challenging problem. Existing studies mainly leverage information propagation structure to detect rumors, while very few works focus on correlation among users that they may coordinate to spread rumors in order to gain large popularity. In this paper, we propose a new detection model, that jointly learns both the representations of user correlation and information propagation to detect rumors on social media. Specifically, we leverage graph neural networks to learn the representations of user correlation from a bipartite graph that describes the correlations between users and source tweets, and the representations of information propagation with a tree structure. Then we combine the learned representations from these two modules to classify the rumors. Since malicious users intend to subvert our model after deployment, we further develop a greedy attack scheme to analyze the cost of three adversarial attacks: graph attack, comment attack, and joint attack. Evaluation results on two public datasets illustrate that the proposed MODEL outperforms the state-of-the-art rumor detection models. We also demonstrate our method performs well for early rumor detection. Moreover, the proposed detection method is more robust to adversarial attacks compared to the best existing method. Importantly, we show that it requires a high cost for attackers to subvert user correlation pattern, demonstrating the importance of considering user correlation for rumor detection.
Image captioning strives to generate pertinent captions for specified images, situating itself at the crossroads of Computer Vision (CV) and Natural Language Processing (NLP). This endeavor is of paramount importance with far-reaching applications in recommendation systems, news outlets, social media, and beyond. Particularly within the realm of news reporting, captions are expected to encompass detailed information, such as the identities of celebrities captured in the images. However, much of the existing body of work primarily centers around understanding scenes and actions. In this paper, we explore the realm of image captioning specifically tailored for celebrity photographs, illustrating its broad potential for enhancing news industry practices. This exploration aims to augment automated news content generation, thereby facilitating a more nuanced dissemination of information. Our endeavor shows a broader horizon, enriching the narrative in news reporting through a more intuitive image captioning framework.
We systematically investigate the topological properties of spin polarized Rydberg-dressed fermionic atoms loaded in a bilayer optical lattice. Through tuning the Rydberg coupling strength and the inter-layer tunneling amplitude, we identify different types of topological superfluid states generated from the inter-layer pairing and relative gauge phase modulation of the couples 2D pp-wave superfluids. These phases includes gapped/gapless with/without time reversal symmetry. One of the most interesting states is a gapless paired topological superfluid with both the time-reversal symmetry and particle-hole symmetry. This state is equivalent to a topological Kondo lattice model with the spin-orbit coupling, an in-plane magnetic field, and an additional particle-hole symmetry. The flexibility of experimental manipulation in such Rydberg-dressed ferminoic systems therefore becomes a promising system for realizing interesting topological superfluids.
The notorious sign problem severely limits the applicability of quantum Monte Carlo (QMC) simulations, as statistical errors grow exponentially with system size and inverse temperature. A recent proposal of a quantum-computing stochastic series expansion (qc-SSE) method suggested that the problem could be avoided by introducing constant energy shifts into the Hamiltonian. Here we critically examine this framework and show that it does not strictly resolve the sign problem for Hamiltonians with non-commuting terms. Instead, it provides a practical mitigation strategy that suppresses the occurrence of negative weights. Using the antiferromagnetic anisotropic XY chain as a test case, we analyze the dependence of the average sign on system size, temperature, anisotropy, and shift parameters. An operator contraction method is introduced to improve efficiency. Our results demonstrate that moderate shifts optimally balance sign mitigation and statistical accuracy, while large shifts amplify errors, leaving the sign problem unresolved but alleviated.
We study the quantum point contact between the topological superconductor and the helical Luttinger liquid. The effects of the electron-electron interactions in the helical Luttinger liquid on the low-energy physics of this system are analyzed by the renormalization group. Among the various couplings at the point contact which arise from the tunneling via the Majorana edge channel, the induced backscattering in the helical Luttinger liquid is the most relevant for repulsive interactions. Hence, at low temperatures, the helical Luttinger liquid is effectively cut into two separated half wires. As a result, the low-temperature physics is described by a fixed point consisting of two leads coupled to the topological superconductor, and the electrical transport properties through the point contact at low temperature and low bias are dominated by the tunneling via the Majorana edge channel. We compute the temperature dependence of the zero-bias tunneling conductance and study the full counting statistics for the tunneling current at zero temperature.
The digital media, identified as computational propaganda provides a pathway for propaganda to expand its reach without limit. State-backed propaganda aims to shape the audiences' cognition toward entities in favor of a certain political party or authority. Furthermore, it has become part of modern information warfare used in order to gain an advantage over opponents. Most of the current studies focus on using machine learning, quantitative, and qualitative methods to distinguish if a certain piece of information on social media is propaganda. Mainly conducted on English content, but very little research addresses Chinese Mandarin content. From propaganda detection, we want to go one step further to provide more fine-grained information on propaganda techniques that are applied. In this research, we aim to bridge the information gap by providing a multi-labeled propaganda techniques dataset in Mandarin based on a state-backed information operation dataset provided by Twitter. In addition to presenting the dataset, we apply a multi-label text classification using fine-tuned BERT. Potentially this could help future research in detecting state-backed propaganda online especially in a cross-lingual context and cross platforms identity consolidation.
19
We investigate the effects of long-ranged Coulomb interactions in a tilted Dirac semimetal in two dimensions by using the perturbative renormalization-group method. Depending on the magnitude of the tilting parameter, the undoped system can have either Fermi points (type-I) or Fermi lines (type-II). Previous studies usually performed the renormalization-group transformations by integrating out the modes with large momenta. This is problematic when the Fermi surface is open, like type-II Dirac fermions. In this work, we study the effects of Coulomb interactions, following the spirit of Shankar\cite{Shankar}, by introducing a cutoff in the energy scale around the Fermi surface and integrating out the high-energy modes. For type-I Dirac fermions, our result is consistent with that of the previous work. On the other hand, we find that for type-II Dirac fermions, the magnitude of the tilting parameter increases monotonically with lowering energies. This implies the stability of type-II Dirac fermions in the presence of Coulomb interactions, in contrast with previous results. Furthermore, for type-II Dirac fermions, the velocities in different directions acquire different renormalization even if they have the same bare values. By taking into account the renormalization of the tilting parameter and the velocities due to the Coulomb interactions, we show that while the presence of a charged impurity leads only to charge redistribution around the impurity for type-I Dirac fermions, for type-II Dirac fermions, the impurity charge is completely screened, albeit with a very long screening length. The latter indicates that the temperature dependence of physical observables are essentially determined by the RG equations we derived. We illustrate this by calculating the temperature dependence of the compressibility and specific heat of the interacting tilted Dirac fermions.
We consider the thermalization hypothesis of pure states in quantum Ising chain with Z2Z_2 symmetry, XXZ chain with U(1)U(1) symmetry, and XXX chain with SU(2)SU(2) symmetries. Two kinds of pure states are considered: the energy eigenstates and the typical states evolved unitarily from the random product states for a long enough period. We further group the typical states by their expectation values of the conserved charges and consider the fine-grained thermalization hypothesis. We compare the locally (subsystem) reduced states of typical states/eigenstates with the ones of the corresponding thermal ensemble states. Besides the usual thermal ensembles such as the (micro-)canonical ensemble without conserved charges and the generalized Gibbs ensemble (GGE) with all conserved charges included, we also consider the so-called partial-GGEs (p-GGEs), which include only part of the conserved charges in the thermal ensemble. Moreover, in the framework of p-GGE, the Hamiltonian and other conserved charges are on an equal footing. The introduction of p-GGEs extends quantum thermalization to a more general scope. The validity of the subsystem thermalization hypothesis can be quantified by the smallness of the relative entropy of the reduced states obtained from the GGE/p-GGE and the typical states/eigenstates. We examine the validity of the thermalization hypothesis by numerically studying the relative entropy demographics. We show that the thermalization hypothesis holds generically for the small enough subsystems for various p-GGEs. Thus, our framework extends the universality of quantum thermalization.
In this paper, we consider a generalized model of 2×22\times 2 Keller-Segel system with nonlinear chemical gradient and small cell diffusion. The existence of the traveling pulses for such equations is established by the methods of geometric singular perturbation (GSP in short) and trapping regions from dynamical systems theory. By the technique of GSP, we show that the necessary condition for the existence of traveling pulses is that their limiting profiles with vanishing diffusion can only consist of the slow flows on the critical manifold of the corresponding algebraic-differential system. We also consider the linear instability of these pulses by the spectral analysis of the linearized operators.
Unique zero thermal expansion (ZTE) materials are valuable for use in precision instruments, including electronics, aerospace parts, and engines. However, most ZTE materials have a temperature range less than 1000 K under which they do not expand. In this study, we present a uniaxial ZTE in the low-cost Mn2OBO3 with a thermal expansion coefficient of α\alpha= -4(1)×\times10^(-7) K-1 along the X1 principal axis from 3.5 to 1250 K. The monoclinic structure of Mn2OBO3 remains stable over the entire temperature range in ambient conditions. Considerable thermal contraction on the BO3 trigonal planar and thermal expansion on the MnO6 octahedra combine to produce uniaxial ZTE. Temperature-dependent Raman scattering reveals anharmonic low-frequency modes associated with MnO6 Rigid Unit Modes (RUMs), which likely play a critical role in driving thermal contraction in the BO3 trigonal planar. No charge order-disorder transition, which could cause thermal contraction, was observed up to 1250 K.
The investigation of nonequilibrium thermodynamics in quantum many-body systems underscores the importance of quantum work, which differs from its classical counterpart due to its statistical nature. Recent studies have shown that quantum work can serve as an effective indicator of quantum phase transitions in systems subjected to sudden quenches. However, the potential of quantum work to identify thermal phase transitions remains largely unexplored. In this paper, we examine several types of thermal phase transitions in a sudden-quench hard-core boson model, including Ising, three-state Potts, and Berezinskii-Kosterlitz-Thouless transitions. Through finite-size scaling analysis, we conclude that work statistics can also characterize the critical behaviors of thermal phase transitions in generic many-body systems. Our investigation paves the way for applying work statistics to characterize critical behavior in many-body systems, with implications that may extend to broader contexts.
A positive margin may result in an increased risk of local recurrences after breast retention surgery for any malignant tumour. In order to reduce the number of positive margins would offer surgeon real-time intra-operative information on the presence of positive resection margins. This study aims to design an intra-operative tumour margin evaluation scheme by using specimen mammography in breast-conserving surgery. Total of 30 cases were evaluated and compared with the manually determined contours by experienced physicians and pathology report. The proposed method utilizes image thresholding to extract regions of interest and then performs a deep learning model, i.e. SegNet, to segment tumour tissue. The margin width of normal tissues surrounding it is evaluated as the result. The desired size of margin around the tumor was set for 10 mm. The smallest average difference to manual sketched margin (6.53 mm +- 5.84). In the all case, the SegNet architecture was utilized to obtain tissue specimen boundary and tumor contour, respectively. The simulation results indicated that this technology is helpful in discriminating positive from negative margins in the intra-operative setting. The aim of proposed scheme was a potential procedure in the intra-operative measurement system. The experimental results reveal that deep learning techniques can draw results that are consistent with pathology reports.
Capital allocation is a procedure for quantifying the contribution of each source of risk to aggregated risk. The gradient allocation rule, also known as the Euler principle, is a prevalent rule of capital allocation under which the allocated capital captures the diversification benefit of the marginal risk as a component of overall risk. This research concentrates on Expected Shortfall (ES) as a regulatory standard and focuses on the gradient allocations of ES, also called ES contributions (ESCs). We present the comprehensive treatment of backtesting the tuple of ESCs in the framework of the traditional and comparative backtests based on the concepts of joint identifiability and multi-objective elicitability. For robust forecast evaluation against the choice of scoring function, we also extend the Murphy diagram, a graphical tool to check whether one forecast dominates another under a class of scoring functions, to the case of ESCs. Finally, leveraging the recent concept of multi-objective elicitability, we propose a novel semiparametric model for forecasting dynamic ESCs based on a compositional regression model. In an empirical analysis of stock returns we evaluate and compare a variety of models for forecasting dynamic ESCs and demonstrate the outstanding performance of the proposed model.
In this paper, we revisit the effect of flat bands on the Ruderman-Kittel-Kasuya-Yosida (RKKY) interaction by using a coordinate transformation that detangles flat-band states from dispersive ones. Under this transformation, original flat-band systems containing magnetic impurities are mapped onto a generalized Fano-Anderson model, where flat-band states act as Fano defects. From this perspective, several features of exact RKKY couplings calculated numerically can be understood easily. As an illustrative example, we analyze a dimerized diamond chain model, which can exhibit either gapped or gapless spectra depending on the ratio of hopping integrals. We find that anomalous decay in the RKKY couplings arises exclusively in the gapless case and with specific magnetic coupling configurations. Furthermore, the conventional wisdom regarding the signs of RKKY interactions breaks down under certain conditions. These subtleties arising from flat bands find explanation within our present approach. Our investigation offers deeper insights into how flat bands influence carrier-mediated exchange interactions, with implications extending to broader contexts.
In this work, charge transport (CT) properties of the p53 gene are numerically studied by the transfer matrix method, and using either single or double strand effective tight-binding models. A statistical analysis of the consequences of known p53 point mutations on CT features is performed. It is found that in contrast to other kind of mutation defects, cancerous mutations result in much weaker changes of CT efficiency. Given the envisioned role played by CT in the DNA-repairing mechanism, our theoretical results suggest an underlying physical explanation at the origin of carcinogenesis.
The power-Lanczos (PL) method is one kind of Green's function Monte-Carlo simulation, which is improved by Lanczos iterations. The ground state energies of strongly-correlated models can be evaluated by this method quite accurately. In this report, the boundary of phase separation (PS) of the two-dimensional tJt-J model is investigated by the power-Lanczos method and Maxwell construction. The energies are compared with the results evaluated by other methods. Our conclusion is that there is no phase separation for J/t0.4J/t \le 0.4.
It is known that a trained Restricted Boltzmann Machine (RBM) on the binary Monte Carlo Ising spin configurations, generates a series of iterative reconstructed spin configurations which spontaneously flow and stabilize to the critical point of physical system. Here we construct a variety of Neural Network (NN) flows using the RBM and (variational) autoencoders, to study the q-state Potts and clock models on the square lattice for q = 2, 3, 4. The NN are trained on Monte Carlo spin configurations at various temperatures. We find that the trained NN flow does develop a stable point that coincides with critical point of the q-state spin models. The behavior of the NN flow is nontrivial and generative, since the training is unsupervised and without any prior knowledge about the critical point and the Hamiltonian of the underlying spin model. Moreover, we find that the convergence of the flow is independent of the types of NNs and spin models, hinting a universal behavior. Our results strengthen the potential applicability of the notion of the NN flow in studying various states of matter and offer additional evidence on the connection with the Renormalization Group flow.
In the present paper, the mean of Lyapunov exponents for the Azbel-Hofstadter model on the triangular lattice is calculated. It is recently proposed that [Phys. Rev. Lett. {\bf 85}, 4920 (2000)], for the case of the square lattice, this quantity can be related to the logarithm of the partition function of the two dimensional Ising model and has a connection to the asymptotic bandwidth. We find that the correspondence of this quantity to the logarithm of the partition function of the two dimensional Ising model is not complete for the triangular lattice. Moreover, the detailed connection between this quantity and the asymptotic bandwidth is not valid. Thus the conclusions for the mean of Lyapunov exponents suggested previously depend on the lattice geometry.
We propose a unified scheme to identify phase transitions out of the Z2\mathbb{Z}_2 Abelian topological order, including the transition to a non-Abelian chiral spin liquid. Using loop gas and and string gas states [H.-Y. Lee, R. Kaneko, T. Okubo, N. Kawashima, Phys. Rev. Lett. 123, 087203 (2019)] on the star lattice Kitaev model as an example, we compute the overlap of minimally entangled states through transfer matrices. We demonstrate that, similar to the anyon condensation, continuous deformation of a Z2\mathbb{Z}_2-injective projected entangled-pair state (PEPS) also allows us to study the transition between Abelian and non-Abelian topological orders. We show that the charge and flux anyons defined in the Abelian phase transmute into the σ\sigma anyon in the non-Abelian topological order. Furthermore, we show that contrary to the claim in [Phys. Rev. B 101, 035140 (2020)], both the LG and SG states have infinite correlation length in the non-Abelian regime, consistent with the no-go theorem that a chiral PEPS has a gapless parent Hamiltonian.
There are no more papers matching your filters at the moment.