Changzhou University
The recognition of pig behavior plays a crucial role in smart farming and welfare assurance for pigs. Currently, in the field of pig behavior recognition, the lack of publicly available behavioral datasets not only limits the development of innovative algorithms but also hampers model robustness and algorithm optimization. This paper proposes a dataset containing 13 pig behaviors that significantly impact welfare. Based on this dataset, this paper proposes a spatial-temporal perception and enhancement network based on the attention mechanism to model the spatiotemporal features of pig behaviors and their associated interaction areas in video data. The network is composed of a spatiotemporal perception network and a spatiotemporal feature enhancement network. The spatiotemporal perception network is responsible for establishing connections between the pigs and the key regions of their behaviors in the video data. The spatiotemporal feature enhancement network further strengthens the important spatial features of individual pigs and captures the long-term dependencies of the spatiotemporal features of individual behaviors by remodeling these connections, thereby enhancing the model's perception of spatiotemporal changes in pig behaviors. Experimental results demonstrate that on the dataset established in this paper, our proposed model achieves a mAP score of 75.92%, which is an 8.17% improvement over the best-performing traditional model. This study not only improves the accuracy and generalizability of individual pig behavior recognition but also provides new technological tools for modern smart farming. The dataset and related code will be made publicly available alongside this paper.
Automated generation of high-quality media presentations is challenging, requiring robust content extraction, narrative planning, visual design, and overall quality optimization. Existing methods often produce presentations with logical inconsistencies and suboptimal layouts, thereby struggling to meet professional standards. To address these challenges, we introduce RCPS (Reflective Coherent Presentation Synthesis), a novel framework integrating three key components: (1) Deep Structured Narrative Planning; (2) Adaptive Layout Generation; (3) an Iterative Optimization Loop. Additionally, we propose PREVAL, a preference-based evaluation framework employing rationale-enhanced multi-dimensional models to assess presentation quality across Content, Coherence, and Design. Experimental results demonstrate that RCPS significantly outperforms baseline methods across all quality dimensions, producing presentations that closely approximate human expert standards. PREVAL shows strong correlation with human judgments, validating it as a reliable automated tool for assessing presentation quality.
Forecasting pedestrian trajectories in dynamic scenes remains a critical problem in various applications, such as autonomous driving and socially aware robots. Such forecasting is challenging due to human-human and human-object interactions and future uncertainties caused by human randomness. Generative model-based methods handle future uncertainties by sampling a latent variable. However, few studies explored the generation of the latent variable. In this work, we propose the Trajectory Predictor with Pseudo Oracle (TPPO), which is a generative model-based trajectory predictor. The first pseudo oracle is pedestrians' moving directions, and the second one is the latent variable estimated from ground truth trajectories. A social attention module is used to aggregate neighbors' interactions based on the correlation between pedestrians' moving directions and future trajectories. This correlation is inspired by the fact that pedestrians' future trajectories are often influenced by pedestrians in front. A latent variable predictor is proposed to estimate latent variable distributions from observed and ground-truth trajectories. Moreover, the gap between these two distributions is minimized during training. Therefore, the latent variable predictor can estimate the latent variable from observed trajectories to approximate that estimated from ground-truth trajectories. We compare the performance of TPPO with related methods on several public datasets. Results demonstrate that TPPO outperforms state-of-the-art methods with low average and final displacement errors. The ablation study shows that the prediction performance will not dramatically decrease as sampling times decline during tests.
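The abstract does not say how the gap between the two latent distributions is measured. A minimal sketch, assuming both distributions are diagonal Gaussians and the gap is their KL divergence (a common choice in conditional generative predictors, not confirmed by the source); the function name and the example values are hypothetical:

```python
import math

def kl_diag_gaussians(mu_q, sigma_q, mu_p, sigma_p):
    """KL(q || p) for two diagonal Gaussians given per-dimension means and std devs."""
    total = 0.0
    for mq, sq, mp, sp in zip(mu_q, sigma_q, mu_p, sigma_p):
        # Closed-form KL divergence for one Gaussian dimension.
        total += math.log(sp / sq) + (sq**2 + (mq - mp)**2) / (2 * sp**2) - 0.5
    return total

# q: latent distribution estimated with access to ground-truth trajectories (training only).
# p: latent distribution estimated from observed trajectories alone.
gap = kl_diag_gaussians([0.5, -0.2], [1.0, 0.8], [0.0, 0.0], [1.0, 1.0])
```

Driving a term like `gap` toward zero during training is what would let the observation-only predictor stand in for the ground-truth-based one at test time.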
Epilepsy is a prevalent neurological disorder marked by sudden, brief episodes of excessive neuronal activity caused by abnormal electrical discharges, which may lead to some mental disorders. Most existing deep learning methods for epilepsy detection rely solely on unimodal EEG signals, neglecting the potential benefits of multimodal information. To address this, we propose a novel multimodal model, DistilCLIP-EEG, based on the CLIP framework, which integrates both EEG signals and text descriptions to capture comprehensive features of epileptic seizures. The model comprises an EEG encoder based on the Conformer architecture and a text encoder, the proposed Learnable BERT (BERT-LP), with prompt learning applied within the encoders. Both operate in a shared latent space for effective cross-modal representation learning. To enhance efficiency and adaptability, we introduce a knowledge distillation method where the trained DistilCLIP-EEG serves as a teacher to guide a more compact student model to reduce training complexity and time. On the TUSZ, AUBMC, and CHB-MIT datasets, both the teacher and student models achieved accuracy rates exceeding 97%. Across all datasets, the F1-scores were consistently above 0.94, demonstrating the robustness and reliability of the proposed framework. Moreover, the student model's parameter count and model size are approximately 58.1% of those of the teacher model, significantly reducing model complexity and storage requirements while maintaining high performance. These results highlight the potential of our proposed model for EEG-based epilepsy detection and establish a solid foundation for deploying lightweight models in resource-constrained settings.
Airway mucus is a complex gel with an anisotropic three-dimensional network structure. As a crucial component of the respiratory defense barrier, it plays a vital role in maintaining airway hydration and supporting the function of airway epithelial cells. Through linear and nonlinear rheological mechanisms such as ciliary motion and coughing, airway mucus expels foreign pathogens and toxic nano- and microparticles while selectively allowing the passage of specific nutrients and proteins. These protective and clearance functions depend on the proper rheological properties of mucus under normal physiological conditions. However, in respiratory diseases such as cystic fibrosis (CF), chronic obstructive pulmonary disease (COPD), asthma, and COVID-19, excessive mucus secretion is often accompanied by abnormal rheological behaviors. This leads to impaired mucus flow, airway obstruction, and potentially life-threatening conditions. Therefore, this review examines the rheological behaviors of airway mucus in relation to health and disease, focusing on both macrorheology and microrheology. The review highlights changes in the chemical composition and microstructure of airway mucus, especially under pathological conditions, that can significantly alter its rheological behavior. Rheological parameters can also serve as biological indicators to study the role of mucus in clearance functions and aid in developing pulmonary drug delivery systems. By integrating findings from both macro- and microrheological studies, this review aims to enhance our understanding of the complex behavior of airway mucus, supporting better diagnosis, treatment, and management of chronic respiratory diseases.
Achieving joint learning of Salient Object Detection (SOD) and Camouflaged Object Detection (COD) is extremely challenging due to their distinct object characteristics, i.e., saliency and camouflage. The only preliminary research treats them as two contradictory tasks, training models on large-scale labeled data alternately for each task and assessing them independently. However, such task-specific mechanisms fail to meet real-world demands for addressing unknown tasks effectively. To address this issue, in this paper, we pioneer a task-agnostic framework to unify SOD and COD. To this end, inspired by the agreeable nature of binary segmentation for SOD and COD, we propose a Contrastive Distillation Paradigm (CDP) to distil the foreground from the background, facilitating the identification of salient and camouflaged objects amidst their surroundings. To probe into the contribution of our CDP, we design a simple yet effective contextual decoder involving the interval-layer and global context, which achieves an inference speed of 67 fps. Besides the supervised setting, our CDP can be seamlessly integrated into unsupervised settings, eliminating the reliance on extensive human annotations. Experiments on public SOD and COD datasets demonstrate the superiority of our proposed framework in both supervised and unsupervised settings, compared with existing state-of-the-art approaches. Code is available on this https URL
With an interpolation method on the $P$-$\mu$ plane, a hybrid equation of state (EOS) is explored. The quark phase is described by our newly developed self-consistent two-flavor Nambu-Jona-Lasinio model. It retains the contribution from the vector channel in the Fierz-transformed Lagrangian by introducing a weighting parameter $\alpha$ [Chin. Phys. C \textbf{43}, 084102 (2019)]. In the hadron phase we use the relativistic mean-field theory. We study the dependence of the hybrid EOS and the mass-radius relation on $\alpha$. It is found that increasing $\alpha$ makes the hybrid EOS softer in the medium-pressure region. We can obtain stellar masses larger than $2M_\odot$. Further, we calculate the tidal deformability $\tilde\Lambda$ for binary stars and compare it with a recent analysis of GW170817 [Phys. Rev. X \textbf{9}, 011001 (2019)].
Hallucination in large language models (LLMs) is the core obstacle to their reliable deployment. This article formalizes the large language model as a probabilistic Turing machine by constructing a "computational necessity hierarchy", and for the first time proves that hallucinations are inevitable at the diagonalization, uncomputability, and information-theoretic boundaries, supported by the new "learner pumping lemma". However, we propose two "escape routes": one is to model Retrieval-Augmented Generation (RAG) as oracle machines, proving their absolute escape through "computational jumps" and providing the first formal theory for the effectiveness of RAG; the second is to formalize continual learning as an "internalized oracle" mechanism and implement this path through a novel neural game theory this http URL, this article proposes a
Deep learning-based medical image segmentation faces significant challenges arising from limited labeled data and domain shifts. While prior approaches have primarily addressed these issues independently, their simultaneous occurrence is common in medical imaging. A method that generalizes to unseen domains using only minimal annotations offers significant practical value due to reduced data annotation and development costs. In pursuit of this goal, we propose FSDA-DG, a novel solution to improve cross-domain generalizability of medical image segmentation with few single-source domain annotations. Specifically, our approach introduces semantics-guided semi-supervised data augmentation. This method divides images into global broad regions and semantics-guided local regions, and applies distinct augmentation strategies to enrich data distribution. Within this framework, both labeled and unlabeled data are transformed into extensive domain knowledge while preserving domain-invariant semantic information. Additionally, FSDA-DG employs a multi-decoder U-Net pipeline semi-supervised learning (SSL) network to improve domain-invariant representation learning through consistent prior assumption across multiple perturbations. By integrating data-level and model-level designs, FSDA-DG achieves superior performance compared to state-of-the-art methods in two challenging single domain generalization (SDG) tasks with limited annotations. The code is publicly available at this https URL.
De novo peptide sequencing from mass spectrometry data is an important method for protein identification. Recently, various deep learning approaches have been applied to de novo peptide sequencing, and DeepNovoV2 is one of the representative models. In this study, we propose an enhanced model, DePS, which can improve the accuracy of de novo peptide sequencing even with missing signal peaks or a large number of noisy peaks in tandem mass spectrometry data. It is shown that, on the same test set as DeepNovoV2, the DePS model achieves 74.22%, 74.21% and 41.68% for amino acid recall, amino acid precision and peptide recall, respectively. Furthermore, the results suggest that DePS outperforms DeepNovoV2 on the cross-species dataset.
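The three reported metrics can be illustrated with a short sketch. It assumes a simplified matching rule in which a predicted residue counts as correct only if it equals the true residue at the same position; the evaluation protocol actually used for DePS and DeepNovoV2 is not given in the abstract and may match residues by mass instead. The function name and the toy peptides are hypothetical:

```python
def sequencing_metrics(predicted, truth):
    """Compute amino acid recall/precision and peptide recall.

    predicted, truth: parallel lists of peptides, each a list of residue tokens.
    Assumption: residues are matched position by position (simplified rule).
    """
    matched = total_pred = total_true = exact = 0
    for pred, true in zip(predicted, truth):
        matched += sum(1 for a, b in zip(pred, true) if a == b)
        total_pred += len(pred)
        total_true += len(true)
        exact += int(pred == true)  # whole peptide recovered
    return {
        "aa_recall": matched / total_true,      # correct residues / true residues
        "aa_precision": matched / total_pred,   # correct residues / predicted residues
        "peptide_recall": exact / len(truth),   # fully correct peptides / all peptides
    }

predicted = [list("PEPTIDE"), list("ACDK")]
truth = [list("PEPTIDE"), list("ACEK")]
metrics = sequencing_metrics(predicted, truth)
```

Under this rule a single wrong residue leaves amino acid recall high but drops the whole peptide from peptide recall, which is consistent with peptide recall being much lower than the amino-acid-level scores.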
The part-whole relational property endowed by Capsule Networks (CapsNets) has proven successful for camouflaged object detection (COD) due to its segmentation integrity. However, the previous Expectation Maximization (EM) capsule routing algorithm, with its heavy computation and large number of parameters, obstructs this trend. The primary cause lies in the pixel-level capsule routing. As an alternative, in this paper we propose a novel mamba capsule routing at the type level. Specifically, we first extract the implicit latent state in mamba as capsule vectors, which abstract type-level capsules from pixel-level versions. These type-level mamba capsules are fed into the EM routing algorithm to obtain the high-layer mamba capsules, which greatly reduces the computation and parameters incurred by pixel-level capsule routing when exploring part-whole relationships. On top of that, to retrieve the pixel-level capsule features for further camouflaged prediction, we build on the low-layer pixel-level capsules, guided by the correlations from adjacent-layer type-level mamba capsules. Extensive experiments on three widely used COD benchmark datasets demonstrate that our method significantly outperforms state-of-the-art methods. Code is available at this https URL\_capsule.
We extend the parity doublet model for hadronic matter and study the possible presence of quark matter inside the cores of neutron stars with the Nambu-Jona-Lasinio (NJL) model. Considering the uncertainties of the QCD phase diagram and the location of the critical endpoint, we aim to explore the competition between the chiral phase transition and the deconfinement phase transition systematically, regulated by the vacuum pressure $-B$ in the NJL model. Employing a Maxwell construction, a sharp first-order deconfinement phase transition is implemented combining the parity doublet model for the hadronic phase and the NJL model for the high-energy quark phase. The position of the chiral phase transition is obtained from the NJL model self-consistently. We find stable neutron stars with a quark core within a specific parameter space that satisfies current astronomical observations. The observations suggest a relatively large chiral invariant mass $m_0=600$ MeV in the parity doublet model and a larger split between the chiral and deconfinement phase transitions while assuming the first-order deconfinement phase transition. The maximum mass of the hybrid star that we obtain is $\sim 2.2 M_{\odot}$.
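A Maxwell construction places the sharp transition at the chemical potential where the two phases have equal pressure, with the higher-pressure phase favored on either side. The sketch below locates that point by bisection. The two pressure functions are toy forms chosen purely for illustration (they are not the parity doublet or NJL equations of state), and all names, coefficients, and units are hypothetical:

```python
def hadron_pressure(mu, a_h=1.0):
    """Toy hadronic EOS: P_H(mu) = a_h * mu^4 (arbitrary units, illustrative only)."""
    return a_h * mu**4

def quark_pressure(mu, a_q=2.0, bag=1.0):
    """Toy quark EOS: P_Q(mu) = a_q * mu^4 - B, with B a vacuum pressure shift."""
    return a_q * mu**4 - bag

def maxwell_transition(p_low, p_high, mu_lo, mu_hi, tol=1e-12):
    """Bisect for the chemical potential mu_c where p_low(mu_c) == p_high(mu_c)."""
    def diff(mu):
        return p_high(mu) - p_low(mu)
    if diff(mu_lo) * diff(mu_hi) > 0:
        raise ValueError("pressure curves do not cross inside the bracket")
    while mu_hi - mu_lo > tol:
        mid = 0.5 * (mu_lo + mu_hi)
        if diff(mu_lo) * diff(mid) <= 0:
            mu_hi = mid   # crossing lies in the lower half
        else:
            mu_lo = mid   # crossing lies in the upper half
    return 0.5 * (mu_lo + mu_hi)

mu_c = maxwell_transition(hadron_pressure, quark_pressure, 0.5, 2.0)
p_c = hadron_pressure(mu_c)  # transition pressure, shared by both phases
```

In this toy setup, increasing the shift `bag` moves the crossing to a higher chemical potential, which mirrors how a vacuum pressure term can regulate where deconfinement sets in.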
Ultrasound imaging is a prevalent diagnostic tool known for its simplicity and non-invasiveness. However, its inherent characteristics often introduce substantial noise, posing considerable challenges for automated lesion or organ segmentation in ultrasound video sequences. To address these limitations, we propose the Dual Semantic-Aware Network (DSANet), a novel framework designed to enhance noise robustness in ultrasound video segmentation by fostering mutual semantic awareness between local and global features. Specifically, we introduce an Adjacent-Frame Semantic-Aware (AFSA) module, which constructs a channel-wise similarity matrix to guide feature fusion across adjacent frames, effectively mitigating the impact of random noise without relying on pixel-level relationships. Additionally, we propose a Local-and-Global Semantic-Aware (LGSA) module that reorganizes and fuses temporal unconditional local features, which capture spatial details independently at each frame, with conditional global features that incorporate temporal context from adjacent frames. This integration facilitates multi-level semantic representation, significantly improving the model's resilience to noise interference. Extensive evaluations on four benchmark datasets demonstrate that DSANet substantially outperforms state-of-the-art methods in segmentation accuracy. Moreover, since our model avoids pixel-level feature dependencies, it achieves significantly higher inference FPS than video-based methods, and even surpasses some image-based models. Code can be found in \href{this https URL}{DSANet}
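The abstract does not give the AFSA formulation. A minimal sketch of one way a channel-wise similarity matrix could guide fusion across adjacent frames, assuming an attention-like form (scaled dot products between channels, a row-wise softmax, and a residual sum); the function names and the toy features are hypothetical:

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    s = sum(exps)
    return [e / s for e in exps]

def channel_similarity_fusion(feat_t, feat_adj):
    """feat_t, feat_adj: lists of C channels, each a flat list of N values.

    Returns (similarity, fused): similarity is C x C, fused is C x N.
    """
    n = len(feat_t[0])
    scale = math.sqrt(n)
    # Each row compares one current-frame channel with every adjacent-frame channel.
    similarity = [
        softmax([sum(a * b for a, b in zip(ch_t, ch_a)) / scale for ch_a in feat_adj])
        for ch_t in feat_t
    ]
    fused = []
    for c, ch_t in enumerate(feat_t):
        # Weighted mix of adjacent-frame channels, added back to the current channel.
        mixed = [
            sum(similarity[c][k] * feat_adj[k][i] for k in range(len(feat_adj)))
            for i in range(n)
        ]
        fused.append([x + y for x, y in zip(ch_t, mixed)])
    return similarity, fused

feat_t = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
feat_adj = [[0.9, 0.1, 1.1, 0.0], [0.1, 0.8, 0.0, 1.2]]
similarity, fused = channel_similarity_fusion(feat_t, feat_adj)
```

Because the matrix is C x C rather than N x N, its cost does not grow with the square of the number of pixels, which fits the abstract's point about avoiding pixel-level relationships.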
Three-dimensional generative models increasingly drive structure-based drug discovery, yet they remain constrained by the scarcity of publicly available protein-ligand complexes. Under such data scarcity, almost all existing pipelines struggle to learn transferable geometric priors and consequently overfit to training-set biases. As such, we present IBEX, an Information-Bottleneck-EXplored coarse-to-fine pipeline to tackle the chronic shortage of protein-ligand complex data in structure-based drug design. Specifically, we use PAC-Bayesian information-bottleneck theory to quantify the information density of each sample. This analysis reveals how different masking strategies affect generalization and indicates that, compared with conventional de novo generation, the constrained Scaffold Hopping task endows the model with greater effective capacity and improved transfer performance. IBEX retains the original TargetDiff architecture and hyperparameters for training to generate molecules compatible with the binding pocket; it then applies an L-BFGS optimization step to refine each conformation by optimizing five physics-based terms and adjusting six translational and rotational degrees of freedom in under one second. With only these modifications, IBEX raises the zero-shot docking success rate on the CrossDocked2020-based CBGBench benchmark from 53% to 64%, improves the mean Vina score from $-7.41$ kcal mol$^{-1}$ to $-8.07$ kcal mol$^{-1}$, and achieves the best median Vina energy in 57 of 100 pockets versus 3 for the original TargetDiff. IBEX also increases the QED by 25%, achieves state-of-the-art validity and diversity, and markedly reduces extrapolation error.
Fracturing-flooding is a new process for developing low-permeability oil reservoirs and has achieved a series of successful applications in oilfield production. However, existing numerical simulation methods for fracturing-flooding struggle to efficiently and accurately simulate the dynamic changes in reservoir properties during the process, particularly the expansion and closure of fractures within the reservoir. This paper introduces a Darcy flow model with dual-porosity and dual-permeability characteristics based on seepage mechanics theory, utilizing two sets of rock stress-sensitivity parameter tables to describe the physical property changes of the matrix and fractures during the fracturing-flooding process. Different parameters are set for the X and Y directions to characterize the anisotropic features of the reservoir. A numerical simulation method aimed at dynamic analysis of fracturing-flooding is established, along with an automatic history-matching method based on the CMA-ES algorithm to derive rock mechanics parameters that align with actual block conditions.
We explore the possibility of phase transitions between different quark matter phases occurring within quark stars, giving rise to hybrid quark stars (HybQSs). Utilizing a well-established general parameterization of interacting quark matter, we construct quark star models featuring sharp first-order quark-quark phase transitions of various types, in contrast to the hadron-quark transition in conventional hybrid stars. We systematically investigate how recent observations, such as the pulsar mass measurements $M_{\rm TOV}\gtrsim 2M_{\odot}$ and the GW170817 tidal deformability bound $\Lambda_{1.4M_{\odot}}<800$, constrain the viable parameter space. We also identify twin stars in some of the HybQS parameter space. This work unveils new possibilities of phase transitions and the resulting new types of compact stars in realistic astrophysical scenarios.
Using the parity doublet model (PDM) for hadronic matter and a modified Nambu-Jona-Lasinio (NJL) model for quark matter, we investigate the potential existence of two- and three-flavor quark matter in neutron star cores. Both models respect chiral symmetry, and a sharp first-order phase transition is implemented via Maxwell construction. We find stable neutron stars with quark cores within a specific parameter space that satisfies current astronomical observations. Typical neutron stars with masses around $1.4\ M_\odot$ may possess deconfined quark matter in their centers. The hybrid star scenario with a two-flavor quark core offers enough parameter space to allow neutron stars with large quark cores exceeding $\sim 1\ M_\odot$, and to allow early deconfinement before $2\ \rho_0$, where $\rho_0$ is the nuclear saturation density. The observations of gravitational wave event GW170817 suggest a relatively large chiral invariant mass $m_0=600\ \rm MeV$ in the PDM for scenarios involving three-flavor quark matter cores. The maximum mass of the hybrid star with a quark core is found to be approximately $2.2\ M_\odot$ for both two- and three-flavor quark matter in the center.
The Sine-Cosine function, which is widely adopted in mathematics and physics, has attracted our attention due to its unique properties. By delving into the coupling effect of the Sine-Cosine function, we discover a previously unreported class of nonlinear systems, namely the Sine-Cosine Nonlinear System Family (SCNSF). This discovery is motivated by the need to expand the repertoire of nonlinear systems and understand the complex behaviors that can emerge from the combination of basic trigonometric functions. The SCNSF has both chaotic characteristics in the real number domain and fractal characteristics in the complex number domain. The classification and general mathematical description of SCNSF provide a solid theoretical foundation for further research. The proposal of three types of classic systems within SCNSF and the investigation of their chaotic properties and hardware implementation open up new avenues for practical applications. The large chaotic range exhibited by these systems implies their potential applications in various fields such as secure communication and chaotic circuit design. Moreover, the discovery of the chaos generation mechanism based on the coupling effect of the Sine-Cosine function deepens our understanding of the origin of chaos. In the complex number domain, the high parameter sensitivity and rich fractal patterns of SCNSF can be harnessed to develop more advanced encryption algorithms and more sensitive signal detection methods, thereby contributing to the advancement of information security and signal processing technologies. Overall, the chaotic and fractal properties of SCNSF make it a valuable asset in the pursuit of innovative solutions in multiple scientific and engineering disciplines.
Empirical A-site cation substitution has advanced the stability and efficiency of hybrid organic-inorganic lead halide perovskite solar cells and the functionality of X-ray detectors. Yet, the fundamental mechanisms underpinning their unique performance remain elusive. This multi-modal study unveils the link between nanoscale structural dynamics and macroscopic optoelectronic properties in these materials by utilising X-ray diffuse scattering, inelastic neutron spectroscopy and optical microscopy complemented by state-of-the-art machine learning-assisted molecular dynamics simulations. Our approach uncovers the presence of dynamic, lower-symmetry local nanodomains embedded within the higher-symmetry average phase in various perovskite compositions. The properties of these nanodomains are tunable via the A-site cation selection: methylammonium induces a high density of anisotropic, planar nanodomains of out-of-phase octahedral tilts, while formamidinium favours sparsely distributed isotropic, spherical nanodomains with in-phase tilting, even when crystallography reveals cubic symmetry on average. The observed variations in the properties of dynamic nanodomains are in agreement with our simulations and are directly linked to the differing macroscopic optoelectronic and ferroelastic behaviours of these compositions. By demonstrating the influence of the A-site cation on local nanodomains and, consequently, on macroscopic properties, we propose leveraging this relationship to engineer the optoelectronic response of these materials, propelling further advancements in perovskite-based photovoltaics, optoelectronics, and X-ray imaging.
In the paper [H. Kubo, Global existence for exterior problems of semilinear wave equations with the null condition in 2D, Evol. Equ. Control Theory 2 (2013), no. 2, 319-335], for the 2-D semilinear wave equation system $(\partial_t^2-\Delta)v^I=Q^I(\partial_tv, \nabla_xv)$ ($1\le I\le M$) in the exterior domain with Dirichlet boundary condition, it is shown that the small data smooth solution $v=(v^1, \cdot\cdot\cdot, v^M)$ exists globally when the cubic nonlinearities $Q^I(\partial_tv, \nabla_xv)=O(|\partial_tv|^3+|\nabla_xv|^3)$ satisfy the null condition. We now focus on the global Dirichlet boundary value problem of the 2-D wave maps equation of the form $\Box u^I=\sum_{J,K,L=1}^MC_{IJKL}u^JQ_0(u^K,u^L)$ $(1\le I\le M)$, with $Q_0(f,g)=\partial_tf\partial_tg-\sum_{j=1}^2\partial_jf\partial_jg$, in the exterior domain. By establishing some crucial classes of pointwise spacetime decay estimates for the small data solution $u=(u^1, \cdot\cdot\cdot, u^M)$ and its derivatives, the global existence of $u$ is shown.