The RNA folding process is often hierarchical, beginning with the formation of RNA secondary (2D) structures through base pairing. The 2D structure prediction models are hampered by the availability of thermodynamic parameters of RNA motifs. We have addressed this challenge and developed two models
• Vfold2D [1-4] a physics-based model that employs the statistics of known structures to compute energy parameters, enabling the prediction of 2D structures with pseudoknots and non-canonical base pairs
• VfoldMCPX [5-6] a physics-based model designed to predict the structure of multi-strand RNA assembly.
These models significantly enhance the predictive capability of RNA 2D folding predictions and exhibit improved prediction accuracy.
Modeling the three-dimensional (3D) structure of RNA is crucial for understanding RNA functions and for the design of RNA-based drugs and therapeutics. Our research group addresses this challenge by developing multiple RNA 3D structure prediction models, including
• Vfold3D [1] a motif template-based model• VfoldLA [2] a loop template-based model
• IsRNA [3-5], a model based on statistical potentials and coarse-grained molecular dynamics
Additionally, we developed Vfold-Pipeline [6], an end-to-end, fully automated solution for RNA 3D structure modeling from a given sequence.
These models adopt various methods and perspectives to tackle the complexities of RNA 3D folding, thereby enhancing the accuracy of RNA structure predictions.
Identifying small molecules that bind RNA targets can lead to new therapeutics for diseases involving dysregulated RNA functions. Effective computational screening in this field demands robust docking software and scoring functions to predict binding modes and enable virtual screening. Our group recently developed two new tools
• RLDOCK [1,2] offers an exhaustive sampling of binding sites and an accurate energy scoring
• SPRank [3] a hybrid machine-learning and knowledge-based scoring function, enhances docking tools by sampling flexible ligands against diverse RNA structures.
Both models have shown improved binding mode prediction and virtual screening across multiple RNA targets such as HIV-1 TAR, underscoring their potential in RNA-targeted drug discovery.
The CRISPR/Cas9 nuclease system is a transformative technology for genome engineering, and its success hinges on the strategic design of guide RNAs (gRNAs) to precisely control on- and off-target cleavage efficiencies. To address this challenge, we developed two computational models built on the underlying physical principles of CRISPR gene editing.
• uCRISPR [1] model leverages integrated energy parameters to evaluate R-loop folding stability.
• kCRISPR [2] model applies kinetic Monte Carlo simulations to capture R-loop folding/unfolding kinetics and employs a supervised ensemble learning algorithm to pinpoint key determinants of cleavage efficiency.
Together, these models provide promising tools to accurately predict and optimize gRNA performance at both on- and off-target sites.
Metal ions are critical for RNA structure and function, but accurately modeling metal ion (e.g., Mg²⁺) effects remains a major challenge. Our group previously developed statistical mechanics-based models to account for ion correlation and fluctuations in nucleic acids: TBI [1] and MCTBI [2,3].
Building on this, we recently developed a deep learning method, MgNet which treats RNA-ion complexes as images and predicts Mg²⁺ binding sites that are tightly coordinated with RNA atoms. MgNet identifies critical atoms for inner/outer-sphere coordination and uncovers new ion-binding motifs. This model can complement X-ray crystallography for precise metal ion site identification.