My Research
Welcome to my research page. I am a Ph.D. student in Computer Science specializing in Computational Biology.
Research Interests
- Computational Biology and Bioinformatics
- Algorithm Design and Analysis
- Applied Deep Learning and NLP
- Large Language Models
- Data Science for Bioinformatics
Projects
Cross-Species TFBS Prediction in Plant Genomes using DNA Foundation Models (2025)
Haghani, M.; Dhulipalla, K. V.; Li, S. (Under review)
- Fine-tuning and benchmarking three large pretrained DNA foundation models (DNABERT-2, AgroNT, HyenaDNA) on DAP-seq data for ABF transcription factors in Arabidopsis thaliana and Sisymbrium irio.
- Evaluated performance across cross-chromosome, cross-dataset, and cross-species protocols.
- Demonstrated that HyenaDNA achieves nearโstate-of-the-art accuracy with over 10ร faster training time, enabling scalable, genome-wide TFBS prediction in plants.
Analysis of Arabidopsis Nuclear Envelope Proteins using scRNA-Seq Data (2025)
- Performed scRNA-seq analysis of Arabidopsis nuclear envelope proteins from root cells using R.
- Identified cell type-specific genes and analyzed co-expression patterns.
- Generated gene clusters and constructed a co-expression network with hdWGCNA.
- Visualized co-expression networks to highlight functional links between genes.
HostVirusPair โ Enhancing Host-Viral Protein Complex Prediction (2024)
Haghani, M.; Bhattacharya, D.; Murali, T. M. (Under submission)
- Developed a novel MSA pairing algorithm using sequence-based deep learning.
- Integrated interchain coevolutionary signals to improve AlphaFold-Multimer predictions.
- Achieved higher DockQ scores and enhanced accuracy in structure predictions.
NEFFy โ A Toolbox for NEFF Calculation and MSA Conversion (2025)
Haghani, M.; Bhattacharya, D.; Murali, T. M. Bioinformatics, 2025 โ Website
- Developed a tool for calculating Number of Effective Sequences (NEFF).
- Enabled MSA format conversion across protein, DNA, and RNA alphabets.
- Implemented column-wise NEFF calculation and multimeric MSA handling.
- Built in C++ with a Python library for integration into workflows.