Publication

GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data

Downloadable Content

Persistent URL
Last modified
  • 05/15/2025
Type of Material
Authors
    Xingyu Yang, Georgia Institute of TechnologyChristopher Tipton, Emory UniversityMatthew Woodruff, Emory UniversityEnlu Zhou, Georgia Institute of TechnologyFrances Eun-Hyung Lee, Emory UniversityIgnacio Sanz, Emory UniversityPeng Qiu, Georgia Institute of Technology
Language
  • English
Date
  • 2020-09-09
Publisher
  • BMC
Publication Version
Copyright Statement
  • © The Author(s) 2020
License
Final Published Version (URL)
Title of Journal or Parent Work
Volume
  • 21
Issue
  • Suppl 9
Start Page
  • 583
End Page
  • 583
Grant/Funding Information
  • This work was partially supported by funding from the National Science Foundation (CCF1552784). PQ is an ISAC Marylou Ingram Scholar and a Carol Ann and David D. Flanagan Faculty Fellow. Publication costs are funded by PQ’s Faculty Fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Supplemental Material (URL)
Abstract
  • Background: B cell affinity maturation enables B cells to generate high-affinity antibodies. This process involves somatic hypermutation of B cell immunoglobulin receptor (BCR) genes and selection by their ability to bind antigens. Lineage trees are used to describe this microevolution of B cell immunoglobulin genes. In a lineage tree, each node is one BCR sequence that mutated from the germinal center and each directed edge represents a single base mutation, insertion or deletion. In BCR sequencing data, the observed data only contains a subset of BCR sequences in this microevolution process. Therefore, reconstructing the lineage tree from experimental data requires algorithms to build the tree based on partially observed tree nodes. Results: We developed a new algorithm named Grow Lineages along Minimum Spanning Tree (GLaMST), which efficiently reconstruct the lineage tree given observed BCR sequences that correspond to a subset of the tree nodes. Through comparison using simulated and real data, GLaMST outperforms existing algorithms in simulations with high rates of mutation, insertion and deletion, and generates lineage trees with smaller size and closer to ground truth according to tree features that highly correlated with selection pressure. Conclusions: GLaMST outperforms state-of-art in reconstruction of the BCR lineage tree in both efficiency and accuracy. Integrating it into existing BCR sequencing analysis frameworks can significant improve lineage tree reconstruction aspect of the analysis.
Author Notes
Keywords
Research Categories
  • Biology, Microbiology
  • Biology, Cell
  • Biology, Genetics

Tools

Relations

In Collection:

Items