Toward a Species Search Engine: KISSE Offers a Rigorous Statistical Framework for Bone Collagen Tandem Mass Spectrometry Data.

Gharibi H, Saei AA, Chernobrovkin AL, Lundstrom SL, Lyu H, Meng Z, Vegvari A, Gaetani M, Zubarev RA

Adv Sci (Weinh) 12 (40) e03963 [2025-10-00; online 2025-08-11]

DNA and bone collagen are two key sources of resilient molecular markers used to identify species from their remains. Collagen is more stable than DNA, and thus it is preferred for ancient and degraded samples. Current mass spectrometry-based collagen sequencing approaches are empirical and lack a rigorous statistical framework. Based on the well-developed approaches to protein identification in shotgun proteomics, a first approximation of the species search engine (SSE) is introduced. SSE named KISSE is based on a species-specific library of collagenous peptides that uses both peptide sequences and their relative abundances. The developed statistical model can identify the species and the probability of correct identification, as well as determine the likelihood of the analyzed species not being in the library. The advantages and limitations of the proposed approach, and the possibility of extending it to other tissues is discussed.

Chemical Proteomics [Technology development]

PubMed 40787835

DOI 10.1002/advs.202503963

Crossref 10.1002/advs.202503963

pmc: PMC12561455


Publications 9.5.1