Optimization of Quantum-Si Platinum single-molecule protein sequencing platform towards improved complex matrix protein identification

Publication: Posted on 29 January 2026 — CC-BY 4.0 — This is a preprint and has not been peer reviewed. Data may be preliminary. — https://doi.org/10.26434/chemrxiv.10001736/v1

Authors: Tomasz A Leski1, Sean M Brown2, Zachary T Johnson1, Scott N Dean1, Ellen R Goldman1, David A Stenger1

1. Center for Biomolecular Science and Engineering US Naval Research Laboratory
2. National Research Council Research Associate at U.S. Naval Research Laboratory

Abstract:

Proteins are a class of macromolecules with essential roles in processes and structures associated with life. Protein sequencing technologies are therefore fundamental for understanding cell metabolic pathways, disease mechanisms, and how pathogenic agents and toxins function. Emerging next generation protein sequencing (NGPS) technologies promise a dramatic improvement of proteomics methods enabling identification of pathogens and toxins with unparalleled sensitivity and precision. The Quantum-Si (QSi) Platinum Sequencer is a novel single molecule protein sequencing technology capable of single amino acid resolution. In this work, we conducted significant optimization of the QSi protein library preparation protocol, reducing sample preparation time from 32 to 10 hours without sacrificing substantial sequencing quality, allowing for a sample-to-answer timeline in less than 24 hours. The modified protocol was applied for analyzing a set of proteins including sixteen single-domain antibodies with diverse sequences and a nontoxic derivative of staphylococcal enterotoxin B. We were further able to determine the library dilution threshold: losing the ability to sequence beyond 100x dilution. Finally, we were able to successfully obtain protein sequences within a crude lysate background, demonstrating the effectiveness of sequencing within complex protein mixtures. Improvements in sequencing chemistry and data processing may soon lessen or eliminate the dependence on reference sequences: a current obstacle for efficiently characterizing unknown proteins. By further condensing and optimizing library preparation, this technique presents a potential application for proteomics that require rapid characterization of highly complex biological systems, significantly improving protein-based diagnostic technologies.