Precise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire

J Glanville, W Zhai, J Berka, D Telman… - Proceedings of the …, 2009 - National Acad Sciences
J Glanville, W Zhai, J Berka, D Telman, G Huerta, GR Mehta, I Ni, L Mei, PD Sundar…
Proceedings of the National Academy of Sciences, 2009National Acad Sciences
Antibody repertoire diversity, potentially as high as 1011 unique molecules in a single
individual, confounds characterization by conventional sequence analyses. In this study, we
present a general method for assessing human antibody sequence diversity displayed on
phage using massively parallel pyrosequencing, a novel application of Kabat column-
labeled profile Hidden Markov Models, and translated complementarity determining region
(CDR) capture-recapture analysis. Pyrosequencing of domain amplicon and RCA PCR …
Antibody repertoire diversity, potentially as high as 1011 unique molecules in a single individual, confounds characterization by conventional sequence analyses. In this study, we present a general method for assessing human antibody sequence diversity displayed on phage using massively parallel pyrosequencing, a novel application of Kabat column-labeled profile Hidden Markov Models, and translated complementarity determining region (CDR) capture-recapture analysis. Pyrosequencing of domain amplicon and RCA PCR products generated 1.5 × 106 reads, including more than 1.9 × 105 high quality, full-length sequences of antibody variable fragment (Fv) variable domains. Novel methods for germline and CDR classification and fine characterization of sequence diversity in the 6 CDRs are presented. Diverse germline contributions to the repertoire with random heavy and light chain pairing are observed. All germline families were found to be represented in 1.7 × 104 sequences obtained from repeated panning of the library. While the most variable CDR (CDR-H3) presents significant length and sequence variability, we find a substantial contribution to total diversity from somatically mutated germline encoded CDRs 1 and 2. Using a capture-recapture method, the total diversity of the antibody library obtained from a human donor Immunoglobulin M (IgM) pool was determined to be at least 3.5 × 1010. The results provide insights into the role of IgM diversification, display library construction, and productive germline usages in antibody libraries and the humoral repertoire.
National Acad Sciences