{"doi":"10.1038/s41597-020-00744-3","title":"A primary human T-cell spectral library to facilitate large scale quantitative T-cell proteomics","abstract":"<jats:title>Abstract</jats:title><jats:p>Data independent analysis (DIA) exemplified by sequential window acquisition of all theoretical mass spectra (SWATH-MS) provides robust quantitative proteomics data, but the lack of a public primary human T-cell spectral library is a current resource gap. Here, we report the generation of a high-quality spectral library containing data for 4,833 distinct proteins from human T-cells across genetically unrelated donors, covering ~24% proteins of the UniProt/SwissProt reviewed human proteome. SWATH-MS analysis of 18 primary T-cell samples using the new human T-cell spectral library reliably identified and quantified 2,850 proteins at 1% false discovery rate (FDR). In comparison, the larger Pan-human spectral library identified and quantified 2,794 T-cell proteins in the same dataset. As the libraries identified an overlapping set of proteins, combining the two libraries resulted in quantification of 4,078 human T-cell proteins. Collectively, this large data archive will be a useful public resource for human T-cell proteomic studies. The human T-cell library is available at SWATHAtlas and the data are available via ProteomeXchange (PXD019446 and PXD019542) and PeptideAtlas (PASS01587).</jats:p>","journal":"Scientific Data","year":2020,"id":24194,"datarank":0.781547725215308,"base_score":2.8903717578961645,"endowment":2.8903717578961645,"self_citation_contribution":0.4335557636844247,"citation_network_contribution":0.34799196153088324,"self_endowment_contribution":0.4335557636844247,"citer_contribution":0.34799196153088324,"corpus_percentile":55.8,"corpus_rank":597,"citation_count":17,"citer_count":10,"citers_with_citation_signal":10,"citers_with_endowment":10,"datacite_reuse_total":2,"is_dataset":true,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":60.4167,"fair_percentile":92.63412489006157,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":146009,"name":"Jeremy Potriquet","orcid":null,"position":1,"is_corresponding":false},{"id":146010,"name":"Alok K. Shah","orcid":"0000-0002-2687-1168","position":2,"is_corresponding":false},{"id":146011,"name":"Sarah Reed","orcid":"0000-0001-5219-9590","position":3,"is_corresponding":false},{"id":146012,"name":"Buddhika Jayakody","orcid":null,"position":4,"is_corresponding":false},{"id":146013,"name":"Charu Kapil","orcid":"0000-0002-8750-5006","position":5,"is_corresponding":false},{"id":146014,"name":"Mukul K. Midha","orcid":"0000-0003-4053-0682","position":6,"is_corresponding":false},{"id":78845,"name":"Robert L. Moritz","orcid":"0000-0002-3216-9447","position":7,"is_corresponding":false},{"id":146015,"name":"Ailin Lepletier","orcid":"0000-0002-1371-7313","position":8,"is_corresponding":false},{"id":146016,"name":"Jason Mulvenna","orcid":null,"position":9,"is_corresponding":false},{"id":146017,"name":"John J. Miles","orcid":null,"position":10,"is_corresponding":false},{"id":146018,"name":"Michelle M. Hill","orcid":"0000-0003-1134-0951","position":11,"is_corresponding":false},{"id":146008,"name":"Harshi Weerakoon","orcid":"0000-0002-8699-133X","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":2.8903717578961645,"endowment":2.8903717578961645,"datacite_reuse_total":2,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"33230158","pmcid":"PMC7683684","openalex_id":"https://openalex.org/W3107622006","authors":[],"funders":[{"funder_name":"NIGMS NIH HHS","grant_id":"R01 GM087221","title":null},{"funder_name":"National Health and Medical Research Council (NHMRC)","grant_id":"1108064","title":"The bioactivity and binding partners of Irukandji and Box Jellyfish venom"},{"funder_name":"National Health and Medical Research Council (NHMRC)","grant_id":"1131732","title":"Understanding and modulating the human immune system"},{"funder_name":"National Institutes of Health","grant_id":"5R01GM087221-10","title":"Development of Trans Proteomic Pipeline, an Analysis Suite for Mass Spectrometry"}],"total_grants":4,"fwci":1.3541,"citation_percentile":0.79056984,"influential_citations":0,"citation_trend":[{"year":2020,"count":1},{"year":2021,"count":4},{"year":2022,"count":5},{"year":2023,"count":2},{"year":2024,"count":4},{"year":2025,"count":1}],"oa_status":"gold","license":"cc-by","oa_locations":[{"url":"https://www.nature.com/articles/s41597-020-00744-3.pdf","host_type":"journal"},{"url":"https://www.nature.com/articles/s41597-020-00744-3.pdf","host_type":"GOLD"},{"url":"https://www.nature.com/articles/s41597-020-00744-3.pdf","host_type":"publisher"},{"url":"https://www.nature.com/articles/s41597-020-00744-3","host_type":"publisher"},{"url":"https://doi.org/10.1038/s41597-020-00744-3","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/33230158","host_type":"repository"},{"url":"https://researchonline.jcu.edu.au/66036/1/John%20Miles%20%232.pdf","host_type":"repository"},{"url":"https://doaj.org/article/c39eb5f546ac4ddab0c080896f2affe2","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/7683684","host_type":"repository"},{"url":"http://hdl.handle.net/10072/399944","host_type":"repository"},{"url":"https://europepmc.org/articles/PMC7683684","host_type":"Europe_PMC"},{"url":"https://europepmc.org/articles/PMC7683684?pdf=render","host_type":"Europe_PMC"},{"url":"http://dx.doi.org/10.1038/s41597-020-00744-3","host_type":""},{"url":"https://api.library.uq.edu.au/view/UQ:7c727c5","host_type":""},{"url":"https://dx.doi.org/10.1038/s41597-020-00744-3","host_type":""},{"url":"https://doi.org/https://doi.org/10.1038/s41597-020-00744-3","host_type":""}],"fields_of_study":["Advanced Proteomics Techniques and Applications","Biosensors and Analytical Detection","Mass Spectrometry Techniques and Applications","Biology","Medicine","0301 basic medicine","0303 health sciences","03 medical and health sciences","Databases, Protein","Humans","Proteome","Proteomics","T-Lymphocytes"],"mesh_terms":["Humans","T-Lymphocytes","Proteome","Databases, Protein","Proteomics"],"keywords":["UniProt","Proteome","Human proteome project","Proteomics","Computational biology","Human Protein Atlas","Quantitative proteomics","Biology","Human cell","Human proteins","Bioinformatics","Cell culture","Genetics","Protein expression","Gene","Statistics and Probability","Data Descriptor","Science","T-Lymphocytes","Q","Clinical sciences","Library and Information Sciences","06 Biological Sciences","Mass","0601 Biochemistry and Cell Biology","Computer Science Applications","Education","Peptide Identification","306","Humans","Statistics, Probability and Uncertainty","Databases, Protein","Information Systems"],"sdg_mappings":[{"sdg_number":0,"sdg_label":"Partnerships for the goals"}],"linked_datasets":[{"doi":"10.6084/m9.figshare.12991619.v1","title":"Metadata record for: A primary human T-cell spectral library to facilitate large scale quantitative T-cell proteomics","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.12991619","title":"Metadata record for: A primary human T-cell spectral library to facilitate large scale quantitative T-cell proteomics","publisher":"figshare","resource_type":"Dataset"}],"clinical_trials":[],"software_tools":[],"database_accessions":[{"name":"pxd"}],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-07T21:49:42.243762Z","pmid":"33230158","pmcid":"PMC7683684","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":72.5,"fair_i":62.5,"fair_r":41.6667,"fair_zscore":1.3754,"fair_rationale":{"fair_score":60.42,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=2, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper provides a machine-accessible metadata file (Figshare DOI) and uses standard identifiers (UniProt, PRIDE, PeptideAtlas), but does not describe structured, machine-readable metadata beyond the abstract and subject terms."}]},"A":{"name":"Accessible","score":72.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":0.5,"signal":"files/OA location present but not flagged OA","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"16 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"Data access is clearly stated via ProteomeXchange (PXD019446, PXD019542) and PeptideAtlas (PASS01587), but no explicit code repository or step-by-step protocol for accessing the data is provided."}]},"I":{"name":"Interoperable","score":62.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"linked_datasets=0, datacite=2","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"Standard formats (mzML, pepXML) and vocabularies (UniProt, iRT) are used, but the paper does not specify use of community-standard ontologies or formal semantic annotations for the data."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"A data-availability statement and Creative Commons license (CC BY 4.0) are present, but the paper lacks a formal software license and detailed reproducibility instructions for the computational workflow."}]}},"suggestions":["Provide a machine-readable metadata file (e.g., JSON-LD or RDF) with structured descriptions of the data, methods, and variables.","Include a direct link to a code repository (e.g., GitHub) with the exact scripts and parameters used for library generation and analysis.","Add formal ontology terms (e.g., from OBI or EDAM) to describe the data types and experimental steps in the metadata.","Specify a software license (e.g., MIT or Apache 2.0) for any code or scripts associated with the study.","Provide a step-by-step reproducibility guide or containerized environment (e.g., Docker) for the computational pipeline."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:48:01.648523Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}