{"doi":"10.1038/nature03025","title":"Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype","abstract":"Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests approximately 900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype.","journal":"Nature","year":2004,"id":2144,"datarank":12.69723058466512,"base_score":7.586803535162581,"endowment":7.586803535162581,"self_citation_contribution":1.1380205302743873,"citation_network_contribution":11.559210054390732,"self_endowment_contribution":1.1380205302743873,"citer_contribution":11.559210054390732,"corpus_percentile":84.13344182262001,"corpus_rank":196,"citation_count":2012,"citer_count":177,"citers_with_citation_signal":177,"citers_with_endowment":177,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.922,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2004-10-01","fair_score":31.25,"fair_percentile":12.708883025505717,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":25073,"name":"Jean-Marc Aury","orcid":null,"position":1,"is_corresponding":false},{"id":25074,"name":"Frédéric Brunet","orcid":"0000-0001-5145-8925","position":2,"is_corresponding":false},{"id":25075,"name":"Jean-Louis Petit","orcid":null,"position":3,"is_corresponding":false},{"id":25076,"name":"Nicole Stange-Thomann","orcid":null,"position":4,"is_corresponding":false},{"id":25077,"name":"Evan Mauceli","orcid":null,"position":5,"is_corresponding":false},{"id":25078,"name":"Laurence Bouneau","orcid":null,"position":6,"is_corresponding":false},{"id":25079,"name":"Cécile Fischer","orcid":null,"position":7,"is_corresponding":false},{"id":25080,"name":"Catherine Ozouf-Costaz","orcid":null,"position":8,"is_corresponding":false},{"id":25081,"name":"Alain Bernot","orcid":null,"position":9,"is_corresponding":false},{"id":25082,"name":"Sophie Nicaud","orcid":null,"position":10,"is_corresponding":false},{"id":25084,"name":"Sheila Fisher","orcid":"0000-0002-8681-2362","position":12,"is_corresponding":false},{"id":25085,"name":"Georges Lutfalla","orcid":"0000-0002-6591-7894","position":13,"is_corresponding":false},{"id":25086,"name":"Carole Dossat","orcid":null,"position":14,"is_corresponding":false},{"id":25087,"name":"Béatrice Segurens","orcid":null,"position":15,"is_corresponding":false},{"id":25088,"name":"Corinne Dasilva","orcid":null,"position":16,"is_corresponding":false},{"id":25089,"name":"Marcel Salanoubat","orcid":"0000-0003-1132-6455","position":17,"is_corresponding":false},{"id":25090,"name":"Michael Levy","orcid":null,"position":18,"is_corresponding":false},{"id":25091,"name":"Nathalie Boudet","orcid":null,"position":19,"is_corresponding":false},{"id":25092,"name":"Sergi Castellano","orcid":"0000-0002-5819-4210","position":20,"is_corresponding":false},{"id":25093,"name":"Véronique Anthouard","orcid":null,"position":21,"is_corresponding":false},{"id":25094,"name":"Claire Jubin","orcid":null,"position":22,"is_corresponding":false},{"id":25095,"name":"Vanina Castelli","orcid":null,"position":23,"is_corresponding":false},{"id":25096,"name":"Michael Katinka","orcid":null,"position":24,"is_corresponding":false},{"id":25097,"name":"Benoît Vacherie","orcid":"0000-0002-1564-0575","position":25,"is_corresponding":false},{"id":25098,"name":"Christian Biémont","orcid":null,"position":26,"is_corresponding":false},{"id":25099,"name":"Zineb Skalli","orcid":null,"position":27,"is_corresponding":false},{"id":25100,"name":"Laurence Cattolico","orcid":null,"position":28,"is_corresponding":false},{"id":25101,"name":"Julie Poulain","orcid":"0000-0002-8744-3116","position":29,"is_corresponding":false},{"id":25102,"name":"Véronique de Berardinis","orcid":"0000-0002-3273-4135","position":30,"is_corresponding":false},{"id":25103,"name":"Corinne Cruaud","orcid":"0000-0002-4752-7278","position":31,"is_corresponding":false},{"id":25104,"name":"Simone Duprat","orcid":"0000-0001-7902-6526","position":32,"is_corresponding":false},{"id":25105,"name":"Philippe Brottier","orcid":null,"position":33,"is_corresponding":false},{"id":25106,"name":"Jean-Pierre Coutanceau","orcid":null,"position":34,"is_corresponding":false},{"id":25107,"name":"Jérôme Gouzy","orcid":"0000-0001-5695-4557","position":35,"is_corresponding":false},{"id":25108,"name":"Genis Parra","orcid":null,"position":36,"is_corresponding":false},{"id":25109,"name":"Guillaume Lardier","orcid":null,"position":37,"is_corresponding":false},{"id":25110,"name":"Charles Chapple","orcid":null,"position":38,"is_corresponding":false},{"id":25111,"name":"Kevin J. McKernan","orcid":null,"position":39,"is_corresponding":false},{"id":25112,"name":"Paul McEwan","orcid":null,"position":40,"is_corresponding":false},{"id":25113,"name":"Stephanie Bosak","orcid":null,"position":41,"is_corresponding":false},{"id":14693,"name":"Sharon L. R. Kardia","orcid":"0000-0002-9853-3379","position":42,"is_corresponding":false},{"id":25114,"name":"Jean-Nicolas Volff","orcid":null,"position":43,"is_corresponding":false},{"id":59323,"name":"Simon G. Gregory","orcid":"0000-0002-7805-1743","position":44,"is_corresponding":false},{"id":6273,"name":"Michael C. Zody","orcid":"0000-0001-6594-7199","position":45,"is_corresponding":false},{"id":41916,"name":"Broad Institute Sequencing Platform and Whole Genome Assembly Team","orcid":null,"position":47,"is_corresponding":false},{"id":1745,"name":"Irene Newsham","orcid":"0000-0003-4913-298X","position":49,"is_corresponding":false},{"id":25116,"name":"Daniel Kahn","orcid":null,"position":50,"is_corresponding":false},{"id":25117,"name":"Marc Robinson-Rechavi","orcid":null,"position":51,"is_corresponding":false},{"id":25118,"name":"Vincent Laudet","orcid":"0000-0003-4022-4175","position":52,"is_corresponding":false},{"id":25119,"name":"Vincent Schachter","orcid":null,"position":53,"is_corresponding":false},{"id":25120,"name":"Francis Quétier","orcid":null,"position":54,"is_corresponding":false},{"id":25121,"name":"William Saurin","orcid":null,"position":55,"is_corresponding":false},{"id":25122,"name":"Claude Scarpelli","orcid":"0000-0002-2458-9775","position":56,"is_corresponding":false},{"id":18887,"name":"Matthew E. Hurles","orcid":"0000-0002-2333-7015","position":58,"is_corresponding":false},{"id":25124,"name":"Jean Weissenbach","orcid":"0000-0001-6564-0840","position":59,"is_corresponding":false},{"id":25125,"name":"Hugues Roest Crollius","orcid":"0000-0002-8209-173X","position":60,"is_corresponding":false},{"id":25126,"name":"Jean‐Marc Aury","orcid":"0000-0003-1718-3010","position":61,"is_corresponding":false},{"id":25127,"name":"Jean‐Louis Petit","orcid":"0000-0002-8566-0571","position":62,"is_corresponding":false},{"id":25128,"name":"Catherine Ozouf‐Costaz","orcid":null,"position":63,"is_corresponding":false},{"id":20075,"name":"David B. Jaffe","orcid":"0000-0001-8739-568X","position":64,"is_corresponding":false},{"id":25129,"name":"Corinne Da Silva","orcid":"0000-0002-7618-7831","position":65,"is_corresponding":false},{"id":25130,"name":"Michael A. Levy","orcid":"0000-0002-4188-2527","position":66,"is_corresponding":false},{"id":25131,"name":"Michaël Katinka","orcid":null,"position":67,"is_corresponding":false},{"id":25132,"name":"Genı́s Parra","orcid":"0000-0002-0575-2936","position":68,"is_corresponding":false},{"id":25133,"name":"Charles E. Chapple","orcid":null,"position":69,"is_corresponding":false},{"id":25134,"name":"Kevin McKernan","orcid":"0000-0002-3908-1122","position":70,"is_corresponding":false},{"id":25135,"name":"Jean‐Nicolas Volff","orcid":"0000-0003-3406-892X","position":71,"is_corresponding":false},{"id":25136,"name":"Jill P. Mesirov","orcid":"0000-0002-9755-2818","position":72,"is_corresponding":false},{"id":18085,"name":"Kerstin Lindblad‐Toh","orcid":"0000-0001-8338-0253","position":73,"is_corresponding":false},{"id":19776,"name":"Bruce W. Birren","orcid":"0000-0001-6971-945X","position":74,"is_corresponding":false},{"id":25137,"name":"Marc Robinson‐Rechavi","orcid":"0000-0002-3437-3329","position":75,"is_corresponding":false},{"id":25138,"name":"Vincent Schächter","orcid":null,"position":76,"is_corresponding":false},{"id":25139,"name":"Françis Quétier","orcid":"0000-0003-4388-9287","position":77,"is_corresponding":false},{"id":25072,"name":"Olivier Jaillon","orcid":"0000-0002-7237-9596","position":0,"is_corresponding":true}],"reference_count":57,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":32.5,"fair_a":55.0,"fair_i":12.5,"fair_r":25.0,"fair_zscore":-1.2629,"fair_rationale":{"fair_score":31.25,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":32.5,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"datacite=0, pmcid=False, pmid=False","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper provides accession numbers for the assembly and cDNAs (CAAE01000000, CR631133–CR735083) and a download URL, but no formal rich, machine-readable metadata (e.g., structured schema, controlled vocabularies) is described."}]},"A":{"name":"Accessible","score":55.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper states that the data is 'freely available' and provides a download site (http://www.genoscope.org/tetraodon) and accession numbers, but does not specify any authentication, licensing terms, or a persistent identifier like a DOI for the dataset itself."}]},"I":{"name":"Interoperable","score":12.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper uses standard file formats (FASTA, GenBank) and some standard IDs (InterPro, GO), but does not mention the use of standard machine-readable vocabularies or unique identifiers for datasets beyond accession numbers."}]},"R":{"name":"Reusable","score":25.0,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.333,"signal":null,"rationale":"The paper includes a data-availability statement and provides accession numbers and a URL, but lacks an explicit license for reuse and does not describe the computational environment or code needed to reproduce analyses."}]}},"suggestions":["Include a formal machine-readable metadata file (e.g., ISA-Tab or JSON-LD) describing the genome assembly and annotation.","Register the dataset in a public repository with a persistent identifier (e.g., DOI) and specify a clear license (e.g., CC0 or CC-BY).","Describe the bioinformatics software, parameters, and versioning applied in a 'Code availability' section to enhance reproducibility.","Use standard ontology terms (e.g., EDAM for bioinformatics operations, MIAME for sequence data) in the metadata.","Provide a detailed data dictionary and schema for the annotation files to improve interoperability."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"unpaywall_pdf"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"unpaywall_pdf","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:30:42.288500Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}