{"doi":"10.1101/653907","title":"Deep learning does not outperform classical machine learning for cell-type annotation","abstract":"Deep learning has revolutionized image analysis and natural language processing with remarkable accuracies in prediction tasks, such as image labeling and semantic segmentation or named-entity recognition and semantic role labeling. Specifically, the combination of algorithmic and hardware advances with the appearance of large and well-labeled datasets has led up to seminal contributions in these fields. The emergence of large amounts of data from single-cell RNA-seq and the recent global effort to chart all cell types in the Human Cell Atlas has attracted an interest in deep-learning applications. However, all current approaches are unsupervised, i.e. , learning of latent spaces without using any cell labels, even though supervised learning approaches are often more powerful in feature learning and the most popular approach in the current AI revolution by far. Here, we ask why this is the case. In particular we ask whether supervised deep learning can be used for cell annotation, i.e. to predict cell-type labels from single-cell gene expression profiles. After evaluating 10 classification methods across 14 datasets, we notably find that deep learning does not outperform classical machine-learning methods in the task. Thus, cell-type prediction based on gene-signature derived cell-type labels is potentially too simplistic a task for complex non-linear methods, which demands better labels of functional single-cell readouts.","journal":null,"year":2019,"id":1838,"datarank":0.48283137373023016,"base_score":3.2188758248682006,"endowment":3.2188758248682006,"self_citation_contribution":0.48283137373023016,"citation_network_contribution":0.0,"self_endowment_contribution":0.48283137373023016,"citer_contribution":0.0,"corpus_percentile":null,"corpus_rank":null,"citation_count":24,"citer_count":0,"citers_with_citation_signal":0,"citers_with_endowment":0,"datacite_reuse_total":0,"is_dataset":false,"is_dataset_confidence":0.0543,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2019-05-31","fair_score":null,"fair_percentile":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":3572,"name":"Maren Büttner","orcid":"0000-0002-6189-3792","position":1,"is_corresponding":false},{"id":20759,"name":"Niry Andriamanga","orcid":null,"position":2,"is_corresponding":false},{"id":42,"name":"Fabian Joachim Theis","orcid":"0000-0002-2419-1943","position":3,"is_corresponding":false},{"id":20758,"name":"Niklas D. Köhler","orcid":"0000-0003-2726-0518","position":0,"is_corresponding":true}],"reference_count":44,"raw_metadata":null,"created_at":"2026-03-01T18:20:47.508186Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":null,"fair_a":null,"fair_i":null,"fair_r":null,"fair_zscore":null,"fair_rationale":null,"fair_model":null,"fair_agent_version":null,"fair_fulltext_source":null,"fair_has_llm":null,"fair_computed_at":null,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}