中国神经再生研究(英文版) ›› 2025, Vol. 20 ›› Issue (9): 2697-2705.doi: 10.4103/NRR.NRR-D-24-00532

• 原著:神经损伤修复保护与再生 • 上一篇    下一篇

一种用于聚类细胞数据以改进分类的新方法

  

  • 出版日期:2025-09-15 发布日期:2024-12-30

A novel method for clustering cellular data to improve classification

Diek W. Wheeler, Giorgio A. Ascoli*   

  1. Center for Neural Informatics, Structures, & Plasticity, Krasnow Institute for Advanced Study; and Bioengineering Department, Volgenau School of Engineering; George Mason University, Fairfax, VA, USA
  • Online:2025-09-15 Published:2024-12-30
  • Contact: Giorgio A. Ascoli, PhD, ascoli@gmu.edu.
  • Supported by:
    This work was supported in part by NIH grants R01NS39600, U01MH114829, and RF1MH128693 (to GAA).

摘要: https://orcid.org/0000-0001-8635-0033 (Diek W. Wheeler); https://orcid.org/0000-0002-0964-676X (Giorgio A. Ascoli) 


Abstract: Many fields, such as neuroscience, are experiencing the vast proliferation of cellular data, underscoring the need for organizing and interpreting large datasets. A popular approach partitions data into manageable subsets via hierarchical clustering, but objective methods to determine the appropriate classification granularity are missing. We recently introduced a technique to systematically identify when to stop subdividing clusters based on the fundamental principle that cells must differ more between than within clusters. Here we present the corresponding protocol to classify cellular datasets by combining datadriven unsupervised hierarchical clustering with statistical testing. These general-purpose functions are applicable to any cellular dataset that can be organized as two-dimensional matrices of numerical values, including molecular, physiological, and anatomical datasets. We demonstrate the protocol using cellular data from the Janelia MouseLight project to characterize morphological aspects of neurons.

Key words: cellular data,  clustering dendrogram,  data classification,  Levene’s one-tailed statistical test,  unsupervised hierarchical clustering