Single Cell RNA Sequencing of Human Milk-Derived Cells Reveals Sub-Populations of Mammary Epithelial Cells with Molecular Signatures of Progenitor and Mature States: a Novel, Non-invasive Framework for Investigating Human Lactation Physiology.

Department of Pediatrics, Section of Nutrition, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. jayne.martincarli@cuanschutz.edu. Department of Pediatrics, Section of Hematology, Oncology, and Bone Marrow Transplant, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. Department of Cell Biology, Oklahoma University Health Sciences Center, Oklahoma City, OK, USA. Department of Medicine, Division of Endocrinology, Metabolism and Diabetes, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. Department of Pediatrics, Section of Neonatology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. Department of Physiology, Oklahoma University Health Sciences Center, Oklahoma City, OK, USA. Department of Obstetrics & Gynecology, Division of Maternal-Fetal Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. College of Nursing, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. Department of Obstetrics & Gynecology, Division of Reproductive Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Journal of mammary gland biology and neoplasia. 2020;(4):367-387

Abstract

Cells in human milk are an untapped source, as potential "liquid breast biopsies", of material for investigating lactation physiology in a non-invasive manner. We used single cell RNA sequencing (scRNA-seq) to identify milk-derived mammary epithelial cells (MECs) and their transcriptional signatures in women with diet-controlled gestational diabetes (GDM) with normal lactation. Methodology is described for coordinating milk collections with single cell capture and library preparation via cryopreservation, in addition to scRNA-seq data processing and analyses of MEC transcriptional signatures. We comprehensively characterized 3740 cells from milk samples from two mothers at two weeks postpartum. Most cells (>90%) were luminal MECs (luMECs) expressing lactalbumin alpha and casein beta and positive for keratin 8 and keratin 18. Few cells were keratin 14+ basal MECs and a small immune cell population was present (<10%). Analysis of differential gene expression among clusters identified six potentially distinct luMEC subpopulation signatures, suggesting the potential for subtle functional differences among luMECs, and included one cluster that was positive for both progenitor markers and mature milk transcripts. No expression of pluripotency markers POU class 5 homeobox 1 (POU5F1, encoding OCT4) SRY-box transcription factor 2 (SOX2) or nanog homeobox (NANOG), was observed. These observations were supported by flow cytometric analysis of MECs from mature milk samples from three women with diet-controlled GDM (2-8 mo postpartum), indicating a negligible basal/stem cell population (epithelial cell adhesion molecule (EPCAM)-/integrin subunit alpha 6 (CD49f)+, 0.07%) and a small progenitor population (EPCAM+/CD49f+, 1.1%). We provide a computational framework for others and future studies, as well as report the first milk-derived cells to be analyzed by scRNA-seq. We discuss the clinical potential and current limitations of using milk-derived cells as material for characterizing human mammary physiology.