Titre
Comprehensive sampling of gene expression in human cell lines with massively parallel signature sequencing.
Type
article
Institution
UNIL/CHUV/Unisanté + institutions partenaires
Périodique
Proceedings of the National Academy of Sciences of the United States of America
Auteur(s)
Jongeneel, C.V.
Auteure/Auteur
Iseli, C.
Auteure/Auteur
Stevenson, B.J.
Auteure/Auteur
Riggins, G.J.
Auteure/Auteur
Lal, A.
Auteure/Auteur
Mackay, A.
Auteure/Auteur
Harris, R.A.
Auteure/Auteur
O'Hare, M.J.
Auteure/Auteur
Neville, A.M.
Auteure/Auteur
Simpson, A.J.
Auteure/Auteur
Strausberg, R.L.
Auteure/Auteur
Liens vers les personnes
Liens vers les unités
ISSN
0027-8424[print], 0027-8424[linking]
Statut éditorial
Publié
Date de publication
2003
Volume
100
Numéro
8
Première page
4702
Dernière page/numéro d’article
4705
Langue
anglais
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't ; Research Support, U.S. Gov't, P.H.S.
Publication Status: ppublish
Publication Status: ppublish
Résumé
Whereas information is rapidly accumulating about the structure and position of genes encoded in the human genome, less is known about the complexity and relative abundance of their expression in individual human cells and tissues. Here, we describe the characteristics of the transcriptomes of two cultured cell lines, HB4a (normal breast epithelium) and HCT-116 (colon adenocarcinoma), using massively parallel signature sequencing (MPSS). We generated in excess of 10(7) short signature sequences per cell line, thus providing a comprehensive snapshot of gene expression, within the technical limitations of the method. The number of genes expressed at one copy per cell or more in either of the lines was estimated to be between 10,000 and 15,000. The vast majority of the transcripts found in these cells can be mapped to known genes and their polyadenylation variants. Among the genes that could be identified from their signature sequences, approximately 8,500 were expressed by both cell lines, whereas 6,000 showed cellular specificity. Taking into account sequence tags that map uniquely to the genome but not to known transcripts, overall the data are consistent with an upper limit of 17,000 for the total number of genes expressed at more than one copy per cell in one or both of the two cell lines examined.
PID Serval
serval:BIB_73AE957EC3A6
PMID
Open Access
Oui
Date de création
2008-01-24T14:39:58.793Z
Date de création dans IRIS
2025-05-21T02:04:45Z