ArticlePDF AvailableLiterature Review

The urologic epithelial stem cell database (UESC)—A web tool for cell type-specific gene expression and immunohistochemistry images of the prostate and bladder

Authors:

Abstract and Figures

Public databases are crucial for analysis of high-dimensional gene and protein expression data. The Urologic Epithelial Stem Cells (UESC) database http://scgap.systemsbiology.net/ is a public database that contains gene and protein information for the major cell types of the prostate, prostate cancer cell lines, and a cancer cell type isolated from a primary tumor. Similarly, such information is available for urinary bladder cell types. Two major data types were archived in the database, protein abundance localization data from immunohistochemistry images, and transcript abundance data principally from DNA microarray analysis. Data results were organized in modules that were made to operate independently but built upon a core functionality. Gene array data and immunostaining images for human and mouse prostate and bladder were made available for interrogation. Data analysis capabilities include: (1) CD (cluster designation) cell surface protein data. For each cluster designation molecule, a data summary allows easy retrieval of images (at multiple magnifications). (2) Microarray data. Single gene or batch search can be initiated with Affymetrix Probeset ID, Gene Name, or Accession Number together with options of coalescing probesets and/or replicates. Databases are invaluable for biomedical research, and their utility depends on data quality and user friendliness. UESC provides for database queries and tools to examine cell type-specific gene expression (normal vs. cancer), whereas most other databases contain only whole tissue expression datasets. The UESC database provides a valuable tool in the analysis of differential gene expression in prostate cancer genes in cancer progression.
Content may be subject to copyright.
BioMed Central
Page 1 of 7
(page number not for citation purposes)
BMC Urology
Open Access
Database
The urologic epithelial stem cell database (UESC) – a web tool for
cell type-specific gene expression and immunohistochemistry
images of the prostate and bladder
Laura E Pascal*
1,2
, Eric W Deutsch
2
, David S Campbell
2
, Martin Korb
2
,
Lawrence D True
3
and Alvin Y Liu
1,2
Address:
1
Department of Urology, and the Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle WA 98195, USA,
2
Institute for Systems Biology, Seattle WA 98103, USA and
3
Department of Pathology, University of Washington, Seattle WA 98195, USA
Email: Laura E Pascal* - lpascal@systemsbiology.org; Eric W Deutsch - edeutsch@systemsbiology.org;
David S Campbell - dcampbell@systemsbiology.org; Martin Korb - mkorb@systemsbiology.org; Lawrence D True - ltrue@u.washington.edu;
Alvin Y Liu - aliu@u.washington.edu
* Corresponding author
Abstract
Background: Public databases are crucial for analysis of high-dimensional gene and protein
expression data. The Urologic Epithelial Stem Cells (UESC) database http://
scgap.systemsbiology.net/ is a public database that contains gene and protein information for the
major cell types of the prostate, prostate cancer cell lines, and a cancer cell type isolated from a
primary tumor. Similarly, such information is available for urinary bladder cell types.
Description: Two major data types were archived in the database, protein abundance localization
data from immunohistochemistry images, and transcript abundance data principally from DNA
microarray analysis. Data results were organized in modules that were made to operate
independently but built upon a core functionality. Gene array data and immunostaining images for
human and mouse prostate and bladder were made available for interrogation. Data analysis
capabilities include: (1) CD (cluster designation) cell surface protein data. For each cluster
designation molecule, a data summary allows easy retrieval of images (at multiple magnifications).
(2) Microarray data. Single gene or batch search can be initiated with Affymetrix Probeset ID, Gene
Name, or Accession Number together with options of coalescing probesets and/or replicates.
Conclusion: Databases are invaluable for biomedical research, and their utility depends on data
quality and user friendliness. UESC provides for database queries and tools to examine cell type-
specific gene expression (normal vs. cancer), whereas most other databases contain only whole
tissue expression datasets. The UESC database provides a valuable tool in the analysis of differential
gene expression in prostate cancer genes in cancer progression.
Background
Public databases for the storage and retrieval of genomic
and proteomic data have become an integral component
of biomedical research. These databases can aid in the
identification of genes and proteins responsible for dis-
ease and health and defining their function by enabling
Published: 11 December 2007
BMC Urology 2007, 7:19 doi:10.1186/1471-2490-7-19
Received: 22 June 2007
Accepted: 11 December 2007
This article is available from: http://www.biomedcentral.com/1471-2490/7/19
© 2007 Pascal et al; licensee BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0
),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 2 of 7
(page number not for citation purposes)
investigators in diverse research areas and interests with a
range of computer expertise to have ready access to the
stored information through one user interface. Previously,
the Prostate Expression Database (PEDB) established a
centralized archive of gene expression information for
human prostate [1]. This database contains a large cDNA
library of gene sequences obtained for normal/benign,
benign prostatic hyperplasia (BPH), prostatic intraepithe-
lial neoplasia (PIN) and malignant prostate disease states.
The Prostate Gene Database (PGDB) is another prostate
database that stores factual data about genes related to the
human prostate and prostatic diseases supported by liter-
ature references [2]. These genes are grouped under
molecular events of amplification, mutation, gross dele-
tion, methylation, polymorphism, overexpression and
linkage. These two databases provide valuable informa-
tion obtained from whole prostate tissue. The characteri-
zation of tissues based on cell-surface protein expression
[3] allows the possibility of separating cells of interest
from that tissue for gene array analysis and determination
of cell-type specific transcriptomes [4]. Public availability
of cell-type specific data will be an important additional
tool in future studies.
The Stem Cell Genome Anatomy Project (SCGAP) initi-
ated by the National Institute of Diabetes and Digestive
and Kidney Diseases (NIDDK) included seven organ-spe-
cific groups that were funded to form a research consor-
tium. The aims of this consortium were to collectively
develop necessary biological procedures and reagents for
characterization of tissue specific progenitor cells and to
characterize gene expression patterns in these cells using
advanced technologies and bioinformatic techniques. The
official web portal for SCGAP [5] was designed to deliver
an overview of the progress of the consortium's research
efforts and to function as a gateway to the websites of the
consortium participants. As such, the detailed data, proto-
cols and descriptions are accessible from the respective
website of the participating SCGAP projects.
Our group, the urologic epithelial stem cells project,
investigated the molecular basis of the differentiation of
epithelial cells of the human prostate and bladder. We are
interested in studying development and the cancer proc-
ess in the context of interaction between individual cell
types. Expression levels of CD cell surface antigens was
first used to distinguish the constituent cell types of the
prostate, as well as cancer cells from their normal counter-
part [3,6]. The cell CD phenotyping data acquisition
involved immunohistochemistry with ~200 commer-
cially available CD monoclonal antibodies (BD-PharMin-
gen). Magnetic cell sorting (MACS) based on the cell type-
specific CD expression was then used to isolate the follow-
ing prostatic cell types: CD31
+
endothelial cells, CD26
+
luminal secretory and CD104
+
basal cells of the epithe-
lium, and CD49a
+
fibromuscular cells of the interglandu-
lar stroma for transcriptome profiling [4]. In addition, a
CDw338+ (ABCG2) stem cell population was profiled
[7]. These microarray datasets were also deposited in the
UESC database [8]. Here, we will illustrate the utility of
our UESC database, and a future consortium report will
describe in detail the central SCGAP site and its federated
search and data analysis tools.
Construction and Content
The UESC database was based on the Systems Biology
Experiment Analysis Management System (SBEAMS) [9],
a software and database framework for collecting, storing,
and accessing different types of experimental data.
SBEAMS combined a relational database management
system (RDBMS) back-end, a collection of tools to store,
manage, and query experimental information and results,
a web front-end for querying the database and providing
integrated access to remote data sources, and an interface
to other data processing and analysis programs. Since all
data from each part of any experiment were organized in
a modular schema using similar designs, quality control,
analysis, and data integration tasks were greatly simpli-
fied. In SBEAMS, each module was made to operate inde-
pendently but was built upon a core functionality, which
included user authentication and auditing, web interface
tools, result set management, Gene Ontology integration,
centralized BioSequenceSet linking, RDBMS-independ-
ence layer, and others. Support for microarrays, proteom-
ics, molecular interactions, macroarrays, gene expression
localization, protein functional predictions, and
expressed sequence tag (EST) clustering was provided in
the current major modules. The SBEAMS module queries
were automatically piped to Cytoscape [10] for network
visualization and further exploration. Data includes cell
type specific information for human and mouse prostate
and bladder from immunostaining and microarray. Feed-
back options and data availability questions are made
accessible through contact information listed on the web-
site or through a feedback form.
Populating the database
The methods of tissue collection, immunostaining and
expression data used in this database have been published
previously [3,4,11]. Briefly, tissue samples consisted of
both cancer-enriched and cancer-free samples obtained
from over 50 radical prostatectomies or cystectomies
under approval by the University of Washington Institu-
tional Review Board following a standard protocol.
Immunohistochemistry data
The immunohistochemistry data for human and mouse
prostate and bladder were annotated and uploaded in the
database following the data standard, Minimum Informa-
tion Specification For In Situ Hybridization and Immuno-
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 3 of 7
(page number not for citation purposes)
histochemistry Experiments (MISFISHIE) [11]. The files
are systematically named based on the antibody used in
staining, the organism, tissue type, tissue block, and mag-
nification. As an example, the file 'CD44 98-395F HP ba
100.jpg' was stained with anti-CD44, was derived from
human prostate tissue block 98-395F, is human prostate
tissue, and an image of microscopic field of view b within
field a was captured at 100× magnification. Annotation
describes the tissue, distribution of reaction product in the
tissue, distribution of reaction product in the tissue, local-
ization patterns within histologic cell types, and provides
an assessment of the level of expression of the protein for
the immunostaining data. Data is available for ~200 CD
antibodies for human prostate and bladder and ~20 CD
antibodies for mouse prostate and bladder.
Microarray data
Affymetrix array analysis of prostate cancer cell lines
The prostate cell transcriptomes of CD26
+
luminal epithe-
lial, CD104
+
basal epithelial, CD49a
+
stromal fibromus-
cular, plus CD31
+
endothelial, CDw338
+
stem, and side
population (SP) are all available in UESC [4,7]. To date,
prostate cancer transcriptomes include those of lineage-
related cancer cell lines LNCaP, C4-2, and CL1, those of
PC-3 and DU145, plus that of a CD26
+
cancer cell type
sorted from a primary tumor of Gleason 3+3. The bladder
transcriptome data includes 1 replicate each of CD13
+
stromal and CD13
-
stromal cells representing two sub-
domains of the bladder lamina propria [12]. In the data
module, the CEL files are the raw data from the array scan,
the RPT files hold the statistical analysis of chip signals,
the XML files are MAGE-ML descriptions of the experi-
ments, and the Image files are synthetic JPEG images of
the Affymetrix HG-U133_Plus_2 GeneChips.
MPSS analysis of prostate cancer cell lines
Results from MPSS (Massively Parallel Signature Sequenc-
ing) experiments on the LNCaP and C4-2 cell lines are
also available in the UESC database [13]. Data may be
downloaded as an Excel spreadsheet for each cell type,
containing the accession number and experimentally
detected TPM (transcripts per million) for each analyzed
sequence. This data was used to compare MPSS with
Affymetrix arrays in their coverage overlap [13].
Searching the database
CD immunohistochemistry images
The CD immunohistochemistry images may be viewed or
downloaded as ZIP archives. Additionally, a summary of
staining for various species and cell types can be down-
loaded directly as tab-delimited text files. Fig. 1 shows the
data summary for CD138 (syndecan-1). Cell type-specific
expression is scored by staining intensity, and the
uploaded images (at multiple magnifications) of different
tissue sections can be opened for examination. Fig. 2
shows CD138 staining at 200× magnification for human
bladder and prostate.
Transcriptome data
The available array datasets (usually after accepted for
publication) are listed and can be chosen for interroga-
tion.
(1) Single gene search – in which one can enter Affymetrix
Probe Set ID, Gene Name, or Accession Number together
with the options of coalescing probesets and replicates. In
the Affymetrix HG-U133 arrays often times genes would
be represented by multiple probesets, of which not all
would give meaningful results. The hybridization signals
for all probesets of one gene can be combined if the COA-
LESCE PROBESETS tick box is clicked. The tick box COA-
LESCE REPLICATES averages the signal for each of the
biological replicates that make up a sort. The greyscale gra-
dient indicates RMA normalized Affymetrix signal inten-
sity. Signals of 10 or less are represented as white and
signals greater than or equal to 10,000 are represented as
black. Higher Affymetrix signal (more black) indicates
higher levels of gene expression. Fig. 3 shows the analysis
output for CD138 (SDC1). Fig. 3A shows the signal inten-
sities scored by the three probesets for SDC1 of all repli-
cates (n = 5) of four prostate cell types (CD104
+
basal,
CD26
+
luminal, CD31
+
endothelial, CD49a
+
stromal) and
one replicate each of two bladder cell types (CD13
-
stro-
mal and CD13
+
stromal) queried. CD138 expression is
detectable in prostate basal cells, and lowered or undetec-
table in luminal, endothelial and stromal cells. As an illus-
tration of probe variability, the 239256 probeset scored
no expression or Absent Call in basal cells (5/5 replicates),
and other cell types (5/5 replicates). Fig. 3B shows the
analysis summary of CD138 cell-type expression coa-
lesced by replicate, 3C coalesced by probeset, and 3D coa-
lesced by replicate and probeset. This expression data is in
accordance with the pattern of prostatic CD138 expres-
sion scored by immunohistochemistry (Fig. 1). Included
in this summary is the level of CD138 expression in basal
urothelial cells.
(2) Multiple gene (batch) search – in which searches can
be initiated by using "%" as a wildcard character (e.g.,
CD% to list all official gene names with the CD designa-
tion) ["_" is a single character wildcard such that, e.g., A_
brings up AR; A__ brings up A2M, ABO to AXL, A___
brings up A1BG to ASB8, etc.]. Fig. 4 shows the query out-
put for gene names with SOX% (sex determining region Y
box). Other batch search examples would be IL (inter-
leukin), ITG (integrin), TNF (tumor necrosis factor),
ADAM (a disintegrin and metalloprotease domain) genes.
This query feature is not widely available in many other
databases.
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 4 of 7
(page number not for citation purposes)
CD immunohistochemistryFigure 1
CD immunohistochemistry. Shown is the image data summary for CD138 (SDC1). The top table provides annotation data
including the tissue type, distribution of reaction product in the tissue, localization pattern within histologic cell types and an
assessment of the level of protein expression for the immunostaining data. The bottom table provides links to the available
images for each annotated sample. Available immunostaining images and additional data can be retrieved by clicking on the
links.
Antibody Summary
CD138
Alternate Names: heparan sulfate proteoglycan; syndecan-1
Locus Link:
6382
Genome Coordinates:
chr2:20385070-20409385-
Total Assays: 15
Tissue type Cell type
%
Intense
% Equivocal % None # Assays
Urinary Bladder
Cap Cells
40 58 3 4
Intermediate Cells
40 54 6 5
Basal Epithelial Cells
40 54 6 5
Lamina propria - superficial
0 0 100 6
Lamina propria - deep
0 0 100 6
Submucosa
1 1 97 7
Muscularis propria
0 8 92 6
Transitional Cell Carcinoma
010901
Prostate
Atrophic glands
80 20 0 1
Hyperplastic glands
010901
Normal glands
010901
Basal Epithelial Cells
60 22 18 3
Stromal Endothelial Cells
0 0 100 1
Stromal Fibromuscular Cells
5 0 95 1
Stromal Nerve Sheath Cells
90 5 5 2
Stromal Perineural Cells
0 0 100 1
Gleason Pattern 3
010901
Assay Name
Channel Name
Characterizations
Available Images
CD138 02-034A 5 CD138 02-034A 5 - chan 1
8
40x 100x 200x 200x 200x 400x
CD138 02 -047A
1
CD138 02 -047A 1 - chan 1
CD138 99-010E 2 CD138 99-010E 2 - chan 1
10
CD138 03-035A1 CD138 03-035A1 - chan 1
7
40x 100x 200x 200x
CD138 02-054A CD138 02-054A - chan 1 40x 100x 200x 200x 200x 200x
CD138 03-041B2 CD138 03-041B2 - chan 1
2
40x 100x 200x
CD138 03-035A2 CD138 03-035A2 - chan 1
7
100x 200x 40x 100x 200x 200x 200x
CD138 03-043B1 CD138 03-043B1 - chan 1
7
100x 10x 40x 100x 200x 200x
CD138 02-047A 1 CD138 02-047A 1 - chan 1 40x 100x 200x 200x 200x 200x 200x
CD138 03-024A1 CD138 03-024A1 - chan 1
6
40x 100x 200x 200x 400x 200x
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 5 of 7
(page number not for citation purposes)
Utility and Discussion
UESC data types were organized in separate modules to
afford a good balance between flexibility and consistency.
The management system was designed to allow efficient
data access to all levels of users, with both easy web and
scriptable, sophisticated interfaces, and to be reusable so
that a new project may be built on a previous one (e.g.,
kidney and bladder cancer data to prostate cancer data).
CD138 immunostaining imagesFigure 2
CD138 immunostaining images. CD138 (SDC1) immunoreactivity of normal human prostate and bladder (immunoreac-
tion product red-brown; pale blue hematoxylin nuclear counterstain). (A) CD138 staining of human bladder urothelium, assay
name CD138 03-035A1. (B) CD138 staining of human prostate atrophic glands, basal epithelial cells and nerve sheath cells,
assay name CD138 02-007C 5. Original magnification is 200×.
A. B.
Single gene searchFigure 3
Single gene search. The expression of CD138 (SDC1) among the four sorted prostate cell populations (CD104
+
basal epi-
thelial, CD26
+
luminal epithelial, CD31
+
endothelial and CD31
+
stromal fibromuscular) in addition to sorted bladder cell popu-
lations (CD13
-
and CD13
+
bladder lamina propria) is illustrated. The Affymetrix signal intensity levels are represented by the
grey scale. The data can be displayed in full (A), coalesced with respect to biological replicates (B), probesets (C), or both rep-
licate and probeset (D). Present call boxes have no border, Absent calls have a red border, Marginal call is blue (not shown in
this example).
A.
C.
B.
D.
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 6 of 7
(page number not for citation purposes)
The database will therefore continuously expand as more
cell type-specific information becomes available. The
UESC database will be a valuable tool in the analysis of
differential gene expression in prostate cancer genes in
cancer progression.
Conclusion
Strategies for the analysis of the interface between gene
expression and protein information involve a variety of
computational methods that require the storage and
retrieval of large datasets. These databases become per-
force an integral component of biomedical research. The
UESC database is a unique, web-accessible, searchable
compilation of published data concerning the identifica-
Multiple gene searchFigure 4
Multiple gene search. Shown is the query output for gene names containing SOX (sex determining region Y box) among the
four sorted prostate cell populations (CD104
+
basal epithelial, CD26
+
luminal epithelial, CD31
+
endothelial and CD31
+
stromal
fibromuscular) in addition to sorted bladder cell populations (CD13
-
and CD13
+
bladder lamina propria).
BMC Urology 2007, 7:19 http://www.biomedcentral.com/1471-2490/7/19
Page 7 of 7
(page number not for citation purposes)
tion and characterization of genes and proteins in specific
cell types of the urologic organs where cancer is a major
disease. These cell populations retain to a high degree
their CD phenotype as determined by immunostaining in
intact tissue; concordance between gene expression meas-
ured by DNA array and immunohistochemistry was good
and will be published separately. These cell type transcrip-
tomes allow us to pursue many studies which are not pos-
sible with whole tissue transcriptomes.
Availability and requirements
The UESC database is freely accessible at http://scgap.sys
temsbiology.net/. It has been tested to work with Mozilla
Firefox and Internet Explorer.
List of abbreviations used
BPH: Benign prostatic hyperplasia
CD: Cluster designation
EST: Expressed sequence tag
MACS: Magnetic cell sorting
MISFISHIE: Minimum Information Specification For In
Situ Hybridization and Immunohistochemistry Experi-
ments
MPSS: Massively Parallel Signature Sequencing
NIDDK: National Institute of Diabetes and Digestive and
Kidney Diseases
PEDB: Prostate Expression Database
PGDB: Prostate Gene Database
PIN: Prostatic intraepithelial neoplasia
RDBMS: Relational database management system
RMA: Robust multi-array average
SBEAMS: Systems Biology Experiment Analysis Manage-
ment System
SCGAP: Stem Cell Genome Anatomy Project
TPM: Transcripts per million
UESC: Urologic Epithelial Stem Cell
Competing interests
The author(s) declare that they have no competing inter-
ests.
Authors' contributions
LEP drafted the manuscript with input from DSC, EWD
and AYL. Database design and programming was per-
formed by EWD, DSC and MK with input from AYL, LEP
and LDT. LDT provided immunohistochemistry data
annotation and staining summaries. All authors have read
and approved the final manuscript.
Acknowledgements
This work was supported by grant 1U01 DK63630 from NIDDK. Addi-
tional funding came from grants CA85859, CA98699 and CA111244 from
NCI. We thank Tracy Sherertz for her contribution to the database con-
struction.
References
1. Hawkins V, Doll D, Bumgarner R, Smith T, Abajian C, Hood L, Nelson
PS: PEDB: the Prostate Expression Database. Nucleic Acids Res
1999, 27(1):204-208.
2. Li LC, Zhao H, Shiina H, Kane CJ, Dahiya R: PGDB: a curated and
integrated database of genes related to the prostate. Nucleic
Acids Res 2003, 31(1):291-293.
3. Liu AY, True LD: Characterization of prostate cell types by CD
cell surface molecules. Am J Pathol 2002, 160(1):37-43.
4. Oudes AJ, Campbell DS, Sorensen CM, Walashek LS, True LD, Liu
AY: Transcriptomes of human prostate cells. BMC Genomics
2006, 7:92.
5. The Stem Cell Genome Anatomy Project [http://
www.scgap.org/]
6. Liu AY, Roudier MP, True LD: Heterogeneity in primary and
metastatic prostate cancer as defined by cell surface CD pro-
file. Am J Pathol 2004, 165(5):1543-1556.
7. Pascal LE, Oudes AJ, Petersen TW, Goo YA, Walashek LS, True LD,
Liu AY: Molecular and cellular characterization of ABCG2 in
the prostate. BMC Urol 2007, 7(1):6.
8. SCGAP Urologic Epithelial Stem Cells Project [http://
scgap.systemsbiology.net/]
9. Marzolf B, Deutsch EW, Moss P, Campbell D, Johnson MH, Galitski
T: SBEAMS-Microarray: database software supporting
genomic expression analyses for systems biology. BMC Bioin-
formatics 2006, 7:286.
10. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin
N, Schwikowski B, Ideker T: Cytoscape: a software environment
for integrated models of biomolecular interaction networks.
Genome Res 2003, 13(11):2498-2504.
11. Deutsch EW, Ball CA, Berman JJ, Bova GS, Brazma A, Bumgarner RE,
Campbell D, Causton HC, Christiansen JH, Daian F, Dauga D, David-
son D, Gimenez G, Goo YA, Grimmond S, Henrich T, Gerrmann BG,
Johnson MH, Korb M, Mills JC, Oudes AJ, Parkinson HE, Pascal LE,
Pollet N, Quackenbush J, Ramialison M, Ringwald M, Salgado D, San-
sone SA, Sherlock G, Christian J Stoeckert J, Swedlow J, Taylor RC,
Walashek L, Warford A, Wilkinson DG, Zhou Y, Zon LI, Liu AY, True
LD: Minimum Information Specification For In Situ Hybridi-
zation and Immunohistochemistry Experiments (MIS-
FISHIE). Nature Biotechnology in press.
12. Goo YA, Goodlett DR, Pascal LE, Worthington KD, Vessella RL, True
LD, Liu AY: Stromal mesenchyme cell genes of the human
prostate and bladder. BMC Urol 2005, 5:17.
13. Oudes AJ, Roach JC, Walashek LS, Eichner LJ, True LD, Vessella RL,
Liu AY: Application of Affymetrix array and Massively Parallel
Signature Sequencing for identification of genes involved in
prostate cancer progression. BMC Cancer 2005, 5:86.
Pre-publication history
The pre-publication history for this paper can be accessed
here:
http://www.biomedcentral.com/1471-2490/7/19/prepub
... The level of B2M was diagnostic because stem cells express 10-fold lower than differentiated cells. This fold difference was calculated from microarray probeset intensity signals for B2M in stem cell types (B2M lo ) vs. differentiated cell types (B2M hi ) in transcriptome datasets [21]. Stem cell types included ES cell line H1 (WA01) [22], embryonal carcinoma cell line NCCIT, reprogrammed prostate cancer-associated stromal cells [10]. ...
... The values are the average after clicking coalesce replicates and probesets. Dataset query using this public database is described in ref.21. The red line shows the low expression of STC1 in cancer cell lines and xenografts. ...
Article
Full-text available
Background Loss of prostate cancer differentiation or de-differentiation leads to an untreatable disease. Patient survival would benefit if this can be prevented or reversed. Cancer de-differentiation transforms luminal-like (differentiated) adenocarcinoma into less luminal-like and more stem-like (undifferentiated) small cell carcinoma through a sequential activation of stem cell transcription factors (scTF) POU5F1, LIN28A, SOX2 and NANOG. Like stem cells, prostate small cell carcinoma express this quartet of scTF as well as a 10-fold lower level of β2-microglobulin (B2M) than that of differentiated cell types. In organ development, prostate stromal mesenchyme cells mediate epithelial differentiation in part by secreted factors. Methods The identified prostate stromal-specific factor proenkephalin (PENK) was cloned, and transfected into scTF⁺B2Mlo stem-like small cell carcinoma LuCaP 145.1, reprogrammed luminal-like scTF⁻B2Mhi LNCaP, and luminal-like scTF⁻B2Mhi adenocarcinoma LuCaP 70CR. The expression of scTF, B2M and anterior gradient 2 (AGR2) was analyzed in the transfected cells. Results PENK caused down-regulation of scTF and up-regulation of B2M to indicate differentiation. When transfected into reprogrammed LNCaP, PENK reversed the reprogramming by down-regulation of scTF with attendant changes in cell appearance and colony morphology. When transfected into LuCaP 70CR, PENK up-regulated the expression of adenocarcinoma antigen AGR2, a marker associated with cancer cell differentiation. Conclusions Prostate cancer cells appear to retain their responsiveness to stromal PENK signaling. PENK can induce differentiation to counter de-differentiation caused by scTF activation. The many mutations and aneuploidy characteristic of cancer cells appear not to hinder these two processes. Loss of prostate cancer differentiation is like reprogramming from luminal-like to stem-like.
... Another approach is to develop new tests based on other tumor markers. We used comparative analysis between the transcriptomes of isolated CD26 + cancer cells and CD26 + normal luminal cells to identify genes up-regulated in cancer (3,4). Genes that were found to be overexpressed by ≥8-fold, and to encode secreted or extracellular proteins were selected as biomarker candidates for assay development. ...
... While no single cell line produces all the proteins found in CP (3), a panel of available cell lines and xenografts could produce most of these proteins. For example, AGR2 is made by CL1, CRISP3 by LuCaP 35, CEACAM5, BCMP11 by LuCaP 49 [results from dataset queries (4)]. In this way, we are not limited by the availability of ample material for immunization. ...
Article
Full-text available
Background: Prostate cancer (CP) cells differ from their normal counterpart in gene expression. Genes encoding secreted or extracellular proteins with increased expression in CP may serve as potential biomarkers. For their detection and quantification, assays based on monoclonal antibodies are best suited for development in the clinical setting. One approach to obtain antibodies is to use recombinant proteins as immunogen. However, the synthesis of recombinant protein for each identified candidate is time-consuming and expensive. It is also not practical to generate high quality antibodies to all identified candidates individually. Furthermore, non-native forms (e.g., recombinant) of proteins may not always lead to useful antibodies. Our approach was to purify a subset of proteins from CP tissue specimens for use as immunogen. Methods: In the present investigation, ten cancer specimens obtained from cases scored Gleason 3+3, 3+4 and 4+3 were digested by collagenase to single cells in serum-free tissue culture media. Cells were pelleted after collagenase digestion, and the cell-free supernatant from each specimen were pooled and used for isolation of proteins in the 10-30 kDa molecular weight range using a combination of sonication, dialysis and Amicon ultrafiltration. Western blotting and mass spectrometry (MS) proteomics were performed to identify the proteins in the selected size fraction. Results: The presence of cancer-specific anterior gradient 2 (AGR2) and absence of prostate-specific antigen (PSA)/KLK3 were confirmed by Western blotting. Proteomics also detected AGR2 among many other proteins, some outside the selected molecular weight range, as well. Conclusions: Using this approach, the potentially harmful (to the mouse host) exogenously added collagenase was removed as well as other abundant prostatic proteins like ACPP/PAP and AZGP1 to preclude the generation of antibodies against these species. The paper presents an optimized scheme for convenient and rapid isolation of native proteins in any desired size range with minor modifications.
... Bottom: the histogram displays DNA microarray signal intensity values of AGR2 in luminal cells, G3, and G4 cancer cells. The values (y-axis) were retrieved from the Affymetrix microarray datasets archived in our SCGAP Urologic Epithelial Stem Cells Project (UESC) (28). They represent the average after clicking coalesce replicates and probe sets. ...
Article
Full-text available
Multiple cancer cell types are found in prostate tumors. They are either luminal-like adenocarcinoma or less luminal-like and more stem-like non-adenocarcinoma and small cell carcinoma. These types are lineage related through differentiation. Loss of cancer differentiation from luminal-like to stem-like is mediated by the activation of stem cell transcription factors (scTF) such as LIN28A, NANOG, POU5F1 and SOX2. scTF expression leads to down-regulation of β2-microglobulin (B2M). Thus, cancer cells can change from the scT F ˜ B 2 M hi phenotype of differentiated to that of scT F ˙ B 2 M lo of dedifferentiated in the disease course. In development, epithelial cell differentiation is induced by stromal signaling and cell contact. One of the stromal factors specific to prostate encodes proenkephalin (PENK). PENK can down-regulate scTF and up-regulate B2M in stem-like small cell carcinoma LuCaP 145.1 cells indicative of exit from the stem state and differentiation. In fact, prostate cancer cells can be made to undergo dedifferentiation or reprogramming by scTF transfection and then to differentiate by PENK transfection. Therapies need to be designed for treating the different cancer cell types. Extracellular anterior gradient 2 (eAGR2) is an adenocarcinoma antigen associated with cancer differentiation that can be targeted by antibodies to lyse tumor cells with immune system components. eAGR2 is specific to cancer as normal cells express only the intracellular form (iAGR2). For AGR2-negative stem-like cancer cells, factors like PENK that can target scTF could be effective in differentiation therapy.
... The histogram display was generated from a query of the transcriptome datasets archived in our public database (http://scgap.systemsbiology. net/) (Pascal et al., 2007a). Reprogramming of B2M hi cancer-associated stromal cells by scTF lentiviral vectors produced B2M lo iPS cells, whose gene expression was virtually identical to that of ES cells . ...
Chapter
Pluripotent stem cells offer an array of possibilities for organ-specific regenerative therapy. Germline cell-derived pluripotent stem cells (GPSCs) have been successfully generated from mouse spermatogonial stem cells in long-term culture. Apart from phenotypic similarities, GPSCs share molecular features as well as pluripotency characteristics, such as capacity to form teratoma and generate chimera upon injection into mouse blastocysts, with mouse embryonic stem cells. GPSCs are responsive to differentiative cues and can give rise to cells of the three germ layers. Various functional cell types have been generated from GPSCs in vitro, including hepatocytes. GPSC-derived cells also show efficient homing and engraftment in vivo. Interestingly, given the appropriate stimuli, spermatogonial stem cell (SSCs) can also undergo transdifferentiation when transplanted in a nontesticular niche. Thus, SSCs show great potentiality in the field of regenerative medicine, not only for restoring fertility in males but also because of their high plasticity, hence offering a valid stem cell for cell therapy of a multitude of diseases. This chapter provides an update on this issue, with particular focus on hepatic cells generated from pluripotent stem cells of spermatogonial origin.
... Cell-type transcriptome datasets archived in our public UESC database (http://scgap.systemsbiology.net/) were queried as described in ref. [21]. Probeset signal intensity values were retrieved and displayed on a gray scale. ...
Article
Full-text available
Background: Prostate cancer displays different morphologies which, in turn, affect patient outcome. This fact prompted questions about the lineage relationship between differentiated, more treatable prostate adenocarcinoma and poorly differentiated, less treatable non-adenocarcinoma including small cell carcinoma, and the molecular mechanism underlying prostate cancer differentiation. Methods: Newly available non-adenocarcinoma/small cell carcinoma PDX LuCaP lines were analyzed for expression of stem cell transcription factors (scTF) LIN28A, NANOG, POU5F1, SOX2, which are responsible for reprogramming or de-differentiation. cDNA of these genes were cloned from small cell carcinoma LuCaP 145.1 into expression vectors to determine if they could function in reprogramming. Results: Expression of scTF was detected in small cell carcinoma LuCaP 93, 145.1, 145.2, and non-adenocarcinoma LuCaP 173.1, 173.2A. Transfection of scTF from LuCaP 145.1 altered the gene expression of prostate non-small cell carcinoma cells, as well as fibroblasts. The resultant cells grew in stem-like colonies. Of note was a 10-fold lower expression of B2M in the transfected cells. Low B2M was also characteristic of LuCaP 145.1. Conversely, B2M was increased when stem cells were induced to differentiate. Conclusions: This work suggested a pathway in the emergence of non-adenocarcinoma/small cell carcinoma from adenocarcinoma through activation of scTF genes that produced cancer de-differentiation.
... Another marker, CD44 is one of the most prominent stem cell markers. CD44+ cells are located in the basal layer of the normal urothelium as well as in the UCC [7]. CD44 is a cell surface molecule that has been related to multiple functions including cell differentiation, cell proliferation, cell migration and angiogenesis. ...
Article
Full-text available
The rapid development of the cancer stem cells (CSC) field, together with powerful genome-wide screening techniques, have provided the basis for the development of future alternative and reliable therapies aimed at targeting tumor-initiating cell populations. Urothelial bladder cancer stem cells (BCSCs) that were identified for the first time in 2009 are heterogenous and originate from multiple cell types; including urothelial stem cells and differentiated cell types—basal, intermediate stratum and umbrella cells Some studies hypothesize that BCSCs do not necessarily arise from normal stem cells but might derive from differentiated progenies following mutational insults and acquisition of tumorigenic properties. Conversely, there is data that normal bladder tissues can generate CSCs through mutations. Prognostic risk stratification by identification of predictive markers is of major importance in the management of urothelial cell carcinoma (UCC) patients. Several stem cell markers have been linked to recurrence or progression. The CD44v8-10 to standard CD44-ratio (total ratio of all CD44 alternative splicing isoforms) in urothelial cancer has been shown to be closely associated with tumor progression and aggressiveness. ALDH1, has also been reported to be associated with BCSCs and a worse prognosis in a large number of studies. UCC include low-grade and high-grade non-muscle invasive bladder cancer (NMIBC) and high-grade muscle invasive bladder cancer (MIBC). Important genetic defects characterize the distinct pathways in each one of the stages and probably grades. As an example, amplification of chromosome 6p22 is one of the most frequent changes seen in MIBC and might act as an early event in tumor progression. Interestingly, among NMIBC there is a much higher rate of amplification in high-grade NMIBC compared to low grade NMIBC. CDKAL1, E2F3 and SOX4 are highly expressed in patients with the chromosomal 6p22 amplification aside from other six well known genes (ID4, MBOAT1, LINC00340, PRL, and HDGFL1). Based on that, SOX4, E2F3 or 6q22.3 amplifications might represent potential targets in this tumor type. Focusing more in SOX4, it seems to exert its critical regulatory functions upstream of the Snail, Zeb, and Twist family of transcriptional inducers of EMT (epithelial–mesenchymal transition), but without directly affecting their expression as seen in several cell lines of the Cancer Cell Line Encyclopedia (CCLE) project. SOX4 gene expression correlates with advanced cancer stages and poor survival rate in bladder cancer, supporting a potential role as a regulator of the bladder CSC properties. SOX4 might serve as a biomarker of the aggressive phenotype, also underlying progression from NMIBC to MIBC. The amplicon in chromosome 6 contains SOX4 and E2F3 and is frequently found amplified in bladder cancer. These genes/amplicons might be a potential target for therapy. As an existing hypothesis is that chromatin deregulation through enhancers or super-enhancers might be the underlying mechanism responsible of this deregulation, a potential way to target these transcription factors could be through epigenetic modifiers.
Chapter
This research concerns the biology of prostate cancer to understand the mechanisms underlying prostate cancer dedifferentiation. Dedifferentiation leads to nonadenocarcinoma and small cell carcinoma. Compared to well-differentiated adenocarcinoma, poorly differentiated small cell carcinoma is highly aggressive and lethal, associated with advanced disease, particularly after treatment failure. The presence of prostate cancer-specific TMPRSS2-ERG fusion in both adenocarcinoma and small cell carcinoma of the same tumor cases suggests a direct lineage, which we have shown experimentally by reprogramming of adenocarcinoma cells. Small cell carcinoma is stem-like and expresses stem cell transcription factors (scTFs). More importantly, like stem cells, small cell carcinoma can be induced to differentiate with stromal factors such as PENK. This research will delineate how stromal signaling can reverse or prevent prostate cancer dedifferentiation. Our hypothesis also postulates that all cancer undergo dedifferentiation to a stem-like state characterized by scTF + B2Mlo, which is represented by small cell carcinoma. That small cell carcinoma of many different tumor types is indistinguishable from each other and lends support to this view of cancer dedifferentiation/reprogramming. Exome sequencing data revealed no single thematic pattern for small cell carcinoma such as a particularly high number of DNA mutations. This is similar to what was found by exome sequencing analysis of prostate LuCaP small cell carcinoma versus LuCaP adenocarcinoma. It means that the prostate small cell carcinoma phenotype is not because of a specific accumulation of genomic mutations over the disease course. Our understanding of cancer dedifferentiation and therapy based on cancer differentiation induced by stromal factors could lead to a better management of prostate cancer.
Article
Full-text available
Bladder cancer (BC) is a complex and highly heterogeneous stem cell disease associated with high morbidity and mortality rates if it is not treated properly. Early diagnosis with personalized therapy and regular follow-up are the keys to a successful outcome. Cancer stem cells (CSCs) are the leading power behind tumor growth, with the ability of self-renewal, metastasis, and resistance to conventional chemotherapy. The fast-developing CSC field with robust genome-wide screening methods has found a platform for establishing more reliable therapies to target tumor-initiating cell populations. However, the high heterogeneity of the CSCs in BC disease remains a large issue. Therefore, in the present review, we discuss the various types of bladder CSC heterogeneity, important regulatory pathways, roles in tumor progression and tumorigenesis, and the experimental culture models. Finally, we describe the current stem cell-based therapies for BC disease.
Article
Full-text available
The Prostate Expression Database (PEDB) is a curated relational database and suite of analysis tools designed for the study of prostate gene expression in normal and disease states. Expressed Sequence Tags (ESTs) and full-length cDNA sequences derived from more than 40 human prostate cDNA libraries are maintained and represent a wide spectrum of normal and pathological conditions. Detailed library information including tissue source, library construction methods, sequence diversity and abundance are available in a library archive. Prostate ESTs are assembled into distinct species groups using the multiple alignment program CAP2 and are annotated with information from the GenBank, dbEST and Unigene public sequence databases. Annotated sequences in PEDB are searched using the BLAST algorithm. The differential expression of each EST species can be viewed across all libraries using a Virtual Expression Analysis Tool (VEAT), a graphical user interface written in Java for intra- and inter-library species comparisons. PEDB may be accessed via the World Wide Web at http://www.mbt.washington.edu/PEDB/
Article
Full-text available
Identification and characterization of the prostate stem cell is important for understanding normal prostate development and carcinogenesis. The flow cytometry-based side population (SP) technique has been developed to isolate putative adult stem cells in several human tissue types including the prostate. This phenotype is mainly mediated by the ATP-binding cassette membrane transporter ABCG2. Immunolocalization of ABCG2 was performed on normal prostate tissue obtained from radical prostatectomies. Normal human prostate SP cells and ABCG2+ cells were isolated and gene expression was determined with DNA array analysis and RT-PCR. Endothelial cells were removed by pre-sorting with CD31. ABCG2 positive cells were localized to the prostate basal epithelium and endothelium. ABCG2+ cells in the basal epithelium constituted less than 1% of the total basal cell population. SP cells constituted 0.5-3% of the total epithelial fraction. The SP transcriptome was essentially the same as ABCG2+ and both populations expressed genes indicative of a stem cell phenotype, however, the cells also expressed many genes in common with endothelial cells. These results provide gene expression profiles for the prostate SP and ABCG2+ cells that will be critical for studying normal development and carcinogenesis, in particular as related to the cancer stem cell concept.
Article
Full-text available
A set of monoclonal antibodies raised against lymphocyte cell surface molecules, the cluster designation (CD) antigens, was used to distinguish the constituent cell types of the prostate. The luminal secretory epithelial, basal epithelial, fibromuscular stromal, nerve sheath, and endothelial cells express distinctive complements of cell surface molecules that were identified by immunohistochemistry using 152 commercially available antibodies. Many of the CD antibodies stained lymphocyte populations in the prostate. These lymphocyte populations were grouped into abundance classes of rare, moderate, and high. Some of these molecules are expressed by multiple cell types, both parenchymal and lymphoid; others are expressed by only one cell type. Distinctive patterns of CD expression, which are most similar to the expression pattern of prostate luminal cells, also characterize a small series of Gleason score 6 prostate cancers. The cell-type specificity of CD molecules increases the prospect of isolating specific cell populations, using such techniques as laser capture microdissection and flow cytometry, for cell-specific molecular studies.
Article
Full-text available
The Prostate Gene Database (PGDB: http://www.ucsf.edu/pgdb) is a curated and integrated database of genes or genomic loci related to the human prostate and prostatic diseases. Currently, PGDB covers genes involved in a number of molecular and genetic events of the prostate including gene amplification, mutation, gross deletion, methylation, polymorphism, linkage and over-expression, as published in the literature. Genes that are specifically expressed in prostate, as evidenced by analysis of data from expressed sequence tags (ESTs) and serial analysis of gene expression (SAGE), are also included. There are a total of 165 unique entries in the database. Users can either browse or query the PGDB through a web interface. For each gene, in addition to basic gene information and rich cross-references to other databases, inclusive and relevant literature references are provided to support the inclusion of the gene in the database. Detailed expression data calculated from the UniGene and SAGEmap databases are also presented.
Article
Full-text available
Cytoscape is an open source software project for integrating biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework. Although applicable to any system of molecular components and interactions, Cytoscape is most powerful when used in conjunction with large databases of protein-protein, protein-DNA, and genetic interactions that are increasingly available for humans and model organisms. Cytoscape's software Core provides basic functionality to layout and query the network; to visually integrate the network with expression profiles, phenotypes, and other molecular states; and to link the network to databases of functional annotations. The Core is extensible through a straightforward plug-in architecture, allowing rapid development of additional computational analyses and features. Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.
Article
This proposal aims to exploit advances in biotechnology and informatics to develop a genetics resource termed the Prostate Expression Database (PEDB) (http://www.pedb.org). . The foundation of PEDB is the identification and characterization of a prostate transcriptome, the intermediary between the genome and the proteome that represents that portion of the human genome actively used or transcribed in the prostate. The research accomplished to date has assembled a working virtual prostate transcriptome that defines the genes used or transcribed in prostate cell types and tissues. This transcriptome has a physical counterpart of 6,500 cDNAs arrayed in cDNA microarray format for large-scale expression studies. This transcriptome has been used as a foundation for studies of the prostate proteome, the working counterpart to the genome and transcriptome. Our results show that these approaches are complementary. Analysis of the virtual transcriptome of LNCaP cells has identified 15 new androgen-regulated genes to date. Characterization of these genes is in progress. We have extended PEDB to include sequence analysis of the murine prostate and constructed a corresponding database, mPEDB to facilitate the dissemination of mouse prostate gene expression information.
Article
Abstract Background Identification and characterization of the prostate stem cell is important for understanding normal prostate development and carcinogenesis. The flow cytometry-based side population (SP) technique has been developed to isolate putative adult stem cells in several human tissue types including the prostate. This phenotype is mainly mediated by the ATP-binding cassette membrane transporter ABCG2. Methods Immunolocalization of ABCG2 was performed on normal prostate tissue obtained from radical prostatectomies. Normal human prostate SP cells and ABCG2<sup>+ </sup>cells were isolated and gene expression was determined with DNA array analysis and RT-PCR. Endothelial cells were removed by pre-sorting with CD31. Results ABCG2 positive cells were localized to the prostate basal epithelium and endothelium. ABCG2<sup>+ </sup>cells in the basal epithelium constituted less than 1% of the total basal cell population. SP cells constituted 0.5–3% of the total epithelial fraction. The SP transcriptome was essentially the same as ABCG2<sup>+ </sup>and both populations expressed genes indicative of a stem cell phenotype, however, the cells also expressed many genes in common with endothelial cells. Conclusion These results provide gene expression profiles for the prostate SP and ABCG2<sup>+ </sup>cells that will be critical for studying normal development and carcinogenesis, in particular as related to the cancer stem cell concept.
Article
Abstract Background The gene expression profiles of most human tissues have been studied by determining the transcriptome of whole tissue homogenates. Due to the solid composition of tissues it is difficult to study the transcriptomes of individual cell types that compose a tissue. To overcome the problem of heterogeneity we have developed a method to isolate individual cell types from whole tissue that are a source of RNA suitable for transcriptome profiling. Results Using monoclonal antibodies specific for basal (integrin β4), luminal secretory (dipeptidyl peptidase IV), stromal fibromuscular (integrin α 1), and endothelial (PECAM-1) cells, respectively, we separated the cell types of the prostate with magnetic cell sorting (MACS). Gene expression of MACS-sorted cell populations was assessed with Affymetrix GeneChips. Analysis of the data provided insight into gene expression patterns at the level of individual cell populations in the prostate. Conclusion In this study, we have determined the transcriptome profile of a solid tissue at the level of individual cell types. Our data will be useful for studying prostate development and cancer progression in the context of single cell populations within the organ.
Article
Abstract Background Stromal mesenchyme cells play an important role in epithelial differentiation and likely in cancer as well. Induction of epithelial differentiation is organ-specific, and the genes responsible could be identified through a comparative genomic analysis of the stromal cells from two different organs. These genes might be aberrantly expressed in cancer since cancer could be viewed as due to a defect in stromal signaling. We propose to identify the prostate stromal genes by analysis of differentially expressed genes between prostate and bladder stromal cells, and to examine their expression in prostate cancer. Methods Immunohistochemistry using antibodies to cluster designation (CD) cell surface antigens was first used to characterize the stromas of the prostate and bladder. Stromal cells were prepared from either prostate or bladder tissue for cell culture. RNA was isolated from the cultured cells and analyzed by DNA microarrays. Expression of candidate genes in normal prostate and prostate cancer was examined by RT-PCR. Results The bladder stroma was phenotypically different from that of the prostate. Most notable was the presence of a layer of CD13<sup>+ </sup>cells adjacent to the urothelium. This structural feature was also seen in the mouse bladder. The prostate stroma was uniformly CD13<sup>-</sup>. A number of differentially expressed genes between prostate and bladder stromal cells were identified. One prostate gene, proenkephalin (PENK), was of interest because it encodes a hormone. Secreted proteins such as hormones and bioactive peptides are known to mediate cell-cell signaling. Prostate stromal expression of PENK was verified by an antibody raised against a PENK peptide, by RT-PCR analysis of laser-capture microdissected stromal cells, and by database analysis. Gene expression analysis showed that PENK expression was down-regulated in prostate cancer. Conclusion Our findings show that the histologically similar stromas of the prostate and bladder are phenotypically different, and express organ-specific genes. The importance of these genes in epithelial development is suggested by their abnormal expression in cancer. Among the candidates is the hormone PENK and the down-regulation of PENK expression in cancer suggests a possible association with cancer development.
Article
Abstract Background Affymetrix GeneChip Array and Massively Parallel Signature Sequencing (MPSS) are two high throughput methodologies used to profile transcriptomes. Each method has certain strengths and weaknesses; however, no comparison has been made between the data derived from Affymetrix arrays and MPSS. In this study, two lineage-related prostate cancer cell lines, LNCaP and C4-2, were used for transcriptome analysis with the aim of identifying genes associated with prostate cancer progression. Methods Affymetrix GeneChip array and MPSS analyses were performed. Data was analyzed with GeneSpring 6.2 and in-house perl scripts. Expression array results were verified with RT-PCR. Results Comparison of the data revealed that both technologies detected genes the other did not. In LNCaP, 3,180 genes were only detected by Affymetrix and 1,169 genes were only detected by MPSS. Similarly, in C4-2, 4,121 genes were only detected by Affymetrix and 1,014 genes were only detected by MPSS. Analysis of the combined transcriptomes identified 66 genes unique to LNCaP cells and 33 genes unique to C4-2 cells. Expression analysis of these genes in prostate cancer specimens showed CA1 to be highly expressed in bone metastasis but not expressed in primary tumor and EPHA7 to be expressed in normal prostate and primary tumor but not bone metastasis. Conclusion Our data indicates that transcriptome profiling with a single methodology will not fully assess the expression of all genes in a cell line. A combination of transcription profiling technologies such as DNA array and MPSS provides a more robust means to assess the expression profile of an RNA sample. Finally, genes that were differentially expressed in cell lines were also differentially expressed in primary prostate cancer and its metastases.