Use of Mouse embryonic
stem cell specific microRNA or mRNA signatures to isolate adult
human stem cells from tissues and organs
Specific Aims of the
Project:
Aim 1: We
have identified several microRNA expression clusters during mouse
embryonic stem cell differentiation. We will validate these data
in stem cells by qRT-PCR and northern hybridization.
Aim 2: Promoters
of stem cell specific miRNAs will be isolated through genomics,
bioinformatics and PCR based genome walking techniques.
Aim 3: Stem cell specific miRNA promoters will
be cloned in to reporter genes (GFP) and use as markers to isolate
adult mouse stem cells from complex cell populations. Next, we
will identify cell surface markers from adult mouse stem cells
to screen human counterparts¡¯ from organs and tissues.
Commercial Importance
and rationale of the project:
The rationale of this study
is to utilize early embryogenesis specific miRNA/mRNA expression
signatures (markers) to identify and extract adult stem cells
from human organs. Adult stem cells are important for diseases
such as Parkinson, Huntington, Type 1 diabetes and many more other
incurable diseases. Until today, there has not been an efficient
way to isolate adult stem cells from human organs.
Identification of adult stem cells is important a) To use patient¡¯s
own stem cells to cure diseases and avoid possible immune rejection
effects b) To avoid social and ethical issues that associated
with the use of human embryos for research purposes.
We believe that promoter reporter constructs, early and late embryonic
stem cell specific surface markers, optimized stem cell culturing
protocols that we develop during this study will be able to generate
substantial revenue for further research and development of this
program.
We have identified several
microRNA expression clusters during mouse embryonic stem cell
differentiation. We will validate these data in stem cells by
qRT-PCR and northern hybridization.

Background and Significance:
microRNAs (or miRNAs) are
small non-coding RNAs (21 to 25 nucleotides) that are processed
from large hairpin RNA precursors and are believed to be involved
in a wide range of developmental and cellular processes, by either
repressing translation or triggering mRNA interference (RNA interference).
Over 200 of distinct genes encoding microRNAs have been identified
through either computer-assisted approaches or cDNA cloning strategies
in many organisms including worm, plants, flies, mouse and human
(Lai 2003., Bartel 2004). Recently we have developed a microarray
based robust method to profile microRNA expression in organs cell
lines and tissues in mammalians (Sun and Perera 2004). Using this
method we have identified a group of miRNAs preferentially expressed
in human primary adipocytes and knocking down one such miRNA (miRNA
143) reverse the differentiation process (Esau 2004). We also
reported that a group of kidney specific miRNAs share evolutionary
conserved phylogenetic foot print Ets1 in the upstream of the
miRNA, possibly important for kidney physiological maintenance
(Sun and Perera 2004). Here I discuss a detail protocol for microRNA
global gene expression profiling through end labeling for micro
array experiment.
Embryonic stem cells (ES) are totipotent cell lines derived from
the inner cell mass (ICM) of the mammalian blastocytes (Smith,
2001). In vitro differentiation of ES cells recapitulates some
of the global methylation that takes place shortly after implantation
and has been used to study the epigenetic events that accompany
X chromosome inactivation during midblastula (Wutx and Jaenisch,
2000). The therapeutic potential of stem cells and nuclear cloning
has let to renewed interest in classical models in regeneration.
This long standing problem is undergoing a new beginning spurred
by the availability of new techniques that finally allow analysis
on the cellular and molecular level. miRNA global gene expression
profiling is one such new technique developed to identify stem
cell specific miRNAs (Sun and Perera 2004).
Preliminary Data
Chip design:
We have developed two protocols to profile miRNA global expression
in mouse and human embryonic stem cells. The data presented here
is mainly coming from the custom miRNA array we developed with
the collaboration of Affymetrix Corporation at Mountain View California.
Total of 278 mirs (probes) are being synthesized on Affymetrix
custom chip and modified T4 RNA ligase mediated protocol was developed
to label target miRNA for chip hybridization.
For each PM probe, we have
created ~1000 random mismatch probes with 1 ¨C 5 substitution bases.
We then computed Perfect match (PM) and mismatch (MM) probe qualities
by intercept (measurement of intensity at zero concentration)
and DeltaG (measurement of sequence similarity). The delta-G measurement
is a measurement of sequence similarity. If the delta-G ratio
is 1, then that mismatch would be expected to respond to a PM
spike just as a PM probe and where a delta_G ratio nearer to zero
indicates a probe that won't respond to the PM no matter what
the concentration of the PM spike.
We defined the intercept difference
(InterceptDiff) by Intercept (PM) ¨C Intercept (MM) and DeltaG
difference (DGDiff) by DeltaG (PM) ¨C Delta G (MM). Finally we
group the probes in bin 0 (with -0.25 <InterceptDiff <0.25)
by the number of mismatches and selected the highest DFDiff probes
for 1MM, 2MM, 3MM, 4MM 5MM respectively, i.e. most discriminated
MM probes from PM. 1MM means a probe has one mismatch base, and
2MM is a probe with two mismatch bases, etc. The bin zero MM probes
have about the same zero concentration intensity as the PM probes
and the high DGDiff values to make sure that the MM probes intensity
is much lower than the PM intensity. Please see in the supplemental
material file MM26_all.txt where contains all the MM probes (with
length equal to 26, only one PM) and the file MM26_final.txt lists
the final 25 MM probes selected using methods described above.
In general, we did not tile
probes which cross-hyb to hard prune sequences, as any signal
from those probes is highly likely to result from hybridization
of the repetitive element. In few occasions, we tile probe sets
which cross-hybridize to one of Affymetrix human library sequences
if we could not find a good unique set. For your initial set of
sequences Affymetrix software designed odd-length probes, so there
are files for 17mers through 25mers, although we can tile even-length
probes on the array. For each length, there are two files, a *.hp.all
file and a *.xhy.all file (please see supplemental material).
The first file (hp.all) contains the sequences which cross-hybridize
to any of our hard prune elements, which consist of repetitive
elements such as alu or ERVs, simple repeats, and abundantly expressed
RNAs such as rRNA. The second file (xhy.all) contains the sequences
which cross-hybridize to either one of the other design sequences
or to a sequence in our human library. The format of both of these
types of files is described in section 4.3.2 of Affymetrix Expression
Design Guide (click
here for details).
Target labeling protocol:
10 ¦Ìg of total RNA was first dephosphorylated with shrimp alkaline
phosphatase (SAP), in 37oC for 40 minutes, and then followed by
an inactivation of SAP at 65oC for 40 minutes. This dephosphorylated
RNA product was then end-labeled with biotin donor compound (NEN
biotin-N4 ¨C CTP, NEL-510) by T4 RNA ligase in the presence of
40% Polyethylene glycol (PEG). The reaction was carried out at
37oC for 2 hours. Chip hybridization, processing and scanning
was done according to Affymetrix protocols.
Preliminary data on
stage specific miRNA expression in mouse embryos:
Total RNA from day 0, day 3, day 6, day 9, day 12, and day 15
of the differentiated stem cells was isolated and labeled for
chip hybridization. Results of the raw signal intensity values
were normalized to chip medium 1 and further analyzed by S-Plus
and SAS (statistical packages) to obtain statistically significant
results. Affymetrix custom miRNA chip contains 15 probe replicates
and 25 different mismatches for each miRNA to provide maximum
statistical significant data. Due to the large number of MM probes
to a given PM probe in this chip, one can clearly discriminate
PM from MM and therefore, results we obtain from this method is
highly statistically significant. Here we focus mainly on up and
down regulated miRNAs in differentiated (15 days) compared to
non differentiated (0 days) ES cells. Table 1 illustrates the
expression changes (fold changes) and P-values of several miRNAs.
miRNA 322 shows the highest fold change differences in day 15
compared to day 0. Another interesting observation is the expression
of Let-7a, Let-7e, and Let-7e during the early stem cell differentiation
(Figure 1). Let-7 group of miRNA has been known to be involved
in developmental regulation in C.elegance.

Table 1. Signal intensity values, P-values and expression changes
of miRNA expression during stem cell differentiation in early
embryogenesis.

Figure 1:
Statistically significant miRNA array data was further cluster
(K-mean) to identify stage specific expression in early embryogenesis.
We were able to identify several stage specific miRNA clusters
that are unique for a particular stage but not the others. Figure
2 illustrates a highly expressed miRNA cluster (mir-152, mir-138,
mir-22, mir-357, Let-7a, and Let-7d) in day 15 compared to day
0 during mouse embryogenesis.

Figure 2:
miRNA expression clusters (K-mean)
Aim 1: ¡°We
have identified several microRNA expression clusters during mouse
embryonic stem cell differentiation. We will validate these data
in stem cells by qRT-PCR and northern hybridization¡±
qRT-PCR and Northern validation:
So far the Northern analysis has been the most efficient way of
validating miRNA array data. Take 10 ¦Ìg of total RNA to run on
the polyacrylamide gels and design antisense oligonucleotides
as labeling probes. Follow the protocol according to Ambion (www.ambion.com).
Use U6 labeled probe as the positive control. Some of the data
we validated so far gave us promising results (Figure 3). Expression
of pri-mir also could be an indicator for mir expression and therefore,
qRT-PCR can be used for this purpose. Primer probes are made to
the region where miRNA sequence is located in the pre-mir.
Aim 2: ¡°Promoters of stem cell specific miRNAs
will be isolated through genomics,
bio-informatics and PCR based genome walking techniques¡±
First we will write a PERL
script to detect upstream genomic sequences of selected miRNA.
PERL Script will use the miRNA sequence information and then locate
the corresponding genomic sequence (2-5Kb upstream sequences)
as the putative miRNA promoter from the publicly available human
genome sequences. PCR based genome walker technique will be used
to isolate the upstream genomic fragment. Genome walking protocols
are according to BD Biosciences (www.BD.com)
Aim 3: ¡°Stem
cell specific miRNA promoters will be cloned in to reporter genes
(GFP) and use as markers to isolate adult mouse stem cells from
complex cell populations. Next, we will identify cell surface
markers from adult mouse stem cells to screen human counterparts¡¯
from organs and tissues¡±
For more information please
write to invest@GBNweb.com