ArticlePDF Available

Elements of an archaeal promoter defined by mutational analysis

Authors:

Abstract and Figures

The sequence requirements for specific and efficient transcription from the 16S/23S rRNA promoter of Sulfolobus shibatae were analysed by point mutations and by cassette mutations using an in vitro transcription system. The examination of the box A-containing distal promoter element (DPE) showed the great importance of the TA sequence in the center of box A for transcription efficiency and the influence of the sequence upstream of box A on determining the distance between the DPE and the start site. In most positions of box A, replacement of the wild type bases by adenines or thymlnes are less detrimental than replacements by cytosines or guanines. The effectiveness of the proximal promoter element (PPE) was not merely determined by its high A + T content but appeared to be directly related to its nucleotide sequence. At the start site a pyrimldlne/purine (py/pu) sequence was necessary for unambiguous initiation as shown by analysis of mutants where the wild type start base was replaced. The sequence of box A optimal for promoter function in vitro is identical to the consensus of 84 mapped archaeal promoter sequences.
No caption available
… 
Content may be subject to copyright.
Nucleic
Acids
Research,
Vol.
20,
No.
20
5423-5428
Elements
of
an
archaeal
promoter
defined
by
mutational
analysis
Johannes
Hain*,
Wolf-Dieter
Reiter+,
Uwe
Hudepohl
and
Wolfram
Zillig
Max-Planck-lnstitut
fOr
Biochemie,
8033
Martinsried,
Germany
Received
June
24,
1992;
Revised
and
Accepted
September
25,
1992
ABSTRACT
The
sequence
requirements
for
specific
and
efficient
transcription
from
the
16S/23S
rRNA
promoter
of
Sulfolobus
shibatae
were
analysed
by
point
mutations
and
by
cassette
mutations
using
an
in vitro
transcription
system.
The
examination
of
the
box
A-
containing
distal
promoter
element
(DPE)
showed
the
great
importance
of
the
TA
sequence
in
the
center
of
box
A
for
transcription
efficiency
and
the
influence
of
the
sequence
upstream
of
box
A
on
determining
the
distance
between
the
DPE
and
the
start
site.
In
most
positions
of
box
A,
replacement
of
the
wild
type
bases
by
adenines
or
thymines
are
less
detrimental
than
replacements
by
cytosines
or
guanines.
The
effectiveness
of
the
proximal
promoter
element
(PPE)
was
not
merely
determined
by
its
high
A+
T
content
but
appeared
to
be
directly
related
to
its
nucleotide
sequence.
At
the
start
site
a
pyrimidine/purine
(py/pu)
sequence
was
necessary
for
unambiguous
initiation
as
shown
by
analysis
of
mutants
where
the
wild
type
start
base
was
replaced.
The
sequence
of
box
A
optimal
for
promoter
function
in
vitro
is
identical
to
the
consensus
of
84
mapped
archaeal
promoter
sequences.
INTRODUCTION
Two
consensus
regions
have
been
defined
by
comparison
of
promoter
sequences
of
Archaea
[1]
(Archaebacteria):
the
box
A
centered
about
27
bases
upstream
of
the
transcription
start
site
and
the
box
B
at
the
start
site
[2,
3].
Mutational
analysis
of
the
16S/23S
rRNA
promoter
of
Sulfolobus
shibatae
using
an
in
vitro
transcription
system
[4]
has
identified
the
position
and
function
of
three
essential
promoter
elements:
(i)
a
distal
promoter
element
(DPE)
encompassing
box
A,
which
is
important
for
transcription
efficiency
and
start
site
selection;
(ii)
a
proximal
promoter
element
(PPE)
between
position
-11
and
-2
which
also
contributes
to
transcription
efficiency;
and
(iii)
a
pyrimidine/purine-sequence
which,
at
the
proper
distance
from
the
DPE,
serves
as
start
site
[5].
Utilisation
of
heterologous
promoters
in
the
S.
shibatae
transcription
system
showed
that
constitutive
promoters
are
functionally
conserved
between
distantly
related
archaea
[6].
An
analysis
of
the
Methanococcus
vannielii
tRNAVaI
gene
promoter
[7]
is
in
fair
agreement
with
these
data
but
extends
the
determination
of
sequence
requirements
by
point
mutations
in
which
certain
bases
between
position
-35
to
+2
were
replaced
with
guanine.
In
this
study
we
analyze
the
functional
importance
of
positions
in
promoter
elements
in
more
detail,
especially
within
box
A
where
each
base
was
replaced
with
the
other
three.
We
also
examined
the
effect
of
all
possible
base
exchanges
at
the
start
site
in
order
to
check
the
requirement
for
a
pyrimidine/purine
sequence.
Moreover,
we
replaced
the
proximal
promoter
element
by
streches
of
adenines
or
thymines,
or
the
complement
of
the
wild
type
PPE-sequence,
to
test
the
hypothesis
that
a
high
A+T
content
is
sufficient
for
its
function
in
the
16S/23S
rRNA
promoter
[5].
Promoter
strength
and
transcript
start
sites
of
all
mutant
promoters
were
determined
by
Si
nuclease
mapping
of
the
in
vitro
transcripts.
The
box
A
sequence
optimal
for
promoter
function
in
vitro
was
compared
to
the
consensus
of
mapped
archaeal
promoters.
MATERIALS
AND
METHODS
Materials
Restriction
enzymes,
RNase-free
DNase
and
T4
ligase
were
obtained
from
Boehringer
Mannheim,
T4
polynucleotide
kinase,
Sl-nuclease
and
Klenow-fragment
of
E.coli
DNA
polymerase
I
from
Pharmacia,
radiochemicals
from
Amersham.
The
soluble
cell-free
extract
of
S.
shibatae
was
prepared
as
described
[4].
Methods
Construction
of
the
vector
and
mutants.
A
sequence
identical
to
the
-39
to
+6
promoter
region
of
the
S.
shibatae
16S/23S
rRNA
operon
[14]
was
cloned
into
the
phagemid
pBluescript
II
KS'
(Stratagene)
between
the
Xho
I
and
Bam
HI
cleavage
sites,
using
synthetic
oligonucleotides.
The
Pst
I-site
in
the
middle
of
the
promoter
region
was
generated
by
exchange
of
a
thymidine
against
guanosine
at
position
-19
and
against
cytidine
at
position
-21
in
the
oligonucleotide
sequence
(Fig.
1).
This
difference
in
the
promoter
sequence
had
no
negative
effect
on
transcription
efficiency
and
start
site
selection.
This
vector
construct
was
named
pSP2
and
used
as
standard.
The
Pst
I-site
together
with
either
the
XhO
I-
or
the
Bam
HI-site
were
used
to
generate
mutants
by
introduction
of
synthetic
oligonucleotides
comprising
distinct
mutations.
These
oligonucleotides
were
designed
with
ends
*
To
whom
correspondence
should
be
addressed
+
Present
address:
MSU-DOE
Plant
Research
Laboratory,
Michigan
State
University,
East
Lansing,
MI
48824-1312,
USA
5424
Nucleic
Acids
Research,
Vol.
20,
No.
20
complementary
to
the
sticky
ends
of
either
the
Xho
IVPst
I-or
Pst
I/Bam
HI-vector
fragment
generated
by
digestion
of
pSP2
with
the
respective
enzymes.
Both
double
digests
always
led
to
a
vector
fragment
with
overhanging
ends
on
the
same
strand
(Fig.
1).
Ligation
[8]
of
the
oligonucleotides
with
these
vector
fragments
led
to
a
circular
vector
with
a
short
single
stranded
region.
Upon
transformation
[9]
the
DNA-repair
system
of
the
recipient-cell
strain
E.coli
XL1-blue
converted
the
construct
into
double-stranded
DNA.
After
selection
on
LB
agar
containing
tetracyclin
(25
pg/ml)
and
ampicillin
(75
itg/ml)
the
recombinant
DNA
was
isolated
[10]
and
the
mutation
verified
by
DNA-sequencing
[11].
Mutated
phagemids
obtained
through
this
procedure
were
cleaved
with
Bgl
I
yielding
two
fragments
and
used
as
linear
templates
for
in
vitro
transcription.
In
vitro
transcription.
100
ng
template-DNA
were
incubated
for
10
min
at
60°C
in
50
/d
of
a
reaction
mixture
containing
50
mM
Tris-HCl
(pH
8.0),
25
mM
MgCI2,
1
mM
EDTA,
1
mM
Dithiothreitol,
2
mM
AT?,
1
mM
CTP,
1
mM
GTP,
1
mM
UTP
and
8
pl
of
the
cell-free
extract
from
S.
shibatae
[4].
The
mixture
was
chilled
on
ice,
50
/l
ddH20
were
added
and
the
reaction
mixture
was
extracted
three
times
with
100
pl
of
phenol/chloroform/isoamylalcohol
25:24:1
(vol/vol/vol).
The
nucleic
acids
were
precipitated
with
ethanol
and
the
template-
DNA
was
removed
by
digesting
with
25
U
RNase-free
DNase
I
(Boehringer,
Mannheim)
in
50
1d
reaction
buffer
for
30
min
at
room
temperature.
After
addition
of
50
pl
ddH20
the
reaction
mixture
was
extracted
twice
with
phenol/chloroform/
isoamylalcohol
and
with
chloroform/isoamylalcohol
24:1
(vol/vol).
The
aqueous
phase
containing
the
in
vitro
RNA
was
stored
at
-700C.
SI
analysis.
DNA
probes
for
SI
nuclease
analysis
[12,
13]
were
prepared
by
extension
of
the
5'-32P-labeled
M13
universal
sequencing
primer
hybridized
to
single
stranded
pSP2
or
its
respecfive
mutant
derivative
single
strand.
Identical
volumes
(two
microliters)
of
the
in
vitro
RNA
solutions
were
hybridized
to
an
at
least
fivefold
molar
excess
of
DNA
probe.
Hence
only
the
quantity
of
the
in
vitro
RNA
determined
the
strengit
of
the
signal
and
therefore
the
transcription
efficiency.
The
condition
for
hybridization,
SI-nuclease
digestion
and
the
electrophoresis
through
denaturing
polyacrylamide
gels
were
as
described
[4].
The
transcription
efficiencies
were
quantified
by
densitometry
of
the
autoradiographs.
A
calibration
curve
was
used
to
correct
for
nonlinearity.
Primer
extension.
After
in
vitro
transcription
and
DNase
I
digestion
the
reaction
mixtures
were
extracted
twice
with
phenol/chloroform/isoamylalcohol
and
were
ethanol
coprecipitated
with
30,000
cpm
of
the
5'-32P-labeled
M13
universal
sequencing
primer.
Reverse
transcription
was
performed
as
described
[14]
yielding
a
fragment
of
99
bp
(in
the
case
of
initiation
at the
wild
type
start
site).
The
cDNA
was
analyzed
on
a
6%
polyacrylamide
sequencing
gel.
RESULTS
In
order
to
define
the
sequence
requirements
within
the
archaeal
promoter
elements
responsible
for
transcription
efficiency
and
start
site
selection,
an
extensive
mutational
analysis
of
the
promoter
region
of
the
16S/23S
rRNA
operon
of
S.
shibate
was
performed.
For
this
purpose
a
cassette
with
a
sequence
almost
A
5,
-38
DPE
-25
1
-11
PPE
-2
(Start)
I
I
.
cccctcga
ggactgaaqaa&tAaOgqtcc
______
Xho
I
B
box
A
Pat
I
box
B.
Bar
HI
3,
5'
tcgag
.ctgc
3'
(oligo
with
a
miutation)
5'
ccc
S
I
cccc
3'
ggggagct
Xho
I
5,
3,1
3,
5S
Pat
I
Pat
I
cccctcgagttagatttatatgqgactgca
qgggagctcaatctaaatataccctg,
Bam
HI
gatcc
3'
5,
(oligo
with
a
mutation)
3'
acgtc
.................
cctag
5'
Figure
1:
(A).
Sequence
of
the
promoter
region
of
the
16S/23S
rRNA
operon
of
S.
shibata
containing
a
Pst
I
site
introduced
by
two
nucleotide
exchanges
at
posiions
-19
(F
to
G)
and
-21
(F
to
C).
Oligonueooides
with
this
promoter
sequence
from
position
-39
to
+6
(tnscription
stat
site
poiion
+1)
have
been
inserted
into
pBluescript
II
KS+
between
the
Xo
I
ad
te
Bwn
HI
sites
yielding
the
constrct
pSP2.
The
sequence
shows
the
disl
promoter
element
(DPE,
underined)
iluding
box
A
(uppercase
and
bod),
the
proximal
pmoter
ekment
(PPE,
undrlned)
and
box
B
(u
e
and
bold).
Th
M13-20
univra
primer
used
for
sequencing,
generation
of
the
SI
probes
and
for
prie
extension
binds
at
position
+83
to
+99.
(B).
Strategy
of
the
insrtion
of
synthetic
oligonuclides
containing
certain
mutai
into
pSP2.
The
pSP2
cosrct
was
cleaved
with
either
Xho
IIPst
I
(upper
part)
or
with
Pst
I/M
HI
(lower
part).
The
sythtc
doigoleotides
were
c-eny
at
both
ends
to
these
restiction
sites.
After
ligaton
and
transfonnation
the
single
straned
region
was
filled
in
by
the
DNA
repair
system
of
the
host
cell.
identical
to
that
in
the
natural
promoter
from
position
-39
to
+6
(transcription
start
site
defined
as
+
1)
was
assembled
from
overlapping
oligonucleotides
and
cloned
into
pBlescript
II
KS+
yielding
pSP2
(Fig.
1).
To
permit
the
facile
and
efficient
manipulation
of
all
parts
of
the
promoter,
the
pSP2
promoter
cassette
contained
a
Pst
I
site
between
the
distal
and
the
proximal
promoter
element
leading
to
two
nucleotide
exchanges
compared
to
the
wild
tpe
promoter.
In
vitro
transcription
of
pSP2
and
a
similar
construct
containing
an
entirely
wild
type
promoter
sequence
indicated
that
the
introduction
of
the
Pst
I
site
did
not
alter
the
initiation
site
or
negatively
affect
tnription
efficiency
(data
not
shown).
Mutations
were
focussed
on
three
regions:
(i)
box
A
(position
-32
to
-26),
where
each
of
the
seven
bases
of
the
wild
type
sequence
was
substituted
by
each
of
the
other
ihree
possibilities
(figure
2,
upper
part),
(ii)
the
PPE,
which
was
examined
by
three
cassette
mutants
(figure
2,
middle),
and
(iii)
box
B
with
the
start
site,
where
the
wild
type
guanine
at
position
+
1
was
substituted
by
each of
the
three
other
bases
(figure
2,
bottom).
The
mutations
in
the
promoter
region
were
introduced
by
ligation
of
synthetic
oligonucleotides
and
verified
by
DNA
sequence
detm
ination.
The
various
constru
acted
as
template
for
in
vtro
tnscription
with
a
cell
free
extract
of
S.
shibatae.
The
transcription
efficiency
of each
mutant
promoter
was
determind
by
Sl-nuclease
mapping
of
fte
resulting
transcription
products
followed
by
densitometric
analysis
of
the
respective
signals
on
the
autoradiogram.
To
check
the
reproducibility
for
each
individual
construct,
at
least
three
independent
repetitions
of
these
experiments
were
performed.
In
the
case
of
the
box
A
mutants,
a
pSP2-derived
probe
was
Nucleic
Acids
Research,
Vol.
20,
No.
20
5425
-4U
-30
-20
-10
-
1
.-
Start)
pSP2:
5'
tcgagttagatttatatgggactgcagaacaatatgtataatgcgga
3'
tcgagttaga2ttatatqqggactgcagaacaatatgtataatgcgqa
tcgagttagatgtatatgggactgcagaacaatatgtataatgcgga
tcgagttagatt.&atatgggactgcagaacaatatqtataatgcgga
tcgagttagatLt&atatgggactgcagaacaatatgtataatgcgga
tcgagttagattta&atgggactgcagaacaatatgtataatgcgga
tcgaqttagattta5atgggactgcagaacaatatgtataatgcgga
tcgagttagatttatStgggactgcagaacaatatgtataatgcgga
tcgagttagatttataCgggactgcagaacaatatgtataatgcgga
tcgagttagatttata2gggactgcagaacaatatgtataatgcgga
tcgagttagatAtaAatgggactgcagaacaatatgtataatgcgga
:gagtt&Alttatatgggactgcagaacaatatgtataatqcgga
tcgagttagatttatatgggactgcagaac
flTfltqcgga
tcgagttagatttatatgggactgcagaacaaA3
a&aatgcgga
tcgagttagatttatatgggactgcagaacaatatgtataatZcgga
tcgagttagatttatatgggactgcagaacaatatgtataatScgga
tcgagttagatttatatgggactgcagaacaatatgtataat&cgga
Si
Transcription
Si
(rmutants)
efficiency[%]
(wild
type,
100
reference)
3'
11
i
2
3'
69
±11
3'
45tS
\
3'
67
9
3'
1
_
3'
3
63±5
3'
7±23t
/
3'
20i4
3'
18
4
3'
9
a8±2
3'
1
3'
1
3'
116±18
\
3'
5±1
3'
b
39
3
3
'
28
6
3'
=
27±
9
'3'
*
96±10
3'
20
±
4
3'
89
±13
3'
19±
5
3'
:m
83
±14
*.1
3'
MU
3'
3'
3'
9:
31
9
3'
J
23±6
3
115
±20/
Fue
2:
Transcription
efficiencies
of
all
mutant
constructs
mapped
by
Sl-analysis.
The
data
were
derived
from
at
least
three
independent
repetitions
of
the
SI-analysis.
Box
A-mutations
upper
part,
PPE-mutations
in
the
middle
and
box
B-mutations
at
the
bottom
of
the
list.
Mutations
are
shown
by
bold
underlined
uppercase
letters,
wild
type
nucleotides
in
lowercase
letters.
The
box
A
motif
defined
by
functional
analysis
is
double
underlined.
The
autoradiographs
from
SI
experiments
and
the
deduced
transcription
efficiencies
(4+
standard
deviation)
are
shown
to
the
right
of
the
template
sequences.
Autoradiographs
shown
were
assembled
from
several
gels;
the
corresponding
wild
type
(=pSP2)
controls
are
shown
to
the
right
of
the
transcription
efficiency
data.
Autoradiographs
from
the
start
site
determinations
by
primer
extension
(PE)
are
shown
to
the
extreme
right
of
the
figure;
arrows
indicate
the
fragment
corresponding
initiation
at
the
wild
type
+
1
position.
Mapped
start
sites
are
also
indicated
on
the
template
sequences,
if
deviating
from
the
wild
type
start
site
position:
major
initiation
site:
black
dot;
niinor
initiation
site(s):
open
circle.
used
in
S1
analysis
for
the
examination
of
the
transcription
efficiency
and
the
start
site.
Since
the
probe
was
complementary
to
the
template
DNAs
up
to
box
A,
the
start
sites
of
all
these
mutants
could
be
monitored
without
difficulty.
In
case
of
mutations
in
the
PPE
and
in
box
B
promoter
mutants,
specific
DNA
probes
were
prepared
for
each
construct
to
exclude
SI
nuclease
digestion
at
mismatch
positions.
DPE
mutants
In
the
following
part,
the
influence
of
mutations
in
box
A
of
the
16S/23S
rRNA
promoter
shall
be
described
in
the
order
of
decreasing
effects
on
transcription
efficiency.
All
nucleotide
exchanges
at
positions
-30
and
-29
reduced
transcription
efficiency
dramatically
(Fig.
2),
indicating
the
importance
of
the
TA-sequence
at
these
positions
for
the
function
of
box
A;
only
the
adenine
at
position
-30
maintained
a
high
transcription
efficiency.
Every
exchange
at
position
-27
also led
to
a
reduction,
though
to
a
lesser
extent
than
at
the
positions
-30
and
-29.
The
thymine
at
position
-32
could
only
be
replaced
by
cytosine,
the
thymines
at
positions
-31
and
-28
only
by
adenines,
in
each
case
leaving
the
transcription
efficiency
higher
than
67%
of
that
of
the
wild
type
promoter
(Fig.
2).
Guanine
at
position
-32
was
tolerated,
but
in
a
lesser
extent
(45%
transcription
efficiency).
The
other
possible
exchanges
at
each
of
these
positions
left,
at
most,
11%
of
the
transcription
efficiency.
The
tolerance
of
the
transcription
system
especially
towards
cytosine,
but
not
adenine
or
guanine,
at
position
-32
shows
that
at
this
position
occupation
by
a
pyrimidine
rather
than
a
high
A+T
content
is
necessary
for
promoter
function.
In
contrast,
the
tolerance
towards
transversion
at
positions
-31
and
-28
suggests
that
a
weak
base
pairing
rather
than
a
certain
base
is
required
there.
At
position
-26,
only
the
replacement
of
the
wild
tpe
thymine
by
cytosine
led
to
a
strong
reduction
in
transcription
efficiency:
the
other
two
possible
exchanges
were
tolerated
(Fig.
2;
see
also
figure
3
for
an
overview).
All
of
these
box
A
mutations
influenced
the
transcription
efficiency
without
leading
to
a
shift
of
the
transcription
start
site.
One
mutant
('5S
DPE',
Fig.
2)
contained
the
DPE
of
the
5S
rRNA
gene
promoter
of
S.
shibatae.
This
alteration
in
the
region
immediately
upstream
of
the
box
A,
which
itself
was
the
same
as
in
the
16S/23S
rRNA
promoter,
left
the
transcription
efficiency
at
83%
but
introduced
ambiguity
of
transcription
initiation.
Starts
were
mapped
at
positions
-7
(guanine),
-5,
-3
(adenines)
and
+1
(guanine)
with
the
major
start
site
at
position
-5.
The
mutation
'Euka'
transfonned
the
wild
type
box
A
sequence
'TTTl-lATAT'
to
'TATAAAT'
which
represents
the
'TATA-box'
consensus
of
eukaryotic
pol2
promoters.
The
transcription
efficiency
from
this
promoter
mutant
was
19%
and
thus
M32
A:
M32
C:
M32
G:
M31
A:
M31
C:
M31
G:
M30
A:
130
C:
M30
G:
M29
T:
M29
C:
M29
G:
M28
A:
M28
C:
M28
G:
M27
T:
M27
C:
M27
G:
M26
A:
M26
C:
M26
G:
Euka:
5S
OPE:
5'
5
.
5'
5'
5'
5'
5
'
5'
5'
5'
5'
5'
5'
5.
5'
5'
5'
5'
5'
5.-
5'
5'
5'
Trans
AT:5'
oligo
T:
5'
oligo
A:
5'
Start
T
5'
Start
C
5'
Start
A
5'
Sf
I
I.
a
21
±
6
5
±
1
7
±
3
5426
Nucleic
Acids
Research,
Vol.
20,
No.
20
pSP
2:
T
T
T
A
T
A
T
ACG
ACG
ACG
TCG
ACG
TCG
ACG
N
C)
C
8
a a
C
o
X
2
2
2 2
2 2
2~~~~~~~~~~~2
Figure
3:
Transcription
efficiencies
of
box
A
mutants
obtned
by
quantitative
Si-mapping.
The
names
of
the
mutants
are
given
below
the
x-wis
at
their
corresponding
column
and
are
in
accordance
with
those
at
figure
2.
The
black
column
at
the
right
represents
the
pSP2
(the
16S/23S
rRNA
operon
promoter
introduced
into
pBluescript
II
KS+).
significant
though
weak.
Transcription
was
initiated
at
the
same
site
as
in
the
wild
type
promoter.
PPE
mutants
The
three
cassette
mutations
in
the
PPE,
an
oligo
adenine
stretch,
an
oligo
thymine
stretch
and
the
complement
of
the
wild
type
PPE-sequence,
reduced
the
tnscription
efficiency
to
7%,
5%
and
21%,
respectively.
The
start
site
of
each
of
these
ffiree
mutants
was
the
same
as
that
of
the
wild
type
promoter
(Fig.
2).
Box
B
mutants
In
box
B
the
guanine
at
the
trnscription
start
site
was
substituted
by
each
of
the
three
other
bases.
The
replacement
of
the
wild
type
guanine
by
adenine
did not
shift
the
transcription
start
site
but
increased
the
transcription
efficiency
slightly.
The
replacement
by
thymine
shifted
transcription
initiation
to
the
positions
-3
(adenine)
and
+2
(cytidine)
accompanied
by
a
reduction
of
transcription
efficiency
to
31%
overall.
The
replacement
by
cytidine
led
to
an
ambiguous
initiaton
in
the
vicinity
of
the
wild
type
start
site
but
mainly
at
the
position
+
1
(cytidine)
with
an
overall
htanscrition
efficiency
of
23%
(Fig.
2).
DISCUSSION
Using
an
in
vitro
transcription
system
from
Sulfolobus
shibatae
[4],
the
functional
significance
of
single
positions
widtin
archaeal
promoter
elements
was
examined
by
detennining
the
streng
of
altered
promoters
and
their
start
site
selecdon
in
comparison
to
the
wild
type
promoter
of
the
16S/23S
rRNA-operon
of
S.
shibatae.
The
latter
was
chosen
because
of
its
strength
in
vivo
organism
and
gene
Hc.
rRNA
P1
H.c.
rRNA
P2
H.c.
rRNA
P3
H.c.
LlI,l
H.c.
LIe
B.c.
NA
H.c.
SOD
N.h.
RIAP
N.h.
512
N.h.
HOP
H.h.
DOP
N.h.
Glycopr
.h.
flgA
Rh.
flgS
N.h.
Mn-SOD
N.h.
p-vac
N.h.
p-gvpO
N.h.
TS1,2,3
N.h.
NR
T4
N.h.
N9
TN
N.h.
N8
T7
N.h.
*b
TN
N.h.
N8
T9/10
N.h.
b
TINSSI.
N.h.
0
TLXI
N.h.
*N
Tant
N.m.
168/235
rRIA
P1
.mm.
168/238
rlUA
P2
N.m.
16S/238
rRNA
P3
Nt.
mvhDCA8
NM.
tcr
Mt.
purl
Mv.
165/238
eDNA
NMv.
tRIA
G
5
reUNA
M.v.
tRyArg
Mv.
ORFa
M.v.
S17
Mv.
hisA
M.v.
mcr
N.V.
ORrl
Ta.
165
rRNA
T.a.
23S
rRIA
Ta.
SS
rRNA
T.a.
tRNAOt
D.&.
Ligase
D.a.
SOR
(ar)
D.e.
sor-OF2
(konst.)
Da.
sor-ORN3
(aer)
D.a.
sorODF4
(anaer)
DG..
55
rRNA
D.-.
rRNA-Pl
D.m.
rRNA-P2
S..
S12
S.&.
ONFNS
(rpoN)
Se.
rpoC
S.a.
ORF-X
S.a.
SOD
S.a.
S7
Na.
tRNAMSr
S.a.
ef
la
S.a.
Ef
2
Ss.
16S/23S
rRNA
Ss.
5S
rRNA
S.s.
tRNAArg
S.s.
SSVI
T1,2
S.s.
SSV1
T3
S.9.
SSV1
T4
S.s.
SSV1
T9
S.s.
SSV1
T5
S.s.
SSV1
T6
T.c.
5S
rRNA
Tc.
rpoHIl)
T.c.
rpoH(2)
Tc.
rpl30
Tc.
rpsi2
Tc.
tRNAThr
Tc.
tRNAPro
T.p.
16S/23S
rRNA
T.p
tRNAMnt
T.p.
ORF1
T.p.
ORF2
T.t.
16S/23S
rRNA
Tt.
tRNAAla
T.t.
tRNAMet
consensus:
H.
c.
ORF
H.
h.
BRP
N.
h.
ON
TLX3
H.me.
mc-gvpA
S.s.
SSV1
Tind
sequence
TCC"=TcC
SAACCOCA
CCTCs
OAOa
M
CCA
OTCCOATOCCC
A
OCCTACTTC
CTGCAATC
AAC
ATTCOTOCCATAATM
OOOTTTCO
ATOMASO
SAC
MMMOG
n8WCO00
COO=zOOTTTC
TOACM~
OCT
CTTC
S-C-CTCCA
TCMCSOTG
T
ACA
GTTTCOACW
2SZc
MRO
GTGOAMO
OCWA?M~
TOG
OTTQACAOOAOAMTOCTOTO
OSADCARMY
G8OCPT6--
ATC
GTTGOOOOA
A
C
M
TOCTC
CTTOCAC
CAT
_GGTCSTAOMZACATAT
CCTCO!S
TACTSYWA
TOT
OCTAGAAACIMSOAOTOOCC
OOOTCATSACTC
RM
Cm
Tax=AT
CMC
cTCMTC
=
c
CAC
CATACACATCOO
WATOCC
CGSTATAM
TAGMISS
TTA
GAOOOTCAATC1ATACS
Gm0Cal
ACDAC
GCA
TAOATAGAZ*ZWAGSO
CCTCITAM
JOIO
MSA
ATOGGAACUCSTOOOOC
AAMOWT
a
I
a
SI-?
CCA
CTOCTAIW
C
CCASAOCA
CAT
ATCAAACAGAMTAOOTAO
GOGTCA!
OTIOW
GTSA
TGOCACOAA
AOCCAT
CcTAWC
?ATICMCA
TOA
CTCACAAhA
G5WOAA9Y=T=
GAGCORACOCM
C
CS=
IS
Im
CE
COCC
1IC
--AAAATSONGA
MTCA
GAO
CTACCTTT5_SOCOA
T0TOA£S?YS
C
AYCT
GOC
CTT~OA0O~5WG
AACR
CAOAU
SIA
AAG
C~~~~~~~~~~~CS
TCCO
R
lCAMGTM
OM:lCASTTB
AAUSt
MO
ATAACCZZZ
STTACCA
COCTA
ASA
TAC
OOOCACYIM&ATO
TcOAT
5C_T
CSM
TASCCTAAACS=AZMlATCA
ACSIIS
2AAIWA
T
TACCGAAAACSCTATTATA
ACACTATAT
TCMATAT
AaM
AACCGAAATAZZT
ACTA
ATMCCTC
TATATAC
?CT
TAAOCATAAT
ATCAT
AMCCAOTATS
TCTAT6c5T
MAC
CTATATTAAACZZUATCTOT
ATTcTATS
AMCTMA
CT
TAOTACCAMSZLTOTAM
ACCTAMTIA
ACATsWT
AAT
TGAAAACTTOMS&=TCTT~CT
TIMYMYTAT
TAYMTOTAT
ATA
TATOCAAAMAG
AANTAMcS
TAA
cTTATT
AAMACMT
TAT
GCCTTCGAAMZSIATM
ATTTOCTATT
CYTTACTS
CM
GATCAAAAT
CTCT
TMAATRTA
TCCA2SM
OCT
TCACGAAAATCZATTMATOT
CTTCTATATA
GTOIOCAcM
CCCTACAGCCMTSLSTC=C
TATOCTOIA
TGOCO
TO
CATCATCAAZAZSATATCT
TTCMACT
...A.Ta
GaO
AAMAAA_ASIS
TAO
AOAAATOTAT
ATITACAA
AMA
MTTSCACTTAAZATTTGT
TAAMSAOS
ITATATTATS
CAT
GAMATAOA1RZEZTCCTOAA
TAAAOTAOT
YMATTATO
TCG
CTCAACTATT1CAS8kCTAGT
OCLAAAAM
ATRAPAT
MCT
CCTAACCACAATATATT
GATOCTCOGA
TSTOM
TCC
TATATAAACQAj
CATMsTAGA
OTAAT
SMOYM
P
AccGTcATqaUAZMCTTO
OAGCMAAM
AcTCcIOAY
a
CM
CTAMGTTMTIZZMLMAGUT
TIOGATATI
mAITOTOW
TOT
CTAMCATMATmSLZAMCTA
AATOMYA
MSAW
ITO
cTGTGMaAMA3zzzSSAOaT
ATTOTICAMB
TM
O
COGTAAACCT2A=ATCT
TTATTASA
TAVSI*
AGM
AMTCAAAATAZCMZZAWM0T
TOOCAYTT
TT20TSM
MCA
CACATCTAM
ACCCTTCA
TMATATAWT
ASACM
TM
TTTAT
CATC
OTATAM
ATTM&SSC
C0o
TMCTAAACTZS&WCTMTC
TACBTATAaA
AATO
M
OTA
TCCACACACZAACTMAA
MYTATTAA
CTAFSa
csC
AGAAGTTAGAmSaZOTA
TT
TCAMAUCAMT
ATOTAMIC
TAGTTAATTTZSZMTOTGTTA
TGACTTTA
ATT
mA
CCC
TGTCTAACAGIZZM&ATATCCG
ATGAGOCAYA
TTTATOTT
ACC
ACTGGAGGGGZTAAAACGTAA
GCGGGAAOCC
ATAT"ACC
AAG
TTAGGCTCTTUTSA&GTCTACC
TTCTTTTTO
CTTAllTOA
OGA
GATAGCCCTTAZMGCCATAA
ATTTTTTATC
GCTTCATAA
GTG
AAGTAGGCCCZZAZAGTCAT
ATTCTTTTTC
TTTCCCTGAT
GAG
TAGAGTAAGAGIIAMTACTAAT
TTATACATAG
AGTATAGAT&
GAG
TAGAGTAAAGZZM&TACTTAT
ATAGATAGAG
TATAGAT&GA
OGG
GACCGTAACCTTTATAAACCCAG
GCCGGGAGTT
CCGTACCOT
ACG
TCGTAACCGAIAIAMCCCCTCT
GAGCGAGTTG
TCACfCAA
AGC
ACTCGAAAAGCTAUTACCGAA
ACCCOCAOOG
OOMTAOTG
ATA
ACCCGAAAAGGZTTAAAAGCACC
GCCTTAAGOG
TCTTTCQC
GTC
CTAGGAACAT=IUS&GA=CT
CGGGGTTAAM
GTTAAAMO
TCC
GAGGGAAAAATTTATAAATGCCA
ATCTGCATOC
TATCATCAC
C
CC
GAGGGAAAAGCTTAATAAGGAAC
CCTTTAACCC
ACTACCOCCC
CATO
GCATAATATTCAZAIACCCCC
GTTTACTAMC
TACATTSCC=
CCA
ATGCTAAAGGZTIAIIACCCAGG
AAGTATTCCO
GTCATW)GC
GTT
GCATTCGACAAA,TTAACCT
CGTTCGACGA
GTAT5MAGT
OCT
TAAATAGCTTZIACGTAGGAG
GTGCTACCTC
CAqWATCG
TAA
GCGAAAAATT*ATTAGGGT
GTTTTAGGAT
GGTOCCCT
TAA
AGCGAAAAAAflIAMTCGGTGA
GTMGTACOC
TCBGGCCGG
TAG
ACAAAAGCTTTZAIMTTCGCGC
AAAGCTTAGA
CCTIGCGGGG
TAG
TTA
A
C
A
GGAAAGCGCTTTTCOGCGCTTGCTGTCTACGQCCACGTATG
GGTCTTTTTTGATGCTCGGTAGTGACGTGTGTATTCATGCA
CGATGGGTTCAAGAACCTCGCTGCCCGTCTATTTCAATATGCCC
CACGAATGATTTTGTTACTTGCCAACACGTTTTCAGATOPGTA
Fure
4:
The
list
(first
part
euyarchaeotal,
second
part
c
al
si)
shows
the
in
start
sites
of
mapped
archaal
gensa
d
teir
pro
er
(box
A-seqences
are
underlined
and
the
Ipton
start
sites
bold
and
underlined.
The
sequences
are
aligned
for
the
box
A
elmet,
except
the
last
five
which
are
aligned
for
the
st
sits
since
no
consensus
box
A
could
be
found.
Abbreviations
usd
for
r
s:
H.c.:
Halobwntrim
ibrum
H.h.:
Halcrimj
halobium,,
H.mm.:
iu
marismortui,
H.me.:
Halobacterium
medierrani,
M.t.:
Me
bterium
thermoautoar
icum,
M.v.:
Methanococcus
vannielii,
T.a.:
acidophilum,
D.a.:
Desulfurolobus
ambivalens,
D.m.:
mobilis,S.a.:
Sulfolobus
acidocaldarius,
S.s.:
Sul
lobs
shibat,
T.c.:
Thermococcus
celer,
T.p.:
Thermofilum
pendens,
T.t.:
Throproeu
tenax.
reference
1161
(161
(161
1171
(171
(17)
(181
1201
121)
1221
1231
1231
1241
(25)
(261
(271
(271
(271
(271
1291
(291
1301
1311
(291
1321
(321
(321
(331
(341
(35)
(15)
(15)
1151
(36)
(36)
1371
(391
(39)
(401
(401
1401
(411
(421
1431
(43)
(43)
143)
144)
(44)
(441
(45)
(45)
(45)
1451
1461
1471
(47)
1481
(481
121
121
1491
12)
121
121
121
(2)
121
150)
1511
1511
1521
1521
(461
1461
1531
153)
(531
1531
154)
1541
1541
171
1551
311
1561
121
Nucleic
Acids
Research,
Vol.
20,
No.
20
5427
and
in
vitro
and
its
identity
to
the
promoter
consensus
defined
by
sequence
comparison
[2,
3, 15].
Promoter
efficiency
The
box
A.
All
box
A
mutations
replacing
the
thymine
at
position
-30
and
the
adenine
at
position
-29
in
the
center
of
box
A
showed
strong
reductions
of
transcription
activity
indicating
the
importance
of
the
TA-sequence
at
these
positions.
The
only
exception,
a
tolerance
towards
the
exchange
of
thymine
-30
against
adenine,
can
be
interpreted
as
a
shift
of
the
TA-sequence
one
position
upstream.
These
data
are
in
accordance
with
a
high
conservation
of
the
TA-sequence
in
the
center
of
box
A
in
archaeal
promoters
(Fig.
4).
The
distance
between
the
TA-
sequence
and
the
start
site
is
29
bases
in
the
16S/23S
promoter
but
between
25
to
28
bases
on
average.
These
results,
and
the
sensitivity
of
the
promoter
to
certain
base
exchanges
at
single
positions
of
box
A,
defined
an
optimal
box
A
sequence
for
promoter
function.
For
the
determination
of
this
optimal
box
A
sequence,
only
mutants
with
a
transcription
efficiency
higher
than
66%
of
the
wild
type
were
considered.
Position
-26
of
the
16S/23S
promoter
was
not
considered
important
for
promoter
function
since
only
one
of
the
three
possible
base
exchanges,
the
introduction
of
cytosine
instead
of
the
wild
type
thymine,
led
to
a
strong
reduction
of
promoter
strength.
Applying
the
above
criteria,
an
optimal
box
A
sequence
5'
T/CTTAT/AA
3'
(positions
-32
to
-27)
was
derived.
This
functionally-determined
sequence
was
in
good
agreement
with
the
consensus
of
archaeal
box
A
sequences
5'
TTTAT/AA
3'
[2,
3,
15].
Inspection
of
84
mapped
archaeal
promoters
(Fig.
4)
showed
that
cytosine
did
indeed
sometimes
replace
thymine
in
the
first
position
of
the
consensus
box
A.
The
last
position,
-27,
in
the
consensus
occupied
by
adenine
proved
very
sensitive
against
base
exchange
in
accordance
with
its
conservation
in
all
but
the
halobacterial
promoters.
The
PPE.
Box
A
is
not
the
only
element
determining
the
efficiency
of
transcription.
Exchange
of
a
second
essential
promoter
element,
the
A
+T
rich
PPE
at
positions
-11
to
-2,
by
stretches
of
adenines
or
thymines
(Fig.
2)
led
to
a
20
fold
reduction
in
promoter
efficiency
and
replacement
of
this
element
by
its
complementary
sequence
(Fig.
2)
resulted
in
a
5
fold
reduction
of
promoter
strength.
Thus,
the
element
is
not
merely
an
A+T-rich
region
facilitating
strand
separation
during
the
initiation
of
transcription,
as
previously
discussed
[5].
A
certain
sequence,
the
alternating
purine/pyrimidine
sequence
5'
ATATGTATA
3'
in
the
case
of
the
strong
16S/23S
rRNA
promoter
of
S.
shibatae,
appears
to
be
required
for
maximal
promoter
strength.
The
strong
reduction
of
transcription
efficiency
upon
replacement
of
the
element
by
its
complementary
sequence
indicates
that
the
sequence
must
be
correctly
positioned
with
respect
to
the
start
site.
The
PPE
sequence
is
not
generally
conserved
between
archaeal
promoters
(Fig.
4)
and
may
therefore
represent
a
particular
feature
of
the
16S/23S
rRNA
promoter.
The
start
site.
Promoter
strength
appears
to
depend
on
the
distance
between
the
DPE
and
the
start
site
[5]
as
well
as
on
the
occupation
of
this
site.
Apart
from
the
elements
already
discussed,
the
core
promoter
region
thus includes
the
start
site
itself.
When
an
unfavourable
start
site
was
introduced,
e.
g.
a
pyrimidine
instead
of
the
wild
type
purine,
the
start
shifted
to
nearby
sites
at
less
favourable
distances
concomitant
with
a
reduction
of
total
transcription
efficiency.
In
these
cases,
it
is
difficult
to
estimate
the contributions
of
the
changes
of
distance
and
start
site
context
to
the
overall
effect.
But
it
appears
that
promoter
strength
declines
sharply
whenever
a
dinucleotide
different
from
a
pyrimidine/purine
(py/pu)
occupies
the
optimal
initiation
region
defined
by
its
distance
from
the
DPE.
Start
site
selection
Analysis
of
promoter
mutants
with
regard
to
start
site
selection
indicated
the
necessity
of
a
start
motif
as
well
as
a
distance
measurement
in
defining
a
start
region
in
which
this
motif
serves
its
role.
We
suggested
previously
[5]
that
a
purine
preceeded
by
a
pyrimidine
acted
as
minimal
start
signal
since
most
initiations
on
a
number
of
mutant
constructs
occured
at
a
purine
following
a
pyrimidine.
Results
from
a
methanogene
transcription
system
[7]
and
those
of
our
current
work
corroborate
this
assumption.
Furthermore
a
comparision
of
mapped
archaeal
transcription
start
sites
showed
that
79
of
89
transcripts
initiate
at
a
purine
after
a
pyrimidine
(Fig.
4).
Since
such
a
dinucleotide
sometimes
occurs
more
than
once
at
an
appropriate
distance
from
box
A
and
initiation
nevertheless
remained
specific
there
must
be
additional
information
to
provide
specific
initiation.
The
formerly
proposed
box
B
consensus
T/ATGC/A
found
around
the
start
site
[2]
does
not
appear
to
play
this
role
since
initiation
occurs
either
upstream,
or
within,
or
downstream,
and
thus
not
at
a
defined
site
in
this
sequence.
Moreover
many
of
the
mapped
promoters
in
figure
4
do
not
show
this
consensus.
Aligning
promoter
sequences
for
the
start
site
yielded
a
different,
rather
weak
consensus
A/TT/CG/A
with
the
initiation
at
the
last
position
and
in
crenarchaeotal
promoters
a
weakly
conserved
(py/pu)4
pattern
(see
above).
The
DPE
as
well
as
the
py/pu
dinucleotide
determines
the
start
site.
This
had
already
been
shown
with
the
insertion
and
deletion
mutants
[5]
and
was
reconfirmed
by
some
of
our
promoter
mutants.
Alteration
of
the
distance
between
the
DPE
and
the
wild
type
start
site
led
to
a
shift
of
the
start
site
and
concomitantly
sometimes
to
ambiguity
of
initiation.
This
ambiguity
was
also
observed
when
the
wild
type
start
guanine
was
substituted
with
thymine
or
cytosine.
The
mutant
containing
the
5S
rRNA
promoter
DPE
instead
of
the
16S/23S
rRNA
promoter
DPE
showed
both
an
upstream
shift
of
the
start
site
and
ambiguity
in
start
site
selection.
The
three
different
types
of
mutants
share
one
common
feature:
all
of
the
new
start
sites
were
positioned
within
a
region
of
eight
bases.
This
indicates
that
the
DPE
delimits
(by
some
sort
of
distance
measurement)
a
certain
window
of
about
eight
bases
in
which
initiation
occurs.
Furthermore,
the
mutant
with
the
5S
DPE
shows
that
a
stretch
of
more
than
four
thymines
upstream
of
box
A
altered
the
structure
of
the
promoter
in
such
a
way
that
the
transcription
system
used
a
start
point
nearer
to
box
A
than
with
the
16S/23S
DPE-sequence.
Several
other
archaeal
promoters
carrying
stretches
of
thymines
in
this
region
also
showed
a
reduction
in
the
distance
between
box
A
and
the
start
site.
A
stretch
of
more
than
three
thymines
causes
bending
of
the
DNA
which
therefore
could
be
the
reason
for
the
reduction
of
this
distance.
ACKNOWLEDGEMENTS
We
wish
to
thank
Felicitas
Pfeifer
and
Melissa
Holmes
for
critical
reading
of
the
manuscript.
5428
Nucleic
Acids
Research,
Vol.
20,
No.
20
REFERENCES
1.
Woese,
C.
R.,
Kandler,
0.
and
Wheelis,
M.
L. (1990)
Proc.
Natl.
Acad.
Sc.
USA
87,
4576-4579.
2.
Reiter,
W-D.,
Palm,
P.,
Zillig,
W.
(1988)
NucI.
Acid
Res.
16,
1-19.
3.
Thoun,
M.
&
Wich,
G.
(1988)
Nucl.
Acid
Res.
16,
151-163.
4.
Hidepohi,
U.,
Reiter,
W.-D.,
Zilhg,
W.(1990)
Proc.
Natl.
AcadL
Sci.
USA,
87,
5851-5855.
5.
Reiter,
W-D.,
Hudepohl,
U.,
Zillig,
W.
(1990)
Proc.
Natl.
Acad
Sci.
USA
87,
9509-9513.
6.
Hidepohl,
U.,
Gropp,
F.
Horne,
M.,
Zillig,
W.(1991)
FEDS
Lot,
285,
257-259
7.
Hausner,
W.,
Frey,
G.
and
Thomm,
M.
(1991)
J.
Mol.
Biol.
222,
495-508
8.
Sambrook,
J.,
Fritsch,
E.F.
and
Maniats,
T.:
Molecula
Cloning,
Cold
Spring
Harbor
Laboratory
Press,
2nd
ed.,
New
York
1989
9.
Chung,
C.,
Niemela,
S.
&
Miller,
R.
(1989)
Proc.
Natl.
Acad
Sci.
USA
86,
2172-2175.
10.
Bimnboim,
H.C.
&
Doly,
J.
(1979)
Nucl.
Acid
Res.
7,
1513-1423
11.
Sanger,
F.,
Nicklen,
S.
&Coulson,
A.
R.
(1977)
Proc.
Natl.
Acad.
Sci.
USA
74,
5463-5467.
12.
Murray,
M.
G.
(1986)
Anal.
Biochem.
158,
165-170.
13.
Aldea,
M.,
Claverie-Marfin,
F.,
Diaz-Torres,
M.
R.
&
Kushe,
S.
R.
(1988)
Gene
65,
101
-110.
14.
Reiter,
W.-D.,
Palm,
P.,
Voos,
W.,
Kaniecki,
J.,
CGrumpp,
B.,
Schulz,
W.
and
ZiSlig,
W.
(1987)
Nucl.
Acid
Res.
15,
5581-5595.
15.
Wich,
G.,
Hummel,
H.,
Jasch,
M.,
BEr,
U.
and
BOck,
A.
(1986)
NucI.
Acid
Res.
14,
2459
-
2479.
16.
Dennis,
P.P.
(1985)
J.
Mol.
Biol.
186,
457-461.
17.
Shinnin,
L.
&
Dennis,
P.P.
(1989)
EMBO
J.
8,
1225-1235.
18.
May,
B.
&
Dennis,
B.P.
(1990)
J.
Baceriol.
172,
3725-3729.
19.
Leffers,
H.,
Gropp,
F.,
Lottspeich,
F.,
Zillig,
W.
and
Garrett,
R.A.
(1989)
J.
Mol.
Riol.
206,
1-17.
20.
Blanck,
A.
&
Oesterhelt,
D.
(1987)
EMBO
J.
6,
265-273.
21.
DasSarma,
S.,
RajBhandary,
U.L.,
Klxaa,
H.G.
(1984)
Proc.
NatL
Acad
Sci.
USA
81,
125-
129.
22.
Lechner
&
J.,
Sumper,
M.
(1987)
J.
Biol.
Chem
262,
9724-9729.
23.
Gerl,
L.
&
Sumper,
M.
(1988)
J.
Biol.
Chem
263,
13246-13251.
24.
Salin,
M.L.,
Duke,
M.V.,
Ma,
D.-P.
and
Boyle,
J.A.
(1991)
Free
Rad.
Res.
Comms.
12-13,
443449.
25.
Homne,
M.,
Englert,
C.
and
Pfeifer,
F.
(1988).
Mol.
Gen.
Genet.
213,
459-464.
26.
Jones,
J.G.,
Hackett,
N.R.,
Halladay,
J.T.,
Scothomn,
D.J.,
Yang,
C.-F.,
Ng,
W.-L.
and
Dassarma,
S.
(1989)
Nucl.
Acid
Res.
17,
7785-7793.
27.
Gropp,
F.,
Palm,
P.
and
Zillig,
W.
(1989)
Can.
J.
Microbiol.
35,
182-188.
28.
Gropp,
F.,
Grampp,
B.,
Stolt,
P.
Palm,
P.
and
Zillig,
W.
Virology,
in
press
29.
Stolt,
P.
and
Zillig,
W.
submitted
30.
Schnabel,
H.,
Palm,
P.,
Dick,
K.
and
Zillig,
W.
(1984)
EMBO
Journal
3,
1717-1722.
31.
Stolt,
P.
unpublished
32.
Mevarech,
M.,
Hirsch-Twizer,
S.,
Goldman,
S.,
Yakobson,
E.,
Eiaenbeg
H.
and
Dennis,
P.P.
(1989)
J.
Bacteriol.
171,
3479-3485.
33.
Reeve,
J.N.,
Beckler,
G.S.,
Cram,
D.S.,
Hamilton,
P.T.,
Brown,
J.W.,
Krzyckd,
J.A.,
Kolodziej,
A.F.,
Alx,
L,
Onme-Johnson,
W.H.
and
Walsh,
C.T.
(1989)
Proc.
Natl.
Acad.
Sci.
USA
86,
3031-3035.
34.
Bokranz,
M.,
Bilumer,
G.,
Allmberger,
R.,
Ankel-Fuchs,
D.
and
Klein,
A.
(1988)
J.
Bacteriol.
170,
568-577.
35.
Brown,
J.W.
&
Reeve,
J.N.
(1989)
FEMS
Microbiol.
Lett.
60,
131-136.
36.
Auer,
J.,
Spicker,
G.
and
BOck,
A.
(1989)
J.
Mol.
Biol.
209,
21-36.
37.
Brown,
J.W.,
Thonm,
M.,
Beckler,
G.S.,
Frey,
G.,
Steeter,
K.O.
and
Reeve,
J.N.
(1988)
Nucl.
Acid
Res.
16,
135-150.
38.
Thomm,
M.,
Sherf,
B.A.
and
Reeve,
J.N.
(1988)
J.
Bacteriol.
170,
1958-1961.
39.
Lechner,
K.,
Heller,
G.
and
BOck,
A.
(1989)
J.
Mol.
Evl.
29,
20-27.
40.
Ree,
H.
&
Zimmermann,
R.A.
(1990)
Nucl.,Acids
Res.
18,
4471-4478.
41.
Klenk,
H.-P,
Renner,
O.,
Schwass,
V.
and
Zillig,
W.
Nucl.
Acid
Res.
20,
in
press
42.
Kletzin,
A.
submitted
43.
Kletzin,
A.
J.
Bacteriol.
174,
in
press,
X56616
44.
Kjems,
J.
&
Garrett,
R.A.
(1987)
EMBO
J.
6,
3521-3530.
45.
Plhler,
G.,
Leffers,
H.,
Gropp,
F.,
Palm,
P.,
Klenk,
H.-P.,
Lottspeich,
F.,
Garrett,
R.A.
and
Zillig,
W.
(1989)
Proc.
Natl.
Acad.
Sci.
USA
86,
4569-4573.
46.
Klenk,
H.-P.,
unpublished
47.
Auer,
J.,
Spicker,
G.,
Mayerhofer,
L.,
POihler,
G.
and
BOck,
A.
(1991)
System.
Appl.
Microbiol.
14,
14-22.
48.
Schi6der,
J.
&
Klink,
F.
(1991)
Eur.
J.
Bio:
16,
321-327
49.
Reiter,
W-D.
&
Palm,
P.
(1990)
Mol
Gen
Gmw
221:
65-71
50.
Culhm,
D.E.
&
Nazar,
R.N.,
(1989)
Mot.
Ge.
Genet.
216,
412-416
51.
Klenk,
H.-P.,
Palm,
P.,
Lotspeich,
F.
and
Zilhig,
W.
(1992)
Proc.
NatL
Acad
Sci.
USA
89,
407-410.
52.
Klenk,
H.-P.,
Schwass,
V.
and
Zimig,
W.
(1991
4-klAu
",
6047.
53.
Kjenm,
J.,
Leffers,
H.,
Olesen,
T.,
Holz,
I.
and
GCutdt,
R.A.
(199
S)wWnL
AppL
Mcrbiol.
13,
117-127.
54.
Wich,
G.,
Leinfelder,
W.,
and
BOck,
A.
(1987
EMURO
J.
6,
523-528&
55.
Bedach,
M.,
Friedman,
J.,
Boyer,
H.W.
and
PAeI*r,
F.
(1984)
Ncl.
Acid
Res.
12,
7949-7959.
56.
Englert,
C.,
Horne,
M.
and
Pfeifer,
F.
(1990)
MoL
Gen.
Genet.
22,
225-232.
... The regulation of gene expression is an active area of research in archaea because of this evolutionary overlap and the attraction of a less complex experimental system. Examples of this overlap include homologous promoter structure (21,41), orthologs of TATA binding protein (29,37,48), TFIIB (referred to as TFB in archaea [18,38,39]), TFIIE␣ (TFE␣ [6,22]), TFIIS (TFS [25]), and the 12-subunit RNA polymerase II (28). They appear, however, to lack homologs of TFIIA, TFIIF, and TFIIH. ...
... However, merAp structure appears insufficient to explain the merA expression pattern observed in the merR disruption mutant strain, because merAp is unlikely to constitute a strong promoter. Though the merAp promoter has a putative TATA box and a consensus BRE, the presence of a G at the 3Ј end of another archaeal TATA hexamer reduced promoter strength by 75% (21). In addition, merAp exhibits nonstandard spacing between the TATA box and the start point of transcription. ...
... In addition, merAp exhibits nonstandard spacing between the TATA box and the start point of transcription. The consensus for this distance in archaeal promoters is 26 nt measured from the midpoint of the octameric TATA box to the start point of transcription (21,40,41). In merAp, this distance is 33 nt and would thus rotate the TATA box around the DNA helix relative to the start point of transcription. ...
Article
Full-text available
Mercury resistance mediated by mercuric reductase (MerA) is widespread among bacteria and operates under the control of MerR. MerR represents a unique class of transcription factors that exert both positive and negative regulation on gene expression. Archaea and bacteria are prokaryotes, yet little is known about the biological role of mercury in archaea or whether a resistance mechanism occurs in these organisms. The archaeon Sulfolobus solfataricus was sensitive to mercuric chloride, and low-level adaptive resistance could be induced by metal preconditioning. Protein phylogenetic analysis of open reading frames SSO2689 and SSO2688 clarified their identity as orthologs of MerA and MerR. Northern analysis established that merA transcription responded to mercury challenge, since mRNA levels were transiently induced and, when normalized to 7S RNA, approximated values for other highly expressed transcripts. Primer extension analysis of merA mRNA predicted a noncanonical TATA box with nonstandard transcription start site spacing. The functional roles of merA and merR were clarified further by gene disruption. The merA mutant exhibited mercury sensitivity relative to wild type and was defective in elemental mercury volatilization, while the merR mutant was mercury resistant. Northern analysis of the merR mutant revealed merA transcription was constitutive and that transcript abundance was at maximum levels. These findings constitute the first report of an archaeal heavy metal resistance system; however, unlike bacteria the level of resistance is much lower. The archaeal system employs a divergent MerR protein that acts only as a negative transcriptional regulator of merA expression.
... hispanica growth can be restored by supplementing a wider variety of 79 carbon sources, in line with its saccharolytic capabilities. We identified 15 robust TrmB 80 binding sites across the genome corresponding to the differential expression of 9 genes 81 predominately involved in gluconeogenesis. A point of bidirectional regulation by TrmB 82 of the EMP pathway in Hbt. ...
... For these targets, we also determined 430 whether the relative TrmB motif distance from the putative TATA box or start codon 431 was predictive of the direction of regulation (S4 Table). The archaeal promoter 432 architecture resembles a simplified version of that found in eukaryotes, including a 433 TATA box located around -26 bp and a BRE element around -33 bp [79,80]. Generally, 434 motifs were upstream of predicted transcription initiation features, consistent with our 435 observation of TrmB acting as an activator of these genes. ...
Preprint
Full-text available
Timely regulation of carbon metabolic pathways is essential for cellular processes and to prevent futile cycling of intracellular metabolites. In Halobacterium salinarum , a hypersaline adapted archaeon, a sugar-sensing TrmB family protein controls gluconeogenesis and other biosynthetic pathways. Notably, Hbt. salinarum does not utilize carbohydrates for energy, uncommon among Haloarchaea. We characterized a TrmB-family transcriptional regulator in a saccharolytic generalist, Haloarcula hispanica , to investigate whether the targets and function of TrmB, or its regulon, is conserved in related species with distinct metabolic capabilities. In Har. hispanica , TrmB binds to 15 sites across the genome and induces the expression of genes primarily involved in gluconeogenesis and tryptophan biosynthesis. An important regulatory control point in Hbt. salinarum , activation of ppsA and repression of pykA , is absent in Har. hispanica . Contrary to its role in Hbt. salinarum and saccharolytic hyperthermophiles, TrmB does not act as a global regulator: it does not directly repress the expression of glycolytic enzymes, peripheral pathways such as cofactor biosynthesis, or catabolism of other carbon sources in Har. hispanica . Cumulatively, these findings suggest re-wiring of the TrmB regulon alongside metabolic network evolution in Haloarchaea.
... For these targets, we also determined whether the relative TrmB motif distance from the putative TATA box or start codon was predictive of the direction of regulation (S4 Table). The archaeal promoter architecture resembles a simplified version of that found in eukaryotes, including a TATA box located around -26 bp and a BRE element around -33 bp [81,82]. Generally, motifs were upstream of predicted transcription initiation features, consistent with our observation that TrmB acts primarily as an activator. ...
Article
Full-text available
Timely regulation of carbon metabolic pathways is essential for cellular processes and to prevent futile cycling of intracellular metabolites. In Halobacterium salinarum , a hypersaline adapted archaeon, a sugar-sensing TrmB family protein controls gluconeogenesis and other biosynthetic pathways. Notably, Hbt. salinarum does not utilize carbohydrates for energy, uncommon among Haloarchaea. We characterized a TrmB-family transcriptional regulator in a saccharolytic generalist, Haloarcula hispanica , to investigate whether the targets and function of TrmB, or its regulon, is conserved in related species with distinct metabolic capabilities. In Har. hispanica , TrmB binds to 15 sites in the genome and induces the expression of genes primarily involved in gluconeogenesis and tryptophan biosynthesis. An important regulatory control point in Hbt. salinarum , activation of ppsA and repression of pykA , is absent in Har. hispanica . Contrary to its role in Hbt. salinarum and saccharolytic hyperthermophiles, TrmB does not act as a global regulator: it does not directly repress the expression of glycolytic enzymes, peripheral pathways such as cofactor biosynthesis, or catabolism of other carbon sources in Har. hispanica . Cumulatively, these findings suggest rewiring of the TrmB regulon alongside metabolic network evolution in Haloarchaea.
... Les virus étaient utilisés comme des systèmes modèles simplifiés du fonctionnement moléculaire du vivant. L'étude des phages a permis de démontrer la nature aléatoire des mutations (Luria and Delbrück, 1943), de déterminer la structure des gènes (Benzer, 1955), de démontrer de l'usage de triplets dans le code génétique (Crick et al., 1961) ou d'identifier des systèmes de régulation de l'expression des gènes (Hain et al., 1992;Reiter et al., 1987). L'étude des virus a aussi permis le développement d'outils pour la biologie moléculaire comme les enzymes de restriction, les ligases ou le système d'expression T7 (Salmond and Fineran, 2015). ...
Thesis
Les réarrangements permettent une évolution rapide du génome par l’acquisition de séquences codantes exogènes, la perte de fonctions non-essentielles ou la création de nouvelles organisations génomiques. Différents mécanismes de réarrangements impliquant des éléments génétiques mobiles (EGM) ont été identifiés chez les archées, les bactéries et les eucaryotes. En revanche, on ignore l’origine des nombreuses inversions génomiques détectées pour les espèces du genre archéen Thermococcus. Mes travaux de thèse visent à améliorer la compréhension de l’évolution génomique chez les Thermococcales à travers l’étude de deux familles d’EGM : les familles de plasmides pTN3 et pT26-2. Plus précisément, je me suis intéressée aux recombinases à tyrosine (ou intégrases) que ces plasmides encodent et qui permettent leur intégration dans le chromosome de l’hôte. J’ai montré que l’intégrase plasmidique Intᵖᵀᴺ³ est responsable d’inversions dans le chromosome de son hôte Thermococcus nautili grâce à une activité catalytique inédite de recombinaison homologue. J’ai par la suite caractérisé deux autres intégrases de Thermococcales reliés phylogénétiquement à Intᵖᵀᴺ³ dont seulement une présente une activité de recombinaison homologue. La comparaison de leurs séquences primaires et la résolution de la structure de Intᵖᵀᴺ³ vont maintenant éclairer les déterminants génétiques responsables de la spécificité de site et de l’activité de recombinaison homologue. Les trois intégrases appartiennent à une classe de recombinases spécifique des archées qui catalyse une intégration suicidaire. Lors de l’intégration, le gène de l’intégrase est fragmenté et probablement désactivé. L’EGM intégré se retrouve piégé dans le chromosome. Les avantages évolutifs d’une telle activité suicidaire restent pour l’instant mystérieux. J’ai identifié 62 intégrases hyperthermophiles suicidaires et reconstruit leur histoire évolutive. Ces intégrases sont très prévalentes et recrutées par différents EGM. De plus, j’ai montré que l’une de ces intégrases présente in vitro une activité de recombinaison site-spécifique à des températures proches de l’ébullition de l’eau, représentant un avantage dans les environnements hyperthermophiles.
... 20−23 Transcription initiation occurs upon the sequence-specific binding of TBP to a TATA box upstream of the transcription start sites (TSS) and TFB to a factor B recognition element (BRE) upstream of the TATA box. 24,25 RNAP is then recruited to the TSS, and the preinitiation complex is formed. By contrast, transcription regulation is bacterial-like, with over half of the identified archaeal transcription factors having at least one bacterial homologue. ...
Article
Methanococcus maripaludis is a rapidly growing, hydrogenotrophic, and genetically tractable methanogen with unique capabilities to convert formate and CO2 to CH4. The existence of genome-scale metabolic models and an established, robust system for both large-scale and continuous cultivation make it amenable for industrial applications. However, the lack of molecular tools for differential gene expression has hindered its application as a microbial cell factory to produce biocatalysts and biochemicals. In this study, a library of differentially regulated promoters was designed and characterized based on the pst promoter, which responds to the inorganic phosphate concentration in the growth medium. Gene expression increases by 4- to 6-fold when the medium phosphate drops to growth-limiting concentrations. Hence, this regulated system decouples growth from heterologous gene expression without the need for adding an inducer. The minimal pst promoter is identified and contains a conserved AT-rich region, a factor B recognition element, and a TATA box for phosphate-dependent regulation. Rational changes to the factor B recognition element and start codon had no significant impact on expression; however, changes to the transcription start site and the 5' untranslated region resulted in the differential protein production with regulation remaining intact. Compared to a previous expression system based upon the histone promoter, this regulated expression system resulted in significant improvements in the expression of a key methanogenic enzyme complex, methyl-coenzyme M reductase, and the potentially toxic arginine methyltransferase MmpX.
... Each GTF binds to a specific DNA element in the promoter region ( Figure 2) (166). TBP first recognizes and binds to the TATA box (14,62). TFB then binds to the B recognition element (BRE), stabilizing the TBP-TATA box interaction (128,166). ...
Article
Archaea are major contributors to biogeochemical cycles, possess unique metabolic capabilities, and resist extreme stress. To regulate the expression of genes encoding these unique programs, archaeal cells use gene regulatory networks (GRNs) composed of transcription factor proteins and their target genes. Recent developments in genetics, genomics, and computational methods used with archaeal model organisms have enabled the mapping and prediction of global GRN structures. Experimental tests of these predictions have revealed the dynamical function of GRNs in response to environmental variation. Here, we review recent progress made in this area, from investigating the mechanisms of transcriptional regulation of individual genes to small-scale subnetworks and genome-wide global networks. At each level, archaeal GRNs consist of a hybrid of bacterial, eukaryotic, and uniquely archaeal mechanisms. We discuss this theme from the perspective of the role of individual transcription factors in genome-wide regulation, how these proteins interact to compile GRN topological structures, and how these topologies lead to emergent, high-level GRN functions. We conclude by discussing how systems biology approaches are a fruitful avenue for addressing remaining challenges, such as discovering gene function and the evolution of GRNs.
Article
Molecular phylogenetic surveys have recently revealed an ecologically widespread crenarchaeal group that inhabits cold and temperate terrestrial and marine environments. To date these organisms have resisted isolation in pure culture, and so their phenotypic and genotypic characteristics remain largely unknown. To characterize these archaea, and to extend methodological approaches for characterizing uncultivated microorganisms, we initiated genomic analyses of the nonthermophilic crenarchaeote Cenarchaeum symbiosum found living in association with a marine sponge, Axinella mexicana . Complex DNA libraries derived from the host-symbiont population yielded several large clones containing the ribosomal operon from C. symbiosum . Unexpectedly, cloning and sequence analysis revealed the presence of two closely related variants that were consistently found in the majority of host individuals analyzed. Homologous regions from the two variants were sequenced and compared in detail. The variants exhibit >99.2% sequence identity in both small- and large-subunit rRNA genes and they contain homologous protein-encoding genes in identical order and orientation over a 28-kbp overlapping region. Our study not only indicates the potential for characterizing uncultivated prokaryotes by genome sequencing but also identifies the primary complication inherent in the approach: the widespread genomic microheterogeneity in naturally occurring prokaryotic populations.
Article
Methanogenic Archaea can be integrated into a sustainable, carbon-neutral cycle for producing organic chemicals from C1 compounds if the rate, yield and titer of product synthesis can be improved using metabolic engineering. However, metabolic engineering techniques are limited in methanogens by insufficient methods for controlling cellular protein levels. We conducted a systematic approach to tune protein levels in Methanosarcina acetivorans C2A, a model methanogen, by regulating transcription and translation initiation. Rationally designed core promoter and ribosome bind site mutations in M. acetivorans C2A resulted in a predicable change in protein levels over a 60 fold range. The overall range of protein levels was increased an additional 3 fold by introducing the 5' untranslated region of the mcrB transcript. This work demonstrates a wide range of precisely controlled protein levels in M. acetivorans C2A which will help facilitate systematic metabolic engineering efforts in methanogens.
Article
Full-text available
The archaebacterium Halobacterium cutirubrum contains a single detectable, Mn-containing superoxide dismutase, which is encoded by the sod gene (B. P. May and P. P. Dennis, J. Biol. Chem. 264:12253-12258, 1989). The genome of H. cutirubrum also contains a closely related sod-like gene (slg) of unknown function that has a pattern of expression different from that of sod. The four amino acid residues that bind the Mn atom are conserved, but the flanking regions of the two genes are unrelated. Although the genes have 87% nucleotide sequence identity, the proteins they encode have only 83% amino acid sequence identity. Mutations occur randomly at the first, second, and third codon positions, and transversions outnumber transitions. Most of the mutational differences between the two genes are confined to two limited regions; other regions totally lack differences. These two gene sequences are apparently in the initial stage of divergent evolution. Presumably, this divergence is being driven by strong selection at the molecular level for either acquisition of new functions or partition and refinement of ancestral functions in one or both of the respective gene products.
Article
Full-text available
To elucidate the organization of the transcription units encoding the 16S, 23S and 5S rRNAs in the archaebacterium Thermoplasma acidophilum, the nucleotide sequences flanking the three rRNA genes were determined, and the 5' and 3' termini of the rRNA transcripts were mapped by primer extension and nuclease S1 protection. The results show that each of the rRNAs is transcribed separately, consistent with the lack of physical proximity among them in the T. acidophilum genome. The transcription initiation sites are preceded at an interval of approximately 25 base pairs by conserved A + T-rich sequences of the form CTTATATA, which strongly resemble the archaebacterial promoter consensus, TTTAT/AATA. In all three cases, transcription termination occurs within T-rich tracts just downstream from inverted repeats which can be folded into relatively stable stem-loop structures. While no partially processed intermediates of the 16S or 5S rRNA transcripts were detected, the 23S rRNA transcript appears to be processed by a RNase Ill-like activity prior to final maturation. This is the only organism known in the prokaryotic world in which the 16S, 23S and 5S rRNAs are all expressed
Article
DNA-dependent RNA-polymerase (RNAP) purified from the thermophilic archaebacterium Methanobacterium thermoautotrophicum strain ΔH has been shown to bind specifically to DNA in the intergenic region upstream of the purE gene cloned from this species. The RNAP binding site has been limited to a 41 bp region of DNA which contains the Box A archaebacterial promoter sequence, 5′ATTAAATA. Transcription of the purE gene appears to initiate in vivo at three locations 20–22 bp downstream of the Box A sequence and 27–29 bp upstream of the ATG translation initiation codon of the purE gene.
Article
An improved method for mapping RNA transcript boundaries by the nuclease protection technique is presented. This method exploits the large (>20°C) difference in the thermal stability of RNA:DNA and DNA:DNA duplexes in concentrated chaotropic salt solutions. At 45°C in 3.0 m sodium trichloroacetate RNA:DNA hybridization is very efficient but DNA:DNA duplexes remain completely denatured. For many applications, this solvent system can eliminate the need to prepare probes that are free of competing or irrelevant DNA molecules. Fifty- to 100-fold more RNA:DNA hybridization is observed when reassociation is performed in 3.0 m sodium trichloroacetate than in solutions containing high concentrations of formamide. A comparison of the use of S1 nuclease or mung bean nuclease suggests that mung bean nuclease can produce more precise and less ambiguous nuclease protection patterns.
Article
A new method for determining nucleotide sequences in DNA is described. It is similar to the "plus and minus" method [Sanger, F. & Coulson, A. R. (1975) J. Mol. Biol. 94, 441-448] but makes use of the 2',3'-dideoxy and arabinonucleoside analogues of the normal deoxynucleoside triphosphates, which act as specific chain-terminating inhibitors of DNA polymerase. The technique has been applied to the DNA of bacteriophage varphiX174 and is more rapid and more accurate than either the plus or the minus method.
Article
A procedure for extracting plasmid DNA from bacterial cells 1s described. The method 1s simple enough to permit the analysis by gel electrophoresis of 100 or more clones per day yet yields plasmid DNA which is pure enough to be digestible by restriction enzymes. The principle of the method is selective alkaline denaturation of high molecular weight chromosomal DNA while covalently closed circular DNA remains double-stranded. Adequate pH control is accomplished without using a pH meter. Upon neutralization, chromosomal DNA renatures to form an insoluble clot, leaving plasmid DNA in the supernatant. Large and small plasmid DNAs have been extracted by this method.
Article
Within the chromosome of the archaebacterium Sulfolobus sp. B12, a 7.4 kb region was identified which displayed extensive sequence similarities to the 15.5 kb genetic element SSV1 carried by the same strain both as a circular form and as a site-specifically integrated copy. DNA sequence analysis indicated that this 7.4 kb region (designated SSV1intB) represented an SSV1-like element distinguishable from the full-length integrated copy (designated SSV1intA) by extensive deletions and point mutations. The physical organization of DNA sequences of SSV1intB indicated that this element was integrated at the same attP site as previously identified for SSV1intA. A comparison of the DNA sequences at the left attachment sites of SSV1intA and SSV1intB revealed that they both represented very similar putative arginine tRNA genes followed by a 10 bp inverted repeat sequence. S1 nuclease mapping experiments indicated that these tRNA genes are transcribed.
Article
In the moderately to extremely halophilic archaebacterium Haloferax mediterranei gas vacuoles are not observed before the stationary phase of growth, and only when the cells are grown in media containing more than 17% total salt. Under the electron microscope, isolated gas vesicles appear as cylindrical structures with conical ends that reach a maximal length of 1.5 microns; this morphology is different from the spindle-shaped gas vesicles found in the Halobacterium halobium wild type which expresses the plasmid-borne p-vac gene, but resembles that of gas vesicles isolated from H. halobium strains expressing the chromosomal c-vac gene. Both the p-vac and the c-vac genes encode very similar structural proteins accounting for the major part of the "membrane" of the respective gas vesicles. The homologous mc-vac gene was isolated from Hf. mediterranei using the p-vac gene as probe. The mc-vac coding region indicates numerous nucleotide differences compared to the p-vac anc c-vac genes; the encoded protein is, however, almost identical to the c-vac gene product. The start point of the 310 nucleotide mc-vac transcript determined by primer extension analysis and S1 mapping was located 20 bp upstream of the ATG start codon, which is at the same relative position as found for the other two vac mRNAs. During the growth cycle, mc-vac mRNA was detectable in Hf. mediterranei cells grown in 15% as well as 25% total salt, with a maximal level in the early stationary phase of growth. The relative abundance of mc-vac mRNA in cells grown at 25% salt was sevenfold higher than in cells grown in 15% total salt.
Article
The gene encoding component H of the DNA-dependent RNA polymerase (RNAP, EC 2.7.7.6) of Sulfolobus acidocaldarius has been identified by comparison of the amino acid sequence with the derived amino acid sequence of an open reading frame (ORF88) in the RNAP operon. Corresponding genes were identified in Halobacterium halobium and were cloned and sequenced from Thermococcus celer and Methanococcus vannielii. All these rpoH genes are situated between the promoters of the RNAP operons and the corresponding rpoB and rpoB2 genes. The archaeal H subunits show high sequence similarity to each other and to the C-terminal portions of the largest of four subunits shared by all three specialized nuclear RNAPs. These correlations are further evidence for the striking similarity between archaeal and eucaryal RNAP structures and transcription systems.