ArticlePDF Available

Adaptability at the protein-DNA interface is an important aspect of sequence recognition by bZIP proteins

June 1993
Proceedings of the National Academy of Sciences 90(10):4513-7

June 1993
90(10):4513-7

DOI:10.1073/pnas.90.10.4513

Source
PubMed

Authors:

Dimitris Tzamarias

University of Crete

Show all 5 authorsHide

The related AP-1 and ATF/CREB families of transcriptional regulatory proteins bind as dimers to overlapping or adjacent DNA half-sites by using a bZIP structural motif. Using genetic selections, we isolated derivatives of yeast GCN4 that affect DNA-binding specificity at particular positions of the AP-1 target sequence. In general, altered DNA-binding specificity results from the substitution of larger hydrophobic amino acids for GCN4 residues that contact base pairs. However, in several cases, DNA binding by the mutant proteins cannot be simply explained in terms of the GCN4-AP-1 structure; movement of the protein and/or DNA structural changes are required to accommodate the amino acid substitutions. The quintet of GCN4 residues that make base-pair contacts do not entirely determine DNA-binding specificity because these residues are highly conserved in the bZIP family, yet many of the bZIP proteins bind to distinct DNA sites. The alpha-helical fork between the GCN4 DNA-binding and dimerization surfaces is important for half-site spacing preferences, because mutations in the fork alter the relative affinity for AP-1 and ATF/CREB sites. The basic region in the protein-DNA complex is a long isolated alpha-helix, with no constraints from other parts of a folded domain. From all of these considerations, we suggest that small shifts in position and orientation or local deformations in the alpha-helical backbone distinguish one bZIP complex from another.

DNA-binding specificity at the +2 position of single-and double-mutant proteins. Protein-DNA complexes formed by incubating equivalent amounts of in vitro synthesized 35S-labeled proteins (determined by SDS/PAGE) with the target sequences containing mutated residues (underlined) at position ±2. optimal site. Consistent with this observation, these proteins activate transcription from a promoter containing ATGIC&CAT upstream of the his3 TATA element. Thus, substitution ofAla-239 with valine affects DNA-binding specificity at both the + 1 and ±2 positions. We also carried out detailed DNA-binding specificity experiments on the Trp-235 protein that had previously been shown to affect recognition at the ±4 position (15). The Trp-235 protein binds extremely strongly to AAGACTCTT but not to AGGACTCCT, AAGACTCIT, or to any sequence variants at the ± 1 or +2 positions. Indeed, the affinity for AAGACTCTT is higher than for the optimal site, indicating that the Trp-235 substitution alters sequence recognition at both positions ±3 and +4, with the more pronounced effect being at +3. Mutations of GCN4 That Affect Half-Site Spacing Specificity. We previously suggested that AP-1 and ATF/CREB

…

Figures - uploaded by Dimitris Tzamarias

Content may be subject to copyright.

Content uploaded by Dimitris Tzamarias

Content may be subject to copyright.

Proc.

Natl.

Acad.

Sci.

USA

Vol.

90,

pp.

4513-4517,

May

1993

Biochemistry

Adaptability

the

protein-DNA

interface

important

aspect

sequence

recognition

bZIP

proteins

(DNA-binding

protein/yeast

GCN4/transcription

factor/gene

regulation/leucine

zipper)

JOON

KIM*,

DIMITRIS

TZAMARIAS*,

THOMAS

ELLENBERGERt,

STEPHEN

HARRISONtt,

AND

KEVIN

STRUHL*§

*Department

Biological

Chemistry

and

Molecular

Pharmacology,

Harvard

Medical

School,

Boston,

02115;

tDepartment

Biochemistry

and

Molecular

Biology

and

tHoward

Hughes

Medical

Institute,

Harvard

University,

Cambridge,

02138

Communicated

Bert

Vogelstein,

February

19,

1993

ABSTRACT

The

AP-1

and

ATF/CREB

families

transcriptional

regulatory

proteins

bind

dimers

overlap-

ping

adjacent

DNA

half-sites

using

bZIP

structural

motif.

Using

genetic

selections,

isolate

derivatives

yeast

GCN4

that

affect

DNA-binding

specificity

particular

posi-

tions

the

AP-1

target

sequence.

general,

altered

DNA-

binding

specificity

results

from

the

substitution

larger

hy-

drophobic

amino

acids

for

GCN4

residues

that

contact

base

pairs.

However,

several

cases,

DNA

binding

the

mutant

proteins

cannot

simply

explained

terms

the

GCN4-

AP-1

structure;

movement

the

protein

and/or

DNA

struc-

tural

changes

are

required

accommodate

the

amino

acid

substitutions.

The

quintet

GCN4

residues

that

make

base-

pair

contacts

not

entirely

determine

DNA-binding

specificity

because

these

residues

are

highly

conserved

the

bZIP

family,

yet

many

the

bZIP

proteins

bind

distinct

DNA

sites.

The

a-helical

fork

between

the

GCN4

DNA-binding

and

dimeriza-

tion

surfaces

important

for

half-site

spacing

preferences,

because

mutations

the

fork

alter

the

relative

affinity

for

AP-1

and

ATF/CREB

sites.

The

basic

region

the

protein-DNA

complex

long

isolated

a-helix,

with

constraints

from

other

parts

folded

domain.

From

all

these

consider-

ations,

suggest

that

small

shifts

position

and

orientation

local

deformations

the

a-helical

backbone

distinguish

one

bZIP

complex

from

another.

The

DNA-binding

domains

most

eukaryotic

transcrip-

tional

regulatory

proteins

can

classified

into

relatively

small

number

distinct

structural

classes.

The

bZIP

motif

(50-60

amino

acid

residues)

consists

two

distinct

seg-

ments,

the

leucine

zipper

and

the

basic

region

(1).

The

C-terminal

residues

form

two-stranded

parallel

coiled

coil

(the

leucine

zipper),

which

mediates

dimerization

(2).

This

leucine

zipper

symmetrically

positions

divergent

pair

basic-region

a-helices,

which

pass

through

the

major

groove

each

DNA

half-site

(3-6).

Upon

specific

DNA-

complex

formation,

the

bZIP

segment

undergoes

folding

transition.

The

previously

unfolded

basic

region

becomes

a-helical

(7-9),

and

quintet

conserved

basic-region

residues

are

positioned

make

contacts

with

the

DNA

(6).

Yeast

GCN4

belongs

the

AP-1

family

transcription

factors

that

includes

the

Jun

and

Fos

oncoproteins.

The

optimal

AP-1-GCN4

recognition

sequence,

ATGA(C/

G)TCAT,

consists

overlapping

half-sites,

which

are

non-

equivalent

because

the

asymmetry

imposed

the

central

C-G

base

pair

(defined

position

(6,

10-12).

GCN4

also

binds

with

only

slightly

reduced

affinity

the

ATF/CREB

sequence

(ATGACGTCAT),

which

the

half-sites

abut

rather

than

overlap

(13).

contrast,

the

structurally

and

immunologically

ATF/CREB

transcription

factors

bind

much

efficiently

ATF/CREB

sites

than

AP-1

sites

(14).

have

therefore

proposed

that

AP-1

and

ATF/

CREB

proteins

make

similar

DNA

sequence-specific

con-

tacts

but

differ

their

half-site

spacing

requirements

(13).

previously

isolated

specificity

mutant

GCN4

genetic

selection

for

derivatives

activating

transcription

from

promoters

containing

mutant

binding

sites

(15).

The

mutant

protein

contains

tryptophan

place

the

invariant

basic-

region

asparagine

(Asn-235),

and

affects

specificity

the

±4

position

the

AP-1

site.

Here,

address

the

basis

DNA-binding

specificity

the

critical

positions

the

AP-1

site

isolating

additional

GCN4

specificity

mutants.

Furthermore,

address

the

specificity

half-site

spacing

generating

GCN4

derivatives

with

altered

preferences

for

the

AP-1

and

ATF/CREB

sites.

The

resulting

changes

DNA-binding

specificity

are

interpreted

terms

the

x-ray

structure

the

GCN4-DNA

complex

(6),

and

the

implica-

tions

these

results

for

the

DNA-binding

specificities

other

bZIP

proteins

are

discussed.

MATERIALS

AND

METHODS

The

methods

for

degenerate

oligonucleotide

mutagenesis,

genetic

selections

for

GCN4

specificity

mutants,

phenotypic

analysis,

and

DNA-binding

specificity

determinations

have

been

described

(15).

RESULTS

Isolation

GCN4

Mutants

That

Functionally

Interact

with

Altered

DNA

Sites.

GCN4

proteins

that

activate transcription

from

altered

AP-1

target

sites

were

isolated

the

genetic

selection

described

(15).

library

104

GCN4

derivatives

averaging

2-bp

substitutions

the

basic

region

(residues

236-246)

was

introduced

into

yeast

strains

containing

sym-

metrically

mutated

AP-1

sites

(AGGACTC_CT;

ATIAC-

TAAT)

upstream

the

his3

TATA

element

and

structural

gene

(Fig.

1).

Yeast

transformants

with

increased

levels

his3

expression

were

selected

their

growth

the

presence

aminotriazole.

GCN4

plasmids

were

recovered

from

these

strains,

sequenced,

and

analyzed

for

their

ability

stimulate

transcription

from

variety

mutant

AP-1

sites

(Fig.

2).

Upon

selection

for

proteins

that

could

activate

transcrip-

tion

from

ACGACTCGT,

isolated

derivative

which

Ala-238

changed

tyrosine.

This

Tyr-238

protein

also

stimulates

transcription

from

his3

promoter

containing

A-iGACTCCT,

but

unable

function

AAGAC-

TCIT.

Although

the

Tyr-238

protein

only

weakly

activates

transcription

from

the

mutant

target

sites,

clearly

effective

than

wild-type

GCN4.

The

Tyr-238

protein

retains

the

ability

activate

transcription

effi'ciently

from

the

opti-

§To

whom

reprint

requests

should

addressed.

4513

The

publication

costs

this

article

were

defrayed

part

page

charge

payment.

This

article

must

therefore

hereby

marked

"advertisement"

accordance

with

U.S.C.

§1734

solely

indicate

this

fact.

Proc.

Natl.

Acad.

Sci.

USA

(1993)

225

23S

238

239

242

246 247

SDPAALKRARiTEMR

RaRAIRKLQRMKQ

leucinezipper

~~~~~I

BJanm

AIwNI

Psd

Xhd

ATGACTCAT

GCN4

-M

ggaattcc

-447

-103

-85

-55

-35

TACTGAGTA

optimal

AP-1

site

-4

-3

-2

-1

ACGACT

CGT

AAGACT

CIT

ACT

AAT

CACAT

symmetrically

mutated

sites

FIG.

Isolation

GCN4

specificity

mutants.

(Upper)

Amino

acid

sequence

the

GCN4

basic

region

adjacent

the

leucine

zipper

shown

with

residues

identified

specificity

mutants

underlined.

The

library

mutant

GCN4

proteins

subjected

genetic

selection

was

generated

replacing

the

region

between

the

AlwNI

and

Pst

sites

with

degenerate

oligonucleotide

containing

average

2-bp

substitutions.

(Lower)

Target

promoters

used

for

genetic

selection

are

derived

from

molecule

containing

the

optimal

GCN4

binding

site

upstream

the

his3

element

and

structural

gene.

The

central

C-G

base

pair

the

optimal

binding

site

defined

position

base

pairs

the

right

are

defined

+4,

andbase

pairs

the

left

are

defined

-1

-4

(12).

Symmetrically

mutated

derivatives

the

GCN4

binding

site

that

respond

the

various

specificity

mutants

are

shown

below

with

nonoptimal

bases

under-

lined.

mal

AP-1

site.

Thus,

the

Tyr-238

substitution

the

GCN4

basic

region

broadens

the

specificity

the

±3

position.

Two

other

derivatives

were

selected

for

their

ability

activate

transcription

strongly

from

promoter

containing

AT-IACTAAT.

Both

these

derivatives

stimulate

transcrip-

tion

from

the

optimal

AP-1

site,

but

they

are

inactive

sites

with

other

symmetric

substitutions

the

±2

position.

Both

these

±2

specificity

mutants

contain

two

amino

acid

substitutions.

one

case,

Ala-239

and

Ser-242

are

changed

Val-239

and

Leu-242.

Examination

the

individual

sub-

stitutions

indicates

that

the

Val-239

protein

functions

AT-IACTAAT,

although

less

efficiently

than

the

double-

mutant

protein,

whereas

the

Leu-242

protein

does

not.

the

other

case,

Ser-242

and

Lys-246

are

changed

Cys-242

and

Gln-246;

both

these

substitutions

contribute

transcrip-

tional

activity

ATIACTAAT.

DNA-Binding

Specificities

the

Mutant

GCN4

Proteins.

The

various

proteins

were

synthesized

vitro

and

incubated

with

target

DNA

sequences

representing

all

possible

symmetric

mutations

positions

±1,

±2,

±3,

and

±4

(Figs.

and

Table

1).

general,

the

mutant

proteins

have

gained

the

ability

bind

specific

sites

which

wild-type

GCN4

does

not

bind.

Otherwise,

the

mutant

proteins

retain

normal

GCN4

sequence

recognition

properties

including

binding

the

optimal

site

with

near

wild-type

affinity.

all

cases,

the

DNA-binding

properties

the

mutant

proteins

are

excel-

lent

agreement

with

their

transcriptional

activation

proper-

ties

vivo.

The

Tyr-238

protein

behaves

similarly

GCN4

except

that

binds

detectably

(although

weakly)

AGGACTC_CT

and

A_CGACTCGT,

confirming

that

only

affects

sequence

recognition

the

±3

position.

The

Cys-242/Gln-246

double-

mutant

protein

also

behaves

similarly

GCN4,

except

that

binds

AT.IACTAAT

with

affinity

comparable

that

the

optimal

site.

Both

amino

acid

substitutions

are

impor-

tant

for

high-affinity

binding

ATTACTAAT,

because

the

Cys-242

and

Gln-246

single-mutant

proteins

bind

this

site

ATGICACAT

ATGACTCAT

AThACT&AT

ATGACTCAT

ATIACTAAT

(.9

-j

Q N

N N

Ny Ny

10mM

O5mM

OmM

tJN

ATGACTCAT

AfGACTCCT

ACGACTCÆT

AA.GACTCIT

4mM

FIG.

Phenotypic

analysis.

Strains

containing

the

indicated

GCN4

and

his3

promoter

derivatives

were

plated

medium

con-

taining

various

concentrations

aminotriazole

(AT).

From

top

bottom

are

shown

analyses

specificity

positions

±1,

±2,

and

±3.

very

weakly.

Thus,

the

dual

DNA-binding

specificity

the

Cys-242/Gln-246

protein

restricted

the

position.

accord

with

their

genetic

properties,

the

Val-239

and

Val-239/Leu-242

proteins

have

slightly

reduced

affinity

for

the

optimal

site,

and

they

efficiently

bind

AT-IACTAAT;

the

double-mutant

protein

active.

Although

selected

change

±2,

the

Val-239

and

the

double-mutant

protein

also

bind

ATGICACAT

well

they

that

the

GCN4

W235

V239,L242

C242,Q246

Y238

GCN4

Y238

V239,L242

W235

C242,Q246

aST

-C

GCN4

V239,L242

C242,Q246

Y238

W235

X£G

£G

GCN4

V239

V239,L242

L242

W235

Y238

C242,Q246

-C

FIG.

DNA-binding

specificities

GCN4

and

the

mutant

proteins.

Protein-DNA

complexes

formed

incubating

equivalent

amounts

vitro

synthesized

35S-labeled

proteins

(determined

SDS/PAGE)

with

the

indicated

target

sequences

(mutated

residues

are

underlined).

From

top

bottom

are

indicated

DNA-binding

specificities

positions

±4,

±3,

±2,

and

±1.

-J

'6r

LI)

4514

Biochemistry:

Kim

al.

Proc.

Natl.

Acad.

Sci.

USA

(1993)

4515

GCN4

C242

Q246

C242,

Q246

GCN4

V239,L242

L242

V239

QiG

FIG.

DNA-binding

specificity

the

position

single-

and

double-mutant

proteins.

Protein-DNA

complexes

formed

incu-

bating

equivalent

amounts

vitro

synthesized

35S-labeled

pro-

teins

(determined

SDS/PAGE)

with

the

target

sequences

con-

taining

mutated

residues

(underlined)

position

±2.

optimal

site.

Consistent

with

this

observation,

these

proteins

activate

transcription

from

promoter

containing

ATGIC&-

CAT

upstream

the

his3

TATA

element.

Thus,

substitution

Ala-239

with

valine

affects

DNA-binding

specificity

both

the

and

±2

positions.

also carried

out

detailed

DNA-binding

specificity

ex-

periments

the

Trp-235

protein

that

had

previously

been

shown

affect

recognition

the

±4

position

(15).

The

Trp-235

protein

binds

extremely

strongly

AAGACTCTT

but

not

AGGACTCCT,

AAGACTCIT,

any

se-

quence

variants

the

positions.

Indeed,

the

affinity

for

AAGACTCTT

higher

than

for

the

optimal

site,

indi-

cating

that

the

Trp-235

substitution

alters

sequence

recogni-

tion

both

positions

±3

and

+4,

with

the

pronounced

effect

being

+3.

Mutations

GCN4

That

Affect

Half-Site

Spacing

Specific-

ity.

previously

suggested

that

AP-1

and

ATF/CREB

proteins

make

similar

DNA

contacts

but

differ

half-site

spacing

preferences,

and

predicted

that

the

connection

between

the

leucine

zipper

and

basic

region

(residues

244-

250)

determines

the

flexibility

and

specificity

half-site

spacing

(13).

this

regard,

there

consistent

difference

the

position

corresponding

GCN4

residue

247:

ATF/

CREB

proteins

have

positively

charged

residue

(nearly

always

lysine),

whereas

AP-1

proteins

not

(GCN4

con-

tains

leucine)

(16).

therefore

analyzed

Lys-247

and

Arg-247

derivatives

GCN4

for

their

relative

binding

AP-1

and

ATF/CREB

sites

(Fig.

5).

Unlike

GCN4,

which

prefers

the

AP-1

site

over

the

ATF/CREB

site

factor

(13),

the

Lys-247

and

Table

Binding

GCN4

mutant

proteins

various

target

sequences

GCN4

derivative

Wild

Trp- Tyr-

Val-

Val-239/

Cys-242/

Target

site

type

235

238

239

Leu-242

Gln-246

ATGACTCAT

+++

++ ++

+++

CTGACTCA_I

- -

_GTGACTCAC

+++

ITGACTCAA

AAGACTCIT

+++

A_GACTC_jT

A-iGACTCCT

ATAACTIAT

ATCACTGAT

ATIACTAAT

+++

ATGCC_ICAT

ATG-jCCCAT

ATGICACAT

Relative

DNA-binding

abilities

(based

data

Figs.

and

additional

experiments)

are

indicated

follows:

+ +

wild-type

affinity;

somewhat

weaker

than

wild-type

affinity;

weak

binding;

not

detectable.

When

tested,

transcriptional

activation

the

GCN4

derivatives

the

indicated

target

sites

vivo

was

excellent

accord

with

the

DNA-binding

properties

vitro.

Iys247

GCN4

arg247

Iys247

tyr249

vaI2

O01O

01-0

FIG.

GCN4

derivatives

that

alter

half-site

spacing

specificity.

Protein-DNA

complexes

formed

incubating

equivalent

amounts

vitro

synthesized

35S-labeled

proteins

(determined

SDS/

PAGE)

with

the

individual

mixture

DNA

fragments

containing

the

ATF/CREB

AP-1

target

sequences.

Differences

electro-

phoretic

mobility

between

complexes

with

the

ATF/CREB

and

AP-1

binding

sites

are

due

the

length

differences

the

two

DNA

fragments.

Arg-247

proteins

bind

the

AP-1

and

ATF/CREB

sites

with

comparable

affinities.

comparison

GCN4,

these

pro-

teins

bind

with

reduced

affinity

the

AP-1

site

but

with

wild-type

affinity

the

ATF/CREB

site.

GCN4

derivative

which

residues

247,

249,

and

250

are

replaced

the

corresponding

residues

CREB

appears

show

further

reduction

AP-1

binding

activity

such

that

the

ATF/CREB

site

preferred

factor

Thus,

these

substitutions

alter

half-site

spacing

specificity,

but

they

not

fully

convert

GCN4

into

protein

with

typical

ATF/CREB

DNA-

binding

properties.

Nevertheless,

the

results

indicate

that

the

region

between

the

leucine

zipper

and

DNA-binding

surface

critical

for

half-site

spacing

specificity,

with

position

247

playing

important

but

not

fully

determinative

role.

Modeling

the

Mutant

Protein-DNA

Complexes.

The

crystal

structure

the

GCN4

bZIP-AP-1

DNA

complex

(6)

demonstrates

that

Asn-235,

Ala-238,

Ala-239,

Ser-242,

and

Arg-243

are

contact

with

the

central

the

binding

site.

addition,

numerous

basic

residues

anchor

GCN4

its

binding

site

hydrogen

bonds

and

electrostatic

interactions

with

the

phosphodiester

backbone.

This

structure

provides

framework

for

interpreting

the

functional

consequences

amino

acid

substitutions

the

GCN4

mutant

proteins.

Be-

cause

these

proteins

function

well

the

optimal

AP-1

site

present

the

crystallized

protein-DNA

complex,

have

tried

configure

the

mutant

side

chains

orientations

that

minimally

disrupted

the

wild-type

structure.

described

below,

some

the

substituted

residues

cannot

accom-

modated

the

wild-type

orientation

the

GCN4

basic

region

DNA.

Tyr-238.

the

GCN4

complex,

the

thymine

methyl

group

±3

interacts

with

the

methyl

group

Ala-238.

The

Tyr-238

substitution

creates

steric

clash

with

the

phosphates

bases

and

This

clash

can

relieved

local

adjustment

the

DNA

backbone

conformation-for

example,

ob-

served

complexes

the

bacteriophage

434

repressor

with

different

operators

(17).

The

tyrosine

hydroxyl

group

might

then

donate

hydrogen

bond

the

phosphate

the

pyrimidine

residue.

However,

the

tyrosine

ring

would

still

crowd

the

DNA

position

±4,

requiring

further

adjustment.

distributed

set

small,

local

structural

changes,

relative

wild-type,

may

contribute

broadened

specificity

posi-

tion

±3.

Trp-235.

Asn-235

interacts

directly

with

both

strands

the

optimal

target

site

through

hydrogen

bonds

with

the

±3

thymine

and

the

±2

cytosine,

and

may

also

communicate

with

position

±4

through

hydrogen

bond

intervening

water

molecule.

Trp-235

oriented

with

the

long

axis

its

indole

ring

pointing

away

from

the

DNA,

can

accom-

modated

the

wild-type

structure

without

interfering

with

DNA

contacts

made

other

residues.

The

crystal

structure

consistent

with

the

possibility

that

complex

the

Trp-235

protein

with

AAGACTCTT,

the

tryptophanyl

side

Biochemistry:

Kim

al.

Proc.

Natl.

Acad.

Sci.

USA

(1993)

chains

might

stack

against

the

±3

thymine

methyl

groups

the

mutant

site.

The

Trp-235

residue

relatively

±4,

but

the

basis

the

observed

specificity

change

this

position

unclear.

The

Trp-235

substitution

eliminates

two

hydrogen

bonds,

including

the

only

direct

contact

±2.

therefore

surprising

that

the

Trp-235

protein

binds

the

opti-

mal

GCN4

site

with only

slightly

diminished

affinity

and

that

shows

the

wild-type

preferences

±2.

Cys-242/Gln-246.

GCN4,

Lys-246

not

contact

with

DNA,

but

Ser-242

directly

interacts

with

the

±3

thymine

methyl.

Thus,

the

Gln-246

substitution

must

affect

DNA-

binding

specificity

indirectly,

probably

altering

the

posi-

tion

the

basic

region

accommodate

new

bases

position

±2.

unclear

why

Cys-242

effective

than

Ser-242

allowing

dual

specificity

position

±2.

Val-239

and

Val-239/Leu-242.

Ala-239

contacts

the

thymine

methyl

group

the

wild-type

GCN4

complex.

Substitution

the

larger

Val-239

residue

would

not

affect

the

thymine

contact,

but

would

crowd

the

Arg-243

side

chain

that

contacts

the

central

base

pair.

Arg-243

invariant

the

set

known

bZIP

proteins,

and

its

contact

the

central

guanine

energetically

significant

because

GCN4

will

bind

ATGAC

half-site

but

not

ATGAG

(13).

Crowding

Arg-243

Val-239

requires

some

conformational

adjust-

ment,

which

might

account

for

the

reduction

affinity

the

Val-239

and

Val-239/Leu-242

proteins

for

optimal

AP-1

site.

Although

Val-239

located

near

the

±1

base

pair,

unknown

how

tolerates

the

T*A

but

not

the

G-C

the

C-G

substitution.

Lys-247

and

Arg-247.

Residue

247

each

monomer

lies

within

the

"fork"

region

where

the

basic

regions

diverge

from

the

leucine

zipper.

assume

that

the

protein

contacts

AP-1

and

ATF/CREB

half-sites

similar

manner,

the

fork

will

widely

spread

ATF/CREB

com-

plexes.

Because

the

Lys-247

and

Arg-247

proteins

lose

af-

finity

for

AP-1

sites

but

not

for

ATF/CREB

sites

with

normal

affinity,

suggest

that

the

Lys-247

and

Arg-247

substitu-

tions

interfere

with

the

configuration

the

fork

necessary

for

AP-1

site

binding.

unlikely,

however,

that

such

interfer-

ence

reflects

electrostatic

repulsion

between

Lys-247/Lys-

247

Arg-247/Arg-247

pairs,

because

the

corresponding

Leu-247

residues

GCN4

are

not

proximity.

DISCUSSION

Functional

Analyses

the

GCN4-AP-1

Complex.

The

ge-

netic

selection

GCN4

derivatives

that

function

mutant

DNA

sequences

provides

method

for

identifying

amino

acid

residues

that

contribute

DNA-binding

specificity.

The

mutant

proteins

described

here

generally

retain

activity

the

optimal

AP-1

sequence

while

gaining

the

ability

bind

specific

mutant

target

sites.

Because

these

GCN4

derivatives

were

isolated

from

complex

libraries

mutant

proteins

rather

than

directed

mutagenesis,

likely

that

the

residues

identified

here

are

important

determinants

the

strict

DNA

sequence

specificity

GCN4.

Indeed,

the

five

residues

that

contact

the

central

(6),

four

(Asn-235,

Ala-238,

Ala-239,

and

Ser-242)

were

identified

the

GCN4

specificity

mutants.

amino

acid

substitutions

cause

only

local

structural

changes,

then

amino

acids

and

nucleotides

identified

the

specificity

mutants

might

predicted

interact

the

wild-type

complex.

Several

the

specific

contacts

inferred

this

way

(Asn-235

and

+3,

Ala-238

and

±3,

and

Ala-239

and

±1)

are

indeed

observed

the

crystal

structure.

How-

ever,

GCN4

mutants

affecting

specificity

±2

contain

substitutions

residues

239,

242,

and

246

that

are

not

contact

with

base-pair

the

wild-type

complex.

These

observations

imply

that

complexes

some

the

variant

proteins

differ

from

the

wild-type

structure

ways

than

just

local

perturbations

the

vicinity

the

altered

residues.

Relationship

Between

the

GCN4

Specificity

Mutants

and

Other

bZIP

Proteins.

Although

the

specificity

mutants

were

sought

primarily

understand

the

basis

GCN4

DNA-

binding

specificity,

some

them

are

relevant

other

bZIP

proteins.

First,

C/EBP

and

several

other

bZIP

proteins

contain

valine

the

position

corresponding

Ala-239,

where

valine

substitution

GCN4

affects

specificity

and

±2,

and

Schizosaccharomyces

pombe

PAP1

and

Sac-

charomyces

cerevisiae

YAP1

contain

glutamine

this

position.

Thus,

position

239

likely

play

role

the

distinct

DNA-binding

specificities

GCN4,

C/EBP,

and

YAP1

(3,

10,

18).

Second,

two

the

GCN4

specificity

mutants

bind

with

high

affinity

ATTACTAAT.

Several

AP-1

and

ATF/CREB

proteins

also

recognize

this

sequence,

and

T(G/T)AC

has

been

proposed

the

half-site

consensus.

possible

that

the

mutant

and

natural

bZIP

proteins

recognize

ATTACTAAT

the

same

way.

Third,

position

247

plays

important

role

half-site

spacing

and

likely

account

for

some

the

differences

between

AP-1

and

ATF/CREB

factors;

may

also

important

for

determining

half-site

relationships

other

bZIP

proteins.

Adaptability

the

Protein-DNA

Interface

Critical

Determinant

for

DNA-Binding

bZIP

Proteins.

The

basic

region

GCN4

and

other

bZIP

proteins

forms

extended

a-helix

when

binds

DNA,

and

other

tertiary

inter-

actions

within

the

protein

stabilize

its

conformation.

contrast,

most

the

other

well-studied

prokaryotic

and

eukaryotic

DNA-binding

domains

contain

compact,

globular

modules.

Constraints

within

their

folded

structures

restrict

adaptability

the

DNA

recognition

surface.

Flexibility

instead

built

into

elements

such

the

arm

repressor

and

the

linker

segment

GAL4,

which

fold

when

the

protein

binds

DNA

the

joints

zinc-finger

proteins,

which

allow

successive

fingers

wrap

around

DNA

the

major

groove.

Moreover,

the

globular

modules

are

generally

tightly

an-

chored

the

DNA

backbone

through

peptide-NH

groups

small

polar

residues.

result,

among

proteins

with

common

structural

motif,

there

strong

relationship

be-

tween

the

amino

acid

residues

the

recognition

surface

and

DNA-binding

specificity.

Proteins

containing

similar

amino

acid residues

the

recognition

surface

generally

have

similar

DNA-binding

specificities,

whereas

proteins

with

distinct

specificities

differ

these

crucial

amino

acid

posi-

tions.

Thus,

substitutions

amino

acid

residues

that

nor-

mally

contact

base

pairs

usually

cause

large

decreases

affinity,

because

altered

protein

cannot

adapt

unaltered

site,

and

efficient

binding

mutant

protein

altered

site

can

often

explained

new

interactions

between

the

substituted

amino

acids

and

base

pairs.

Our

results

suggest

that

adaptability

the

local

confor-

mation

and/or

positioning

the

basic

region

important

aspect

sequence

recognition

bZIP

proteins.

For

many

the

GCN4

specificity

mutants,

the

substituted

residues

cannot

accommodated

the

structure

the

complex.

The

Val-239

substitution,

which

affects

specificity

±1

and

±2,

requires

some

adjustment

the

protein

order

relieve

steric

clash

with

the

invariant

Arg-243

residue.

The

Tyr-238

substitution,

which

broadens

specificity

+3,

re-

quires

movement

the

DNA

backbone

away

from

the

protein.

Other

substitutions

larger,

hydrophobic

residues

are

permitted

positions

238

and

239

(19),

and

these

pre-

sumably

cause

some

perturbation

the

protein-DNA

inter-

face.

The

Trp-235

substitution

eliminates

the

only

contact

±2,

yet

retains

normal

DNA-binding

specificity

this

position.

Given

the

central

role

Asn-235

the

wild-type

complex

(hydrogen

bonds

±2

and

±3

and

possible

H20-mediated

hydrogen

bond

±4),

striking

that

some

substitutions

have

relatively

modest

effects

DNA-binding

4516

Biochemistry:

Kim

al.

Proc.

Natl.

Acad.

Sci.

USA

(1993)

4517

affinity

(15,

19).

Finally,

two

GCN4

specificity

mutants

alter

specificity

±2

even

though

the

original

(and

possibly

the

substituted)

amino

acids

not

contact

±2.

These

observa-

tions

are

not

simply

artifacts

the

mutant

proteins

because,

discussed

above,

most

them

have

counterparts

other

bZIP

proteins.

Adjustments

the

a-helical

geometry

the

GCN4

fork

segment

and

basic

region

are

also

likely

accommodate

the

different

half-site

spacings

the

AP-1

and

ATF/CREB

sites.

Residue

247,

which

does

not

contact

DNA

but

lies

the

fork

between

the

leucine

zipper

and

basic

region,

important

for

half-site

spacing.

Assuming

that

AP-1

and

ATF/CREB

half-

sites

are

contacted

the

same

GCN4

residues,

then

the

protein

must

sufficiently

flexible

allow

rotation

360

and

translation

~3.3

between

half-sites,

while

maintain-

ing

these

protein-DNA

contacts.

This

amount

flexibility

unprecedented

other

DNA-binding

proteins,

presumably

because

tertiary

folding

constraints

limit

movement

within

other

DNA-binding

domains.

Comparative

analysis

bZIP

protein

sequences

and

their

DNA-binding

specificities

provides

independent

argument

for

conformational

variations

basic

regions.

The

five

GCN4

residues

that

make

base-pair

contacts

(6)

are

very

highly

conserved

bZIP

proteins;

Asn-235

and

Arg-243

are

invariant,

whereas

Ala-238/Ala-239

and

Ser/Cys-242

are

present

>80%

bZIP

domains

(19).

Nevertheless,

bZIP

proteins

can

differ

considerably

their

DNA-binding

spec-

ificities.

This

situation

marked

contrast

that

observed

helix-turn-helix

proteins

which

amino

acid

similarity

the

recognition

surface

strongly

correlated

with

DNA-

binding

specificity.

Nonconserved

residues

the

basic

re-

gion

must

play

crucial

role

the

different

DNA-binding

specificities

bZIP

proteins,

either

direct

base-pair

interactions

indirect

effects

the

conserved

quintet.

Both

mechanisms

require

conformational

variation

the

DNA

recognition

surface

from

that

the

GCN4-AP-1

complex.

These

differences

may

result

from

variations

the

a-helical

geometry

and/or

overall

orientation

the

major

groove

the

basic

region.

The

basic

regions

bZIP

domains

become

ordered

only

upon

association

with

target

DNA

(7-9)

and

are

not

con-

strained

tertiary

interactions

within

the

protein.

The

absence

rigid,

globular

structure

makes

plausible

that

basic

regions

bZIP

proteins

adopt

different

conformations

along

DNA.

Variable

conformations

given

basic

region

are

likely

allow

the

dual

specificities

C/EBP

and

the

GCN4

derivatives

described

here.

the

case

GCN4,

the

basic

region

held

the

major

groove

primarily

long

arginine

and

lysine

side

chains,

and

there

likely

some

flexibility

the

way

anchored.

these

basic

residues

are

highly

conserved,

such

flexibility

likely

general

feature

bZIP

domains.

precise

description

individual

protein-DNA

complexes

will

require

the

high-resolution

structures.

However,

the

combined

evidence

from

the

GCN4-AP-1

complex

structure,

the

sequence

comparison

bZIP

proteins,

and

the

structural

and

functional

interpreta-

tion

our

GCN4

specificity

mutants

provides

strong

case

that

adaptation

the

protein-DNA

interface

important

aspect

DNA-binding

specificity

bZIP

proteins.

This

work

was

supported

postdoctoral

fellowships

from

the

Damon

Runyon-Walter

Winchell

Foundation

(J.K.),

Human

Fron-

tiers

Sciences

(D.T.),

and

the

National

Institutes

Health

and

the

Lucille

Markey

Charitable

Trust

(T.E.)

and

research

grants

from

the

National

Institutes

Health

(GM30186

and

GM46555

K.S.).

Landschulz,

H.,

Johnson,

McKnight,

(1989)

Science

243,

1681-1688.

O'Shea,

K.,

Klemm,

D.,

Kim,

Alber,

(1991)

Science

254,

539-544.

Agre,

P.,

Johnson,

McKnight,

(1989)

Science

246,

922-926.

Talanian,

V.,

McKnight,

Kim,

(1990)

Science

249,

769-771.

Pu,

Struhl,

(1991)

Proc.

Natl.

Acad.

Sci.

USA

88,

6901-6905.

Ellenberger,

E.,

Brandl,

J.,

Struhl,

Harrison,

(1992)

Cell

71,

1223-1237.

O'Neil,

T.,

Hoess,

DeGrado,

(1990)

Science

249,

774-778.

Weiss,

A.,

Ellenberger,

T.,

Wobbe,

R.,

Lee,

P.,

Harrison,

Struhl,

(1990)

Nature

(London)

347,

575-578.

Patel,

L.,

Abate,

Curran,

(1990)

Nature

(London)

347,

572-575.

10.

Hill,

E.,

Hope,

A.,

Macke,

Struhl,

(1986)

Science

234,

451-457.

11.

Hope,

Struhl,

(1987)

EMBO

2781-2784.

12.

Oliphant,

R.,

Brandl,

Struhl,

(1989)

Mol.

Cell.

Biol.

2944-2949.

13.

Sellers,

W.,

Vincent,

Struhl,

(1990)

Mol.

Cell.

Biol.

10,

5077-5086.

14.

Hai,

T.,

Liu,

F.,

Allegretto,

A.,

Karin,

Green,

(1988)

Genes

Dev.

1216-1226.

15.

Tzamarias,

D.,

Pu,

Struhl,

(1992)

Proc.

Natl.

Acad.

Sci.

USA

89,

2007-2011.

16.

Vincent,

Struhl,

(1992)

Mol.

Cell.

Biol.

12,

5394-

5405.

17.

Harrison,

Aggarwal,

(1990)

Ann.

Rev.

Biochem.

59,

933-969.

18.

Moye-Rowley,

S.,

Harshman,

Parker,

(1989)

Genes

Dev.

283-292.

19.

Pu,

Struhl,

(1991)

Mol.

Cell.

Biol.

11,

4918-4926.

Biochemistry:

Kim

al.

Yap, a novel bZlP family of proteins in Saccharomyces cerevisiae

Article

Full-text available

Jan 1997

Organisms respond to environmental changes by modulating the expression of their genes. In eukaryotic organisms. AP-1 factors are involved in the cellular response to a wide variety of extraceliular stimuli. So far described, the AP-1 family of transcription factors in yeast comprises three members: Gcn4, Yap1, and Yap2. These proteins are structurally related by the presence of a bZIP domain, which mediates the dimerization and the specific DNA-binding activity. The most conserved feature of this domain is the basic region that directly interacts with DNA. Gcn4. the AP-1 factor best characterized, coordinates transcriptional activation in response to amino acid starvation and other environmental stimuli. Both Yapl and Yap2 support increased resistance to a variety of drugs and metals, while only Yap1 has a role in the oxidative stress response. Lack of correlation between the drug-sensitivity of yapl and yap2 deleted strains and drug-resistance mediated by the Yap overexpression, suggests the existence of other Yap-like proteins that functionally overlap with Yap1 and Yap2. We identify an extended Yap family of transcription factors and demonstrate that their specific DNA binding activity is distinct from conventional AP-1 factors. Although several of these Yap factors an activate transcription from target sequences, they respond differentiallyto environmental stress, suggesting that different Yap factors have distinct biological functions.

A targeted DNA substrate mechanism for the inhibition of HIV-1 integrase by inhibitors with antiretroviral activity

Article

Full-text available

Dec 2015

We recently reported that viral DNA could be the primary target of raltegravir (RAL), an efficient anti-HIV-1 drug, which acts by inhibiting integrase. To elucidate this mechanism, we conducted a comparative analysis of RAL and TB11, a diketoacid abandoned as an anti-HIV-1 drug for its weak efficiency and marked toxicity, and tested the effects of the catalytic cofactor Mg2+ (5mM) on drug-binding properties. We used circular dichroism and fluorescence to determine drug affinities for viral DNA LTRs and peptides derived from the integrase active site and DNA retardation assays to assess drug intercalation into DNA base pairs. We found that RAL bound more tightly to LTR ends than did TB11 and that Mg2+ significantly increased the affinity of both RAL and TB11. We also observed a good relationship between drug binding with processed LTR and strand transfer inhibition. This unusual type of inhibition was caused by Mg2+-assisted binding of drugs to DNA substrate, rather than to enzyme. Notably, while RAL bound exclusively to the cleavable/cleaved site, TB11 further intercalated into DNA base pairs and interacted with the integrase-derived peptides. These unwanted binding sites explain the weaker bioavailability and higher toxicity of TB11 compared with the more effective RAL

Molecular mechanisms of the protein-protein interaction–regulated binding specificity of basic-region leucine zipper transcription factors

Article

Aug 2019

It is well known that the DNA-binding specificity of transcription factors (TFs) is influenced by protein-protein interactions (PPIs). However, the underlying molecular mechanisms remain largely unknown. In this work, we adopted the cAMP-response element-binding protein (CREB) of the basic leucine zipper (bZIP) TF family as a model system, and a workflow of combined bioinformatics and molecular modeling analysis of protein-DNA interaction was tested. First, the multiple sequence alignment and SDPsite method were used to find potential bZIP family binding specificity determining positions (SDPs) within the protein-protein interaction region. Second, the mutation system was analyzed using molecular dynamics simulation. Molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) free energy calculations confirmed the enhancement of the binding affinity of the mutation, which was in agreement with experimental results. The root mean square fluctuation (RMSF) and hydrogen bonding changes suggested an open and close protein dimerization process after the system was mutated, which resulted in the change of the hydrogen bonding of the protein-DNA interface and a slight conformational change. We believe that this work will contribute to understanding the protein-protein interaction–regulated binding specificity of bZIP transcription factors.

Identification of a Transcriptional Activation Domain in Yeast Repressor Activator Protein 1 (Rap1) Using an Altered DNA-Binding Specificity Variant

Article

Full-text available

Feb 2017

Repressor Activator Protein 1 (Rap1) performs multiple vital cellular functions in the budding yeast Saccharomyces cerevisiae. These include regulation of telomere length, transcriptional repression of both telomere-proximal genes and the silent mating type loci, and transcriptional activation of hundreds of mRNA-encoding genes, including the highly transcribed ribosomal protein- and glycolytic enzyme-encoding genes. Studies of the contributions of Rap1 to telomere length regulation and transcriptional repression have yielded significant mechanistic insights. However, the mechanism of Rap1 transcriptional activation remains poorly understood because Rap1 is encoded by a single-copy essential gene and is involved in many disparate, essential cellular functions, preventing easy interpretation of attempts to directly dissect Rap1 structure-function relationships. Moreover, conflicting reports on the ability of Rap1-heterologous DNA-binding domain fusion proteins to serve as chimeric transcriptional activators challenge use of this approach to study Rap1. Described here is the development of an altered DNA-binding-specificity variant of Rap1 (Rap1AS). We used Rap1AS to map and characterize a 41-amino acid activation domain (AD) within the Rap1 C-terminus. We found that this AD is required for transcription of both chimeric reporter genes and authentic chromosomal Rap1 enhancer-containing target genes. Finally, as predicted for a bona fide AD, mutation of this newly identified AD reduced the efficiency of Rap1 binding to a known transcriptional coactivator TFIID-binding target, Taf5. In summary, we show here that Rap1 contains an AD required for Rap1-dependent gene transcription. The Rap1AS variant will likely also be useful for studies of the functions of Rap1 in other biological pathways.

Adaptability and Specificity in DNA Binding by trp Repressor

Chapter

Jan 1994

Cellular processes such as growth and differentiation are controlled by transcription factors that regulate gene expression by binding to specific DNA sites. Our present understanding of the mechanisms of repression and activation (gene regulation) has evolved from extensive studies on these sequence-specific DNA binding proteins and their DNA targets. A central question in these studies concerns how the protein distinguishes its target site(s) from an enormous background of nonspecific sites. Many regulatory proteins must recognize several closely related but nonidentical sites, or must be able to interact or combine with other factors or subunits that confer new DNA specificities. An emerging picture for the recognition process involves the mutual adjustment, or adaptation, of molecular surfaces to provide the required level of energetic interaction for specific recognition.

General and Cross-Pathway Controls of Amino Acid Biosynthesis

Chapter

Jan 1996

M. S. Sachs

Hogness and Mitchell (1954) initially reported that the level of Neurospora crassa tryptophan synthetase increased in a histidine auxotroph grown under conditions of histidine limitation. Carsiotis and Lacy (1965) then observed that levels of this enzyme, and another Trp biosynthetic enzyme, indole glycerol phosphate synthetase, increased in a variety of different His- mutants that were starved for His (Carsiotis and Lacy 1965). By 1974, it was established that either His or Trp starvation resulted in elevation of enzymes involved in Arg, His and Trp biosynthesis (Carsiotis and Jones 1974; Carsiotis et al. 1974). These studies demonstrated that addition of 3AT (3-amino-1,2,4triazole) to cultures resulted in His starvation. They introduced the concept of cross-pathway control (Carsiotis and Jones 1974) to describe “[the] phenomenon wherein starvation for a single amino acid causes derepression of biosynthetic enzymes of other amino acids as well as those of the deficient amino acid.” A similar phenomenon observed in Saccharomyces cerevisiae became known as the general control of amino acid biosynthesis (Guerzoni 1972; Schürch et al. 1974; Delforge et al. 1975; Wolfner et al. 1975).

RAMAN SPECTRAL STUDIES OF NUCLEIC-ACIDS .47. AN ALTERED SPECIFICITY MUTATION IN THE LAMBDA-REPRESSOR INDUCES GLOBAL REORGANIZATION OF THE PROTEIN-DNA INTERFACE

Article

Apr 1994

The lambda repressor exhibits structural characteristics of lock and key complementarity through the helix-turn-helix motif, and of induced fit by virtue of DNA-dependent folding of the N-terminal arm. In both cases, molecular recognition is mediated by direct contacts between amino acids and DNA bases. The extent to which such contacts function as discrete elements in a protein-DNA recognition code is not known. Because of the relevance of protein recognition to the broader issue of protein design, and because the lambda system serves as a prototype for gene regulation, we have employed laser Raman and H-1 NMR spectroscopy to compare free and operator-bound structures of A repressor variants which are known to exhibit altered DNA-binding specificities. Ex perimental design is based upon a previous biochemical study of mutations in the repressor N-terminal arm (K4Q) and helix-turn-helix motif (G48S) (Nelson, H. C, M., and Sauer, R. T. (1986) J. Mol. Biol. 192, 27-38). These mutations, which were originally isolated by loss of function (K4Q) and second-site reversion (G48S), are of particular interest in light of their complex effects on sequence specificity at multiple positions in the operator site (Benson, N., Adams, C., and Youderian, P. (1992) Genetics 130, 17-26). Laser Raman and H-1 NMR spectra of repressor variants carrying one (G48S) or two mutations (K4Q/G48S) are similar to those of the native wild type repressor and are in accord with the x-ray crystal structure. Remarkably, however, the complexes of wild type and mutant repressors exhibit extensive differences both in the global DNA structure and in the environments of key functional groups along the major groove. By demonstrating that single amino acid substitutions can induce global reorganization of a protein-DNA interface, the present results establish that repressor-operator recognition in solution cannot be explained in terms of a simple recognition code.

Site-directed saturation mutagenesis of yeast Gcn4p at codon 242

Article

Feb 1999

Gcn4p, a transcriptional activator protein of the yeast, Sacchromyces cerevisiae, binds to the specific sequence in the promoters of many amino acid biosynthetic genes for general control. The serine residue (Set 242) of Gcn4p directly contacts the DNA. Here, for inspecting the DNA binding properties and the level of transcriptional activation of Gcn4p, we introduced a polymerase chain reaction (PCR) site-directed saturation mutation library into the Ser 242 site using 2 outside primers and 2 oligonucleotides with its codons fully degenerated. The sequencing analysis of 146 samples revealed the even nucleotide distribution within the experimental error showing 23, 26, 25, and 26% frequency of U, C, A, and G bases, respectively. This method turned out to be a simple, fast, and economical method for constructing a library of all 20 amino acids at specific codon.

Sequence-specific DNA binding by short peptides

Chapter

Dec 2002

This chapter discusses the recent development in designing novel sequence-specific DNA-binding peptides by using a combination of synthetic organic, biochemical, and molecular biological approaches to study the principles of molecular recognition associated with protein–DNA interactions. Understanding recognition from the chemical and physical standpoints requires a better understanding of the energetic differences between the specific and nonspecific protein–DNA interactions. The chapter describes model systems, which addresses the issues of protein–protein and protein–DNA recognition in far greater detail than is possible with the native protein systems. These peptide dimers apply a steric constraint on the two DNA contact regions of the dimeric peptides because the formation of well-ordered dimer determines the relative orientation of each monomer. Another role for the protein dimerization domain is to modulate the cooperativity of DNA binding by noncovalent protein–protein interactions. The protein–protein interaction plays an essential role in both enhancing the selectivity of specific DNA binding and increasing the sensitivity of equilibrium binding to changes in protein concentration. The chapter also presents the model systems with noncovalent dimerization domains.

Synthesis of a Peptide-Intercalator Hybrid Based on the bZIP Motif From GCN4

Article

Feb 1996
TETRAHEDRON

An artificial peptide, designed to combine the DNA-recognition portion of the bZIP motif from the yeast transcription factor GCN4 with an intercalating portion, 9-aminoacridine, has been synthesised.

Folding transition in the DNA binding domain of GCN4 on specific binding to DNA

Article

Full-text available

Oct 1990

PROTEIN-DNA recognition is often mediated by a small domain containing a recognizable structural motif, such as the helix–turn–helix1 or the zinc-finger2. These motifs are compact structures that dock against the DNA double helix. Another DNA recognition motif, found in a highly conserved family of eukaryotic transcription factors including C/EPB, Fos, Jun and CREB, consists of a coiled-coil dimerization element—the leucine-zipper—and an adjoining basic region which mediates DNA binding3. Here we describe circular dichroism and 1NMR spectroscopic studies of another family member, the yeast transcriptional activator GCN44,5. The 58-residue DNA-binding domain of GCN4, GCN4-p, exhibits a concentration-dependent α-helical transition, in accord with previous studies of the dimerization properties of an isolated leucine-zipper peptide6. The GCN4-p dimer is ∼70% helical at 25 °C, implying that the basic region adjacent to the leucine zipper is largely unstructured in the absence of DNA. Strikingly, addition of DNA containing a GCN4 binding site (AP-1 site) increases the α-helix content of GNC4-p to at least 95%. Thus, the basic region acquires substantial α-helical structure when it binds to DNA. A similar folding transition is observed on GCN4-p binding to the related ATF/CREB site, which contains an additional central base pair. The accommodation of DNA target sites of different lengths clearly requires some flexibility in the GCN4 binding domain, despite its high α-helix content. Our results indicate that the GCN4 basic region is significantly unfolded at 25 °C and that its folded, α-helical conformation is stabilized by binding to DNA.

Mutations in the bZIP domain of yeast GCN4 alter DNA-binding specificity

Article

Full-text available

Apr 1992

The bZIP class of eukaryotic transcriptional regulators utilize a distinct structural motif that consists of a leucine zipper that mediates dimerization and an adjacent basic region that directly contacts DNA. Although models of the protein-DNA complex have been proposed, the basis of DNA-binding specificity is essentially unknown. By genetically selecting for derivatives of yeast GCN4 that activate transcription from promoters containing mutant binding sites, we isolate an altered-specificity mutant in which the invariant asparagine in the basic region of bZIP proteins (Asn-235) has been changed to tryptophan. Wild-type GCN4 binds the optimal site (ATGACTCAT) with much higher affinity than the mutant site (TTGACTCAA), whereas the Trp-235 protein binds these sites with similar affinity. Moreover, the Trp-235, Ala-235, and Gln-235 derivatives differ from GCN4 in their strong discrimination against GTGACTCAC. These results suggest a direct interaction between Asn-235 and the +/- 4 position of the DNA target site and are discussed in terms of the scissors-grip and induced-fork models of bZIP proteins.

ACR1, a yeast ATF/CREB repressor

Article

Dec 1992

Members of the mammalian ATF/CREB family of transcription factors, which are associated with regulation by cyclic AMP and viral oncogenes, bind common DNA sequences (consensus TGACGTCA) via a bZIP domain. In the yeast Saccharomyces cerevisiae, ATF/CREB-like sequences confer either repression or activation of transcription, depending on the promoter context. By isolating mutations that alleviate the repression mediated by ATF/CREB sites, we define a new yeast gene, ACR1, which encodes an ATF/CREB transcriptional repressor. ACR1 contains a bZIP domain that is necessary for homodimer formation and specific DNA binding to an ATF/CREB site. Within the bZIP domain, ACR1 most strongly resembles the mammalian cyclic AMP-responsive transcriptional regulators CREB and CREM; it is less similar to GCN4 and YAP1, two previously described yeast bZIP transcriptional activators that recognize the related AP-1 sequence (consensus TGACTCA). Interestingly, deletion of the ACR1 gene causes increased transcription through ATF/CREB sites that does not depend on GCN4 or YAP1. Moreover, extracts from acr1 deletion strains contain one or more ATF/CREB-like DNA-binding activities. These genetic and biochemical observations suggest that S. cerevisiae contains a family of ATF/CREB proteins that function as transcriptional repressors or activators.

ACR1, a yeast ATF/CREB repressor

Article

Jan 1993

The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted ?? Helices: Crystal structure of the protein-DNA complex

Article

Jan 1993
CELL

The yeast transcriptional activator GCN4 is 1 of over 30 identified eukaryotic proteins containing the basic region leucine zipper (bZIP) DNA-binding motif. We have determined the crystal structure of the GCN4 bZIP element complexed with DNA at 2.9 A resolution. The bZIP dimer is a pair of continuous alpha helices that form a parallel coiled coil over their carboxy-terminal 30 residues and gradually diverge toward their amino termini to pass through the major groove of the DNA-binding site. The coiled-coil dimerization interface is oriented almost perpendicular to the DNA axis, giving the complex the appearance of the letter T. There are no kinks or sharp bends in either bZIP monomer. Numerous contacts to DNA bases and phosphate oxygens are made by basic region residues that are conserved in the bZIP protein family. The details of the bZIP dimer interaction with DNA can explain recognition of the AP-1 site by the GCN4 protein.

The leucine zipper symmetrically positions the adjacent basic regions for specific DNA binding

Article

Sep 1991

The bZIP structural motif present in several eukaryotic transcription factors is defined by the leucine zipper, a coiled-coil dimerization interface, and an adjacent basic region that directly interacts with DNA. To examine the functional importance of the highly conserved spacing between the leucine zipper and the basic region, we have analyzed the DNA-binding ability of yeast GCN4 proteins containing amino acid insertions between these two subdomains. Proteins containing a surprisingly wide variety of seven-amino acid insertions, but none containing two-, four-, or six-amino acid insertions, are functional. However, heterodimers between wild-type GCN4 and functional derivatives containing seven amino acid insertions are unable to bind DNA. These observations provide strong experimental support for several aspects of the scissors grip and induced fork models for DNA-binding by bZIP proteins. Specifically, they demonstrate that continuous alpha-helices symmetrically diverging from the leucine zipper correctly position the two basic regions for specific binding to abutting DNA half-sites. In addition, the results indicate that GCN4 homodimers are primarily responsible for transcriptional activation in yeast cells.

Highly conserved residues in the bZIP domain of yeast GCN4 are not essential for DNA binding

Article

Nov 1991

Yeast GCN4 and the Jun oncoprotein are transcriptional activators that bind DNA via a bZIP domain consisting of a leucine zipper dimerization element and an adjacent basic region that directly contacts DNA. Two highly conserved alanines (Ala-238 and Ala-239 in GCN4) and an invariant asparagine (Asn-235) in the basic region have been proposed to play important roles in DNA sequence recognition by bZIP proteins. Surprisingly, these conserved residues can be functionally replaced in GCN4 and in a derivative containing the Jun basic region (Jun-GCN4). The ability of an amino acid to functionally substitute for Asn-235 does not correlate with its preference for assuming the N-cap position of an alpha helix. This finding argues against the proposal of the scissors grip model that the invariant asparagine forms an N cap that permits the basic region to bend sharply and wrap around the DNA. In contrast to a prediction of the induced fork model, the pattern of functional substitutions of the conserved alanines together with the results of uracil interference experiments suggests that Ala-238 and Ala-239 do not make base-specific DNA contacts. Finally, the Jun-GCN4 chimeric proteins appear much more active in vivo than expected from their DNA-binding properties in vitro. The mechanistic and evolutionary implications of these results are discussed.

Altered protein conformation on DNA binding by Fos and Jun

Article

Nov 1990

The protein products of the c-fos and c-jun proto-oncogenes (Fos and Jun, respectively) form a heterodimeric protein complex that interacts with the activator protein-1 (AP-1) binding site and regulates gene transcription in response to extracellular stimuli. Protein dimerization is mediated primarily by a coiled-coil-like structure termed the leucine-zipper and DNA binding occurs primarily through regions of each protein rich in basic amino acids that contact both strands of the AP-1 site. The precise nature of the protein-DNA interaction is unknown as studies concerned with dimerization and DNA binding by Fos and Jun have relied on indirect methods to investigate protein-protein-DNA interactions. Here we have developed assay systems using fluorescence spectroscopy and circular dichroism to monitor dimerization and DNA binding directly. The results indicate that the interaction of Fos and Jun with DNA results in an altered conformation of the protein dimers and an increased alpha-helical content. These techniques may have general application in studies concerning the interaction of transcriptional regulatory proteins with specific DNA target sequences.

DNA Recognition by Proteins with the Helix-Turn-Helix Motif

Article

Feb 1990

Mutations that define the optimal half-site for binding yeast GCN4 activator protein and identify an ATF/CREB-like represser that recognizes similar DNA sites

Article

Nov 1990

The yeast GCN4 transcriptional activator protein binds as a dimer to a dyad-symmetric sequence, indicative of a protein-DNA complex in which two protein monomers interact with adjacent half-sites. However, the optimal GCN4 recognition site, ATGA(C/G)TCAT, is inherently asymmetric because it contains an odd number of base pairs and because mutation of the central C.G base pair strongly reduces specific DNA binding. From this asymmetry, we suggested previously that GCN4 interacts with nonequivalent and possibly overlapping half-sites (ATGAC and ATGAG) that have different affinities. Here, we examine the nature of GCN4 half-sites by creating symmetrical derivatives of the optimal GCN4 binding sequence that delete or insert a single base pair at the center of the site. In vitro, GCN4 bound efficiently to the sequence ATGACGTCAT, whereas it failed to bind to ATGAGCTCAT or ATGATCAT. These observations strongly suggest that (i) GCN4 specifically recognizes the central base pair, (ii) the optimal half-site for GCN4 binding is ATGAC, not ATGAG, and (iii) GCN4 is a surprisingly flexible protein that can accommodate the insertion of a single base pair in the center of its compact binding site. The ATGACGTCAT sequence strongly resembles sites bound by the yeast and mammalian ATF/CREB family of proteins, suggesting that GCN4 and the ATF/CREB proteins recognize similar half-sites but have different spacing requirements. Unexpectedly, in the context of the his3 promoter, the ATGACGTCAT derivative reduced transcription below the basal level in a GCN4-independent manner, presumably reflecting DNA binding by a distinct ATF/CREB-like repressor protein. In other promoter contexts, however, the same site acted as a weak upstream activating sequence.

Adaptability at the protein-DNA interface is an important aspect of sequence recognition by bZIP proteins

Abstract and Figures

Recommended publications

Preferential binding of IFI16 protein to cruciform structure and superhelical DNA

Telomerase- and capping-independent yeast survivors with alternate telomere states

Kr=FCppel-associated boxes are potent transcriptional repression domains

Structural regularities of helicoidally-like biopolymers in the framework of algebraic topology: II....