PreprintPDF Available

The structure of SALSA/DMBT1 SRCR domains reveal the conserved ligand-binding mechanism of the ancient SRCR-fold

Authors:

Abstract and Figures

The scavenger receptor cysteine-rich (SRCR) family of proteins comprise more than 20 membrane-associated and secreted molecules. Characterised by the presence of one or more copies of the ∼110 amino acid SRCR domain, this class of proteins have widespread functions as anti-microbial molecules, scavenger- and signalling-receptors. Despite the high level of structural conservation of SRCR domains, no molecular basis for ligand interaction has been described. The SRCR protein SALSA, also known as dmbt1/gp340, is a key player in mucosal immunology. Based on detailed structures of the SALSA SRCR domains 1 and 8, we here reveal a novel universal ligand binding mechanism for SALSA ligands. The binding interface incorporates a dual cation binding site, which is highly conserved across the SRCR super family. Along with the well-described cation dependency on most SRCR domain-ligand interactions, our data suggest that the binding mechanism described for the SALSA SRCR domains is applicable to all SRCR domains. We thus propose to have identified in SALSA a conserved functional mechanism for ligand recognition by the SRCR class of proteins.
Content may be subject to copyright.
1
The structure of SALSA/DMBT1 SRCR domains reveal 1
the conserved ligand-binding mechanism of the ancient 2
SRCR-fold 3
4
5
Martin P. Reichhardt1,*, Vuokko Loimaranta2, Susan M. Lea1,3 and Steven Johnson1,* 6
7 1Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK 8
9 2Institute of Dentistry, University of Turku, Turku, Finland 10
11 3Central Oxford Structural Molecular Imaging Centre, University of Oxford, Oxford, OX1 12
3RE, UK 13
14 *Correspondence: steven.johnson@path.ox.ac.uk or martin.reichhardt@path.ox.ac.uk 15
16
17
18
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
2
Abstract 1
2
The scavenger receptor cysteine-rich (SRCR) family of proteins comprise more than 20 3
membrane-associated and secreted molecules. Characterised by the presence of one or more 4
copies of the 110 amino acid SRCR domain, this class of proteins have widespread 5
functions as anti-microbial molecules, scavenger- and signalling-receptors. Despite the high 6
level of structural conservation of SRCR domains, no molecular basis for ligand interaction 7
has been described. The SRCR protein SALSA, also known as dmbt1/gp340, is a key player 8
in mucosal immunology. Based on detailed structures of the SALSA SRCR domains 1 and 8, 9
we here reveal a novel universal ligand binding mechanism for SALSA ligands. The binding 10
interface incorporates a dual cation binding site, which is highly conserved across the SRCR 11
super family. Along with the well-described cation dependency on most SRCR domain-12
ligand interactions, our data suggest that the binding mechanism described for the SALSA 13
SRCR domains is applicable to all SRCR domains. We thus propose to have identified in 14
SALSA a conserved functional mechanism for ligand recognition by the SRCR class of 15
proteins. 16
17
Keywords 18
19
SRCR / SALSA / dmbt1 / gp340 / mucosal immunology 20
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
3
Introduction 1
2
The Salivary Scavenger and Agglutinin (SALSA), also known as gp340, ‘deleted in 3
malignant brain tumors 1’ (DMBT1) and salivary agglutinin (SAG), is a multifunctional 4
molecule found in high abundance on human mucosal surfaces [1-4]. SALSA has widespread 5
functions in innate immunity, inflammation, epithelial homeostasis and tumour suppression 6
[5-7]. SALSA binds and agglutinates a broad spectrum of pathogens including, but not 7
limited to, human immunodeficiency virus type 1, Helicobacter pylori, Salmonella enterica 8
serovar Typhimurium, and many types of streptococci [8-11]. In addition to its microbial 9
scavenging function, SALSA has been suggested to interact with a wide array of endogenous 10
immune defence molecules. These include secretory IgA, surfactant proteins A (SP-A) and D 11
(SP-D), lactoferrin, mucin-5B, and components of the complement system [1, 2, 12-18]. 12
SALSA thus engages innate immune defence molecules and has been suggested to 13
cooperatively mediate microbial clearance and maintenance of the integrity of the mucosal 14
barrier. 15
16
The 300-400 kDa SALSA glycoprotein is encoded by the DMBT1 gene. The canonical form 17
of the gene encodes 13 highly conserved scavenger receptor cysteine-rich (SRCR) domains, 18
followed by two C1r/C1s, urchin embryonic growth factor and bone morphogenetic protein-1 19
(CUB) domains that surround a 14th SRCR domain, and finally a zona pellucida (ZP) domain 20
at the C-terminus [19, 20]. The first 13 SRCRs are 109 amino acid (aa) domains found as 21
“pearls on a string” separated by SRCR interspersed domains (SIDs) (Figure 1a) [1, 21]. The 22
SIDs are 20-23 aa long stretches of predicted disorder containing a number of glycosylation 23
sites which have been proposed to force them into an extended conformation of roughly 7 nm 24
[7]. In addition to this main form, alternative splicing and copy number variation mechanisms 25
lead to expression of variants of SALSA containing variable numbers of SRCRs in the N-26
terminal region. 27
28
The SRCR protein super family include a range of secreted and membrane-associated 29
molecules, all containing one or more SRCR domains. For a number of these molecules the 30
SRCR domains have been directly implicated in ligand-binding. These include CD6 31
signalling via CD166, CD163 mediated clearance of the Haemoglobin-Haptoglobin complex, 32
Mac-2 binding protein’s (M2bp) interaction with matrix components, and the binding of 33
microbial ligands by the scavenger receptors SR-A1, SPα and MARCO [22-27]. While the 34
multiple SALSA SRCR domains likewise have been implicated in ligand-binding, the 35
molecular basis for its diverse interactions remain unknown. 36
37
To understand the multiple ligand binding properties of the SALSA molecule, we undertook 38
an X-ray crystallographic study to provide detailed information of the SALSA interaction 39
surfaces. We here provide the atomic resolution structures of SALSA SRCR domains 1 and 40
8. We identify cation-binding sites and demonstrate their importance for ligand binding. By 41
comparing our data to previously published structures of SRCR domains, we propose a 42
generalised binding mechanism for this ancient, evolutionarily conserved, fold. 43
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
4
Methods 1
Expression of recombinant proteins 2
Insect cell expression. Codon-optimized DNA (GeneArt, ThermoFisher) was cloned into a 3
modified pExpreS2-2 vector (ExpreS2ion Biotechnologies, Denmark) with a C-terminal His-4
6 tag. The purified plasmid was transformed into S2 cells grown in EX-CELL 420 (Sigma) 5
with 25 µL ExpreS2 Insect-TR 5X (ExpreS2ion Biotechnologies). Selection for stable cell 6
lines (4 mg/mL geneticin (ThermoFisher)) and expansion were carried out according to the 7
manufacturer’s instructions. E. coli expression. DNA strings (GeneArt, ThermoFisher) were 8
cloned into pETM-14 and transformed into M15pRep cells. Protein expression was carried 9
out in LB media (with 30 µg/mL kanamycin). Cells were induced with 1 mM IPTG. The 10
cultures were centrifuged (3,220 x g, 15 min) and the cell pellets resuspended and lysed in 11
PBS containing 1 mg/mL DNase and 1 mg/mL lysozyme. 12
Protein purification 13
14
SRCR-domains: Insect culture supernatant was collected by centrifugation (1000 x g at 30 15
min), filtered and loaded onto a Ni2+-chromatography column (1 ml, Roche), washed in 20 16
CV buffer (50 mM Tris, pH: 9.0, 200 mM NaCl). Bound protein was eluted with 250 mM 17
imidazole. Following this size-exclusion chromatography (SEC) was carried out on an S75 18
16/60 HR column(GE) equilibrated in 10 mM Tris, pH: 7.5, 200 mM NaCl. Spy-2: Lysed 19
cell pellets were homogenized and centrifuged at 20,000 x g for 30 min. The filtered 20
supernatant was loaded onto a Ni2+-chromatography column (5 ml, Qiagen) and washed in 20 21
CV buffer (50 mM Tris, pH: 8.5, 200 mM NaCl, 20 mM imidazole). Bound protein was 22
eluted with 250 mM imidazole, concentrated and subjected to SEC (S75 16/60 HR column, 23
GE). 24
25
Crystallization, X-ray data collection, and structure determination 26
27
Purified SRCR1 and SRCR8 were concentrated to 20 mg/ml. SRCR1 was mixed with an 28
equal volume of mother liquor containing 0.2 M MgCl2 hexahydrate, 10 % (w/v) PEG8000, 29
0.1 M Tris, pH: 7.0, and crystallised in 400 nl drops by the vapor diffusion method at 21 °C. 30
SRCR8 was mixed with an equal volume of mother liquor containing 0.1 M LiSO4, 20 % 31
(w/v) PEG6000, 0.01 M HEPES, pH: 6.5, and crystallised in 800 nl drops. For SRCR8 + 32
cations crystals were grown in 0.2 M MgCl2 hexahydrate, 20 % (v/v) isopropanol, 0.1 M 33
HEPES, pH: 7.5, and crystallised in 400 nl drops. The crystallization buffer was 34
supplemented with 10 mM Mg2+ and 10 mM Ca2+ 24 hours prior to freezing. All crystals 35
were cryoprotected in mother liquor supplemented with 30% glycerol and flash frozen in 36
liquid N2. Data were collected at a temperature of 80 K on beamlines I03 and I04 at the 37
Diamond Light Source (Harwell, UK), as specified in Table 1. The Structure of SRCR8 was 38
solved by molecular replacement using MolRep within CCP4 [28] with the structure of CD6 39
SRCR domain 3 (PDB ID 5a2e [27]). The structure of SRCR1 and SRCR8 soaked in cations 40
were solved by molecular replacement using the structure of SRCR8. Refinement and 41
rebuilding was carried out in Phenix and Coot [29, 30]. The structures are characterized by 42
the statistics shown in Table 1 with no Ramachandran outliers. Protein structure figures were 43
prepared using Pymol Version 2.0 (Schrödinger, LLC). 44
45
Hydroxyapatite binding assay 46
47
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
5
150 µl Hydroxyapatite nanoparticle suspension (Sigma) was washed into buffer (10 mM 1
HEPES, pH: 7.5, 150 mM NaCl, 1 mM Ca2+). Beads were incubated in 80 ul SRCR8, 2
SRCR8 D34A or SRCR8 D35A (all at 0.5 mg/ml) with shaking for 1 hour at RT. Beads were 3
spun and washed 6x in 1 ml buffer. Bound protein was eluted in 100 µl 0.5 M EDTA, and 4
visualized by SDS-PAGE (4-20 %, Biorad) and Coomassie staining (Instant Blue, Expedeon, 5
UK). 6
Heparin binding assay 7
SRCR8, SRCR8 D34A or SRCR8 D35A in 10 mM HEPES, pH: 7.5, 10 mM NaCl, 1 mM 8
Ca2+ were loaded onto a HiTrap Heparin HP column (1 ml, GE). Bound protein was then 9
eluted with 10 mM HEPES, pH: 7.5, 10 mM NaCl, 20 mM EDTA. 10
Spy-2 binding assay 11
On a Maxisorp plate (Nunc, Denmark) 100 µl purified Spy-2 was coated O/N at 4 oC in a 12
concentration ranging from 0.032 – 3.2 µM in coating buffer (100 mM NaHCO3-buffer, pH: 13
9.5). The plate was blocked in 1 % gelatine in PBS, and SRCR8, SRCR8 D34A, and SRCR8 14
D35A were added (all at 7.1 µM in 10 mM HEPES, pH 7.5, 150 mM NaCl, 1 mM Ca2+, 0.05 15
% Tween20). Bound protein was detected with monoclonal anti-SALSA antibody diluted 16
1:10,000 (1G4, Novus Biologicals, UK) and HRP-conjugated rabbit anti-mouse antibody 17
1:10,000 (Promega, W4028). The plate was developed with 2,2’-Azino-bis(3-18
ethylbenzothiazoline-6-sulfonic acid) (Sigma), and analysed by spectrophotometry at 405 19
nm. To test calcium-specific dependency of the interaction the WT-assay above was repeated 20
in a buffer containing: 10 mM HEPES, pH 7.5, 150 mM NaCl, 1 mM Mg2+, 1 mM EGTA, 21
0.05 % Tween20. 22
Data availability 23
24
Structure factors and coordinates from this publication have been deposited to the PDB 25
database https://www.wwpdb.org, and assigned the identifiers: SRCR1 pdbid: 6sa4, SRCR8 26
pdbid: 6sa5, SRCR8 with calcium pdbid: 6san. 27
28
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
6
Results 1
2
The scavenger receptor SALSA has a very wide range of described ligands, including 3
microbial-, host innate immune- and extracellular matrix (ECM)-molecules. To understand 4
the very broad ligand-binding abilities of the SALSA molecule, we applied a crystallographic 5
approach to determine the structure of the ligand binding SRCR domains of SALSA. SRCR 6
domains 1 and 8 (SRCR1 and SRCR8) were expressed in Drosophila melanogaster 7
Schneider S2 cells with a C-terminal His-tag. The domains were purified by Ni-chelate and 8
size-exclusion chromatography, crystallised, and the structures were solved by molecular 9
replacement. This yielded the structures of SRCR1 and SRCR8 at 1.77 Å and 1.29 Å, 10
respectively (Fig 1). For crystallographic details, see Table 1. 11
12
13
Figure 1: Crystal structure of SALSA domains SRCR1 and SRCR8. (A) Schematic representation of the 14
domain organization of full-length SALSA. SRCR1 and SRCR8 are highlighted in green and blue, respectively. 15
All SRCR domains share > 88 % sequence identity. 100 % identity is shared by SRCR3 and 7 (yellow) and 16
SRCR10 and 11(purple). (B) Front and side views of an overlay of SRCR1 (green) and SRCR8 (blue), showing 17
4 conserved disulphide bridges (yellow). Both SRCR1 and SRCR8 were found to coordinate a metal, modelled 18
as Mg2+ (dark green). The limited structural variation observed between SRCR1 and SRCR8 (92 % sequence 19
identity) imply that these are appropriate representations of all SALSA SRCR domains 1-13. 20
21
22
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
7
Table 1 Data collection and refinement statistics (molecular replacement) 1
2
SRCR1 (pdbid 6sa4)
SRCR8 (pdbid 6sa5)
Data collection
Space group
P 21 21 21
P 21 21 21
Cell dimensions
a, b, c (Å)
36.77, 45.19, 69.37
32.82, 40.82, 62.99
α, β, γ (°)
90.00, 90.00, 90.00
90.00, 90.00, 90.00
Resolution (Å)
28.52-1.77 (1.80-1.77)
40.82-1.29 (1.31-1.29)
Rmerge
0.17 (1.36)
0.117 (1.12)
I / σI
8.1 (1.1)
8.6 (0.9)
Completeness (%)
99.8 (99.3)
100 (99.6)
Redundancy
6.3 (6.6)
11.4 (8.0)
Refinement
Resolution (Å)
28.52-1.77 (1.95-1.77)
34.27-1.29 (1.35-1.29
No. reflections
11730
21958
Rwork / Rfree
0.186/0.229
(0.266/0.348)
0.155/0.188 (0.326-0.279)
No. atoms
Protein
824
829
Ligand/ion
26
18
Water
109
124
B-factors
Protein
22.36
16.80
Ligand/ion
49.33
54.03
Water
31.15
33.66
R.m.s. deviations
Bond lengths (Å)
0.006
0.009
Bond angles (°)
0.809
1.026
*Number of xtals was one for each structure. *Values in parentheses are for highest-resolution shell. Data from 3
SRCR1 and SRCR8 crystals were collected on Diamond beamline I04, while data for SRCR8MgCa were 4
collected on beamline I03. 5
6
The SALSA SRCR domains reveal a classic globular SRCR-fold, with 4 conserved 7
disulphide bridges, as described for the SRCR type B domains. The fold contains one alpha-8
helix and one additional single helical turn. The N- and C-termini come together in a four-9
stranded beta-sheet. SRCR1-13 are highly conserved, with 88-100 % identity. Variation is 10
only observed in 9 out of the 109 amino acid residues, all of these observed in peripheral 11
loops, without apparent structural significance. Combined, the data from SRCR1 and SRCR8 12
are thus valid representations of all SALSA SRCR domains. Both SRCR1 and SRCR8 are 13
stabilized by a metal-ion buried in the globular fold. The placement suggests the ion is bound 14
during the folding of the domain, and is modelled as Mg2+, which is present in the original 15
expression medium and in the crystallization conditions of SRCR1. 16
17
So far, all described ligand-binding interactions of SALSA have been shown to be Ca2+-18
dependent. We therefore proceeded to address the cation binding potential of the SRCR 19
domains in further detail. Crystals were grown of purified SRCR8, and Ca2+ and Mg2+ were 20
soaked in before freezing. This yielded the crystal structure of SRCR8 with the original Mg2+ 21
ion, site 1, and two additional cations bound, sites 2 and 3, SRCR8MgCa (Fig 2). All three 22
sites are class three cation binding sites, with the coordination obtained from residues in 23
distant parts of the sequence [31]. 24
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
8
1 Figure 2: Crystal structure of SRCR8MgCa bound indicates mechanism of ligand-binding. (A) Surface 2
charge distribution of SRCR8MgCa (calculated without the presence of cations) shows a positive cluster on one 3
side with a strong negative cluster on the other. The negative cluster expands across approximately 300 Å2 and 4
mediates the binding of three cations. (B) Representation of the residues coordinating the three cations. The 5
upper Mg2+, site 1, sits somewhat buried in the structure, and may be essential for structural stability. The lower 6
cations at sites 2 and 3 are more exposed. (C) Detailed view of the coordination of the upper Mg2+, site 1. The 7
coordination number of six is achieved by two waters, two backbone carbonyls and two sidechain carboxylates. 8
(D) Detailed view of the coordination of the cation at site 2, modelled as Mg2+ based on bond length and 9
coordination number. Here the coordination number of six is achieved by three waters and three sidechain 10
carboxylates. (e) Detailed view of the coordination of the cation at site 3, modelled as Ca2+, based on bond 11
length and the higher coordination number. The coordination number of eight is achieved by one water, four 12
sidechain carboxylates, one sidechain amide and a glycerol. 13
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
9
A surface charge analysis of the SRCR domain showed a strong electronegative surface 1
generated by a number of loop extensions. Soaking of the SRCR8-crystals in cations revealed 2
that this surface binds three cations, a Mg2+ at the base of the alpha helix (also observed in 3
the initial SRCR1 and SRCR8 structures), and two in close proximity to each other, tying the 4
superficial loop at aa1078-1085 to the rest of the domain. The Mg2+ at site 1 is coordinated 5
by the backbone carbonyl groups of S1021 and V1060, as well as the side chains of D1023 6
and D1026, and 2 waters, and is buried in the domain fold. The Mg2+ at site 2 is coordinated 7
by D1019, D1020 and E1086 plus 3 waters, while the Ca2+ at site 3 is coordinated by the 8
sidechains of D1020, D1058, D1059 and N1081 with additional contributions from a water 9
and a glycerol molecule from the cryoprotection. In contrast to the Mg2+ at site 1, these 10
cations are exposed on the surface of the domain. Assignment of the ions at the paired site 11
was carried out based on analysis of the bond lengths and coordination number of the two 12
sites. Mg2+- bonds are typically shorter than those of Ca2+, and Ca2+ is often observed to have 13
a higher coordination number [32]. However, it is worth noting that either site could 14
accommodate either cation depending on local concentration.
15
16
A key feature of the cation binding sites 2 and 3 is that the protein only contributes a fraction 17
of the coordination sphere, with the remainder contributed by waters or hydroxyl containing 18
small molecules from the crystallisation solution. According to the literature, the majority of 19
described SALSA ligands are negatively charged. Thus, the surface-exposed cations likely 20
provide a mechanism for ligand binding for the SALSA SRCR domains, whereby the anions 21
of the ligand substitute for the waters or the glycerol at site 3 observed in our structure. To 22
test this hypothesis, site directed mutagenesis was employed targeting the key residues 23
coordinating site 2 and 3. Included in the further analysis were single mutations D1019A and 24
D1020A. While mutation of D1019 is expected to only disrupt binding of cations at site 2, 25
mutation of the shared D1020, will likely affect binding of both cations. 26
27
As SALSA recognizes a very broad range of biological ligands, we set out to test the effect of 28
SRCR domain point mutations on interactions with a wide array of biological ligands. These 29
included binding to: (1) hydroxyapatite, a phosphate rich mineral essential for the binding of 30
SALSA to the teeth-surface, where it mediates anti-microbial effects [33]; (2) heparin, a 31
sulphated glycosaminoglycan as a mimic for the ECM/cell surface, for which binding of 32
SALSA has been described to affect cellular differentiation as well as microbial colonisation 33
[34]; (3) Group A Streptococcus surface protein, Spy0843, a Leucine-rich repeat (Lrr) protein 34
demonstrated to bind to SALSA [35] (Fig 3). 35
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
10
1
Figure 3: Mutating the cation-binding residues of SRCR domains abolish function. Through multiple 2
ligand binding assays, we demonstrated the functional importance of calcium binding by the SRCR domains. 3
Mutations affecting site 2 (D1019A) and mutations affecting sites 2 and 3 (D1020A) both abolish function. (A) 4
WT and mutant forms of SRCR8 were incubated with hydroxyapatite beads in a Ca2+ containing buffer. After 5
extensive washing, bound protein was eluted with EDTA. Eluted fractions were run on a 4 - 20 % SDS-PAGE 6
gel and visualized by Coomassie staining. Only WT SRCR8 bound hydroxyapatite. (B) WT and mutant forms 7
of SRCR8 were flown over a Heparin (HiTrap HP, 1 ml) column in a Ca2+-containing buffer. Protein bound to 8
the column was eluted with 0.5 M EDTA. Only WT SRCR8 bound the Heparin-column. Traces: SRCR8 (blue), 9
D1020A (pink), D1019A (red), conductivity (brown). (C) In an ELISA-based setup, a concentration range of 10
the Spy-2 domain of Spy0843 was coated (0.032 3.2 µM). WT and mutant SRCR8 domains were added in 11
molar excess (7.1 µM), and binding was detected with a monoclonal anti-SALSA antibody. Binding was only 12
observed for WT SRCR8. (D) Overview of ligand binding studies, + denotes binding, - denotes no binding. 13
14
The different binding assays provide an understanding of a generalized binding mechanism 15
of the SALSA SRCR domains. While the WT SRCR domain bound to all three ligands, both 16
of the cation binding site mutations, D1019A and D1020A, abolished binding. This is 17
consistent with the bound cations acting as a bridge for ligand interaction, and thus provides a 18
mechanistic explanation for the binding properties of the SALSA SRCR domains. In the 19
literature, SALSA ligand binding has been described as specifically calcium-dependent. To 20
verify this, we conducted binding assays in a MgEGTA containing buffer (Figure 3C). The 21
exchange of magnesium for calcium abolished ligand interactions. This confirms the 22
calcium-specific mediation of binding. As mutation of site 2 alone (modelled as Mg2+ in our 23
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
11
structure) abolish binding, our data suggest that Ca2+ will occupy both site 2 and site 3 under 1
physiological conditions. 2
3
All known members of the SRCR-super family share a very high degree of identity, both at 4
sequence and structural level. An FFAS search [36] of the SRCR8 sequence showed highest 5
similarities to CD163 SRCR5 (score: -65.4, 46 % identity), Mac-2-binding protein (M2bp) 6
(score: -64.4, 54 % identity), neutrotrypsin (score: -61.5, 50 % identity), MARCO (score: -7
59.4, 50 % identity), CD5 SRCR1 (score: -48.8, 26 % identity), and CD6 SRCR2 (score: -8
45.6, 60 % identity). Using the Dali server [37], searches for the cation-binding SRCR8MgCa 9
structure identified the two top hits as M2bp (pdbid: 1by2) and CD6 SRCR3 (pdbid: 5a2e). 10
These were identified with respective Z-scores of 21.4 (r.m.s.d. of 1.1 Å with 106 out of 112 11
residues aligned) and 20.4 (r.m.s.d. of 1.5 Å with 109 out of 109 residues aligned). Despite 12
the classical division of SRCR-super family proteins into group A and B, based on the 13
conserved 3 versus 4 cysteine bridges, the SRCR fold is very highly conserved, and the 14
SALSA SRCR domain structures correlate closely to both group A and group B SRCR-super 15
family domains (Fig 4A). 16
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
12
1 Figure 4: Conserved ligand binding motif across SRCR domains. SRCR-domains from multiple proteins 2
engage in cation-dependent ligand interactions. (A) Structural overlay of domains from seven SRCR-super 3
family proteins, all with ligand binding mediated through the SRCR domain. This reveals a highly conserved 4
fold across both type A and type B SRCR domains. SRCR1 (pale green), SRCR8MgCa (light blue), MARCO 5
(pdbid: 2oy3, sand), CD163 (pdbid: 5jfb, purple), CD5 (pdbid: 2OTT, grey), CD6 (pdbid: 5a2e, pink), M2bp 6
(pdbid: 1by2, yellow), and murine neurotrypsin (pdbid: 6h8m, teal) [38]. SALSA Magnesium: dark green, 7
MARCO Magnesium: red, Calcium: dark purple. (B) Surface representation of CD6 SRCR3, MARCO and 8
SALSA SRCR8 in same orientation. Point mutations with a verified impact on ligand-binding are highlighted in 9
red, indicating a conserved surface involved in ligand binding. (C) Clustal Omega (EMBL-EBI) sequence 10
alignment of SRCR domains from ten SRCR-super family proteins. Conservation of the cation binding sites are 11
displayed in green (site 2) and purple (site 3). Dark colouring indicates 100 % identity with the SALSA sites, 12
and lighter colouring indicates conservation of residues commonly implicated in calcium-binding (D, E, Q or 13
N). Cysteines are highlighted in yellow, and overall sequence identity is denoted by: * (100 %), : (strongly 14
similar chemical properties) and . (weakly similar chemical properties). 15
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
13
For members of the SRCR-super family where the SRCR domain directly partake in ligand 1
binding, both microbial and endogenous protein ligands have been described. For MARCO, 2
crystallographic structures identified a cation-binding site exactly corresponding to site 2 in 3
the SALSA SRCR8 domain [26] and point mutations of this site abolished function. 4
Common to the MARCO and SALSA SRCR domains is the cluster of negatively charged 5
residues coordinating the functionally important cations. A similar cluster is also observed in 6
the SRCR3 domain of CD6, and have been show by mutagenesis to be directly involved in 7
binding to the human surface receptor CD166. Indeed, a point mutation of D291A 8
(corresponding to D1019 of SALSA-SRCR8) reduced the ligand-binding potential of CD6 to 9
less than 10 % [27]. Furthermore, mutational studies of SRCR domains 2 and 3 from CD163 10
proved an involvement of this specific site in binding of the haemoglobin-haptoglobin 11
complex [39]. All structural evidence from mutational studies of SRCR domains thus indicate 12
a conserved surface mediating ligand binding (Fig 4B). 13
14
Interestingly, various levels of calcium-dependency on ligand interactions have been 15
described for all SRCR domains directly involved with binding. SR-A1, Spα, MARCO, CD5 16
and CD6 all rely on calcium for interactions with microbial ligands [24-26, 40-42]. 17
Furthermore, the binding of CD163 to the haemoglobin-haptoglobin complex is calcium-18
dependent, while CD6 also recognize endogenous surface structures (other than CD166) in a 19
calcium-dependent manner [43]. This suggests that the binding mechanism identified for 20
SALSA is a general conserved feature of all SRCR domains. Indeed, sequence alignment of 21
SRCR domains from ten different SRCR-super family proteins, all with SRCR domains 22
directly involved with ligand binding, reveal a very high level of conservation of the two 23
cation binding sites identified in the SALSA domains (Fig 4C). A search with the SRCR8 24
model on the Consurf server [44], show that D1019, D1020, D1058 and E1086 all score 7 25
(out of 9, highly conserved), while N1081 and D1059 score 6 and 4, respectively. Whenever 26
sequence identity is not conserved, substitutions are observed with other residues 27
overrepresented in calcium-binding sites (D, E, Q and N) [31, 32]. The cation binding sites 28
identified in the SALSA SRCR-domains, thus appear to be a highly conserved feature of the 29
general SRCR-fold. Structural inspection of the overlay of SALSA SRCR8 with the 30
corresponding region in the other known SRCR-domain structures, clearly show the potential 31
for cation binding at these sites. We thus propose that the cation-binding sites identified here 32
are an essential feature of the ancient SRCR-fold, and is a conserved mechanism responsible 33
for mediating ligand-binding in the class of SRCR-super family proteins. 34
35
36
37
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
14
Discussion 1
While SALSA has previously been described to interact with a wide range of biological 2
ligands, little has been known of the binding mechanisms. Furthermore, it has not been 3
known if SALSA interacts with the various ligands in a similar way, or if distinct binding 4
sites are utilized. Here we demonstrate that mutations of a dual cation-binding site interrupts 5
interactions with a representative selection of very different types of ligands. Specific 6
disruption of site 2 was sufficient to abolish ligand binding. We modelled the cation at site 2 7
in our crystal as Mg2+, based on an analysis of bond length, coordination number and 8
behaviour of crystallographic refinements with different cations modelled. However, 9
experimental data demonstrated that binding to the ligands tested was only dependent on the 10
presence of Ca2+, and not Mg2+. This is in line with previous descriptions of most SALSA-11
ligand interactions [5, 6, 45]. The generation of the SRCR8-crystal form with multiple 12
cations present, was achieved by soaking the crystals in 10 mM Mg2+ and 10 mM Ca2+. In the 13
extracellular compartment, however, the molar concentration of Ca2+ is higher than Mg2+, 14
and it is therefore likely that both sites 2 and 3 will be occupied by Ca2+ in a physiological 15
setting. The identification of this dual Ca2+-binding site thus provides an explanation for the 16
Ca2+-dependency of all SALSA-ligand interactions described in the literature, suggesting this 17
mechanism of binding is applicable to all SALSA SRCR ligands. Multiple studies have 18
proposed a role for the motif GRVEVxxxxxW in ligand binding [46-48]. The crystal data 19
show that this peptide sequence is buried in the SRCR fold, and we found no validation for a 20
role in ligand binding although mutations within this sequence are likely to perturb the 21
overall fold. This motif thus does not appear to have any physiological relevance as defining 22
a ligand-binding site. 23
24
The conserved usage of a single ligand-binding area for multiple interactions, suggests that 25
each SALSA SRCR domain engages in one ligand interaction. A common feature of the 26
ligands described here, as well as a number of other ligands such as DNA and LPS etc., is the 27
presence of repetitive negatively charged motifs [34].We analysed ligand binding by 28
individual SRCR domains in surface-plasmon resonance and isothermal calorimetry assays, 29
but interactions were observed to be very low affinity, making reliable measurements 30
unfeasible. This is not surprising for a molecule such as SALSA, where the molecular make-31
up with the full extension of 13 repeated units, interspersed by predicted non-structured 32
flexible SIDs, provide a molecule that can generate high avidity interactions with repetitive 33
ligands, despite having only low affinity interactions for an individual domain. Furthermore, 34
it has been suggested that SALSA in body secretions may oligomerize into larger complexes 35
[5, 49-51], probably via the C-terminal CUB and ZP domains. The repetitive nature, and 36
possible oligomerization, allow SALSA to not only engage with a repetitive ligand on one 37
surface (e.g. LPS or Spy0843 on microbes), but also engage in multiple ligand interactions 38
simultaneously. This would be relevant for its interactions with other endogenous defence 39
molecules, such as IgA, surfactant proteins and complement components, where a 40
cooperative effect on microbial clearance has been demonstrated [12, 16, 17, 52]. In addition, 41
this model of multiple ligand-binding would be relevant for microbes described to utilize 42
SALSA for colonization of the teeth or the host epithelium [10, 53, 54] (Fig 5). 43
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
15
1 2
3
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
16
Figure 5: The SALSA SRCR cation binding motif reveal a conserved mechanism for broad-spectrum 1
ligand interactions of SRCR-super family molecules. Based on mutational studies and structural information 2
across SRCR proteins, we propose a generalised mechanism of ligand interaction mediated by the cation-3
binding surface motif of the evolutionarily ancient SRCR fold. (Left side) SALSA has been described to bind a 4
broad range of ligands, incorporating into a complex network of binding-partners on the body surfaces as well 5
as the colonizing microbiota. The multiple SRCR domains of full-length SALSA bind repetitive targets (both 6
protein- and carbohydrate-structures) on the surface of microbes. The secreted fluid-phase molecule may thus 7
lead to microbial agglutination and clearance. However, the repetitive form of binding-sites will allow for 8
simultaneous binding to endogenous targets as well. This being e.g. 1) Binding of IgA, collectins and 9
complement components, to induce a cooperative anti-microbial effect. 2) Binding of hydroxyapatite on the 10
tooth surface. 3) ECM proteins and glycosaminoglycans, as well as mucus components of the epithelial surface 11
(such as Heparin, galectin 3 and mucins). (Right side) The cation-binding motif described in SALSA is 12
conserved in most other SRCR proteins. For CD6, CD163 and MARCO mutational studies support a crucial 13
role for this area in ligand interactions. CD163 binds the haemoglobin-haptoglobin complex as well as microbial 14
surfaces. CD6 binds endogenous ligands, but also engage in microbial binding. MARCO forms multimers, and 15
binds microbial surface structures. Other SRCR proteins with similar functions and conserved cation-sites 16
include SR-A1, Sp-alpha, SSc5D and M2bp. The functional role of the Neurotrypsin SRCR domains is not 17
known. The remarkable repetitive formation of multiple SRCR domains in many SRCR super family proteins, 18
with several domains containing a binding-site with a broad specificity, would supposedly allow for interactions 19
with multiple ligands simultaneously. The SRCR-fold thus appear to be an important functional component of 20
scavenging molecules engaging in complex network of interactions. The multiple SRCR domains shown for 21
SALSA, CD163 and Neurotrypsin are represented as copies of protein-specific SRCR domains with known 22
structure. Conserved cation-coordinating residues are highlighted in red. SRCR8MgCa (light blue), MARCO 23
(pdbid: 2oy3, sand), CD163 (pdbid: 5jfb, purple), CD6 (pdbid: 5a2e, grey), M2bp (pdbid: 1by2, yellow), and 24
murine neurotrypsin (pdbid: 6h8m, teal). 25
26
SALSA belongs to the SRCR-super family, a family of proteins characterised by the presence 27
of one or more copies of the ancient and evolutionarily highly conserved SRCR-fold [55]. 28
Though a couple of SRCR-domains, such as the ones found in complement Factor I and 29
Hepsin, have not been described to bind ligands directly, most others have [56, 57]. SRCR-30
super family members, such as SALSA, SR-A1, Spα, SSc5D, MARCO, CD6 and CD163 31
have broad scavenger-receptor functions, recognizing a broad range of microbial surface 32
structures and mediate clearance [24-26, 40]. While this potentially is relevant for all SRCR-33
super family proteins, some members of the family have distinct protein ligands, such as 34
CD6, CD163, and M2bp [22, 23, 27, 58]. With the exception of the CD6-CD166 interactions, 35
most described SRCR-ligand interactions are calcium-dependent, irrespective of the ligand 36
[24-26, 40-43]. A cation-binding site is conserved across SRCR domains, and multiple 37
studies support a role for this site in ligand binding. Even the specialised CD6-CD166 38
interaction utilise the same surface for binding, despite ‘having lost’ the calcium dependency 39
[27]. 40
41
Our studies have thus identified a dual cation binding site as essential for SALSA-ligand 42
interactions. Analysis of SRCR-folds from various ligand-binding domains reveal a very high 43
level of conservation of the residues at this dual site. The conservation of this site, along with 44
the well-described cation dependency on most SRCR-ligand interactions suggest that the 45
binding mechanism described for the SALSA SRCR domains is applicable to all SRCR 46
domains. We thus propose to have identified in SALSA a conserved functional mechanism 47
for the SRCR class of proteins. This notion is further supported by the specific lack of 48
conservation of these residues observed in the SRCR domains of complement Factor I and 49
Hepsin, where no ligand binding has been shown. The SRCR domains in these two molecules 50
may thus represent an evolutionary diversion from the common broad ligand binding 51
potential of the SRCR fold. The novel understanding of the SRCR domain generated here, 52
will allow for an interesting future targeting of other SRCR-super family proteins, with the 53
potential of modifying function. 54
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
17
Acknowledgements 1
We acknowledge Diamond Light Source and the staff of beamlines I03 and I04 for access 2
under proposal MX18069. The Central Oxford Structural Microscopy and Imaging Centre is 3
supported by the Wellcome Trust (201536). M.P.R was financially supported by grants from 4
the Wihuri foundation and the Finnish Cultural foundation. Staff and experimental costs in 5
S.M.L. lab were supported by a Wellcome Investigator Award (100298) and an MRC 6
programme grant (M011984). 7
8
Author contributions 9
M.P.R.: Designed and performed experiments. Protein purification, characterisation of 10
protein complexes, X-ray crystallography, structure determination and analysis. Wrote paper 11
with S.J. and S.M.L. 12
S.J.: Designed, supervised and performed experiments. Characterisation of protein 13
complexes. Wrote manuscript with M.P.R. and S.M.L. 14
V.L.: Performed experiments. Strain and plasmid construction, protein purification. 15
S.M.L.: Designed and supervised experiments. Wrote paper with M.P.R. and S.J. 16
17
18
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
18
References 1
2
1. Holmskov U, Lawson P, Teisner B, Tornoe I, Willis AC, Morgan C, Koch C, Reid KB 3
(1997) Isolation and characterization of a new member of the scavenger receptor 4
superfamily, glycoprotein-340 (gp-340), as a lung surfactant protein-D binding molecule. 5
J Biol Chem 272: 13743-13749 6
2. Thornton DJ, Davies JR, Kirkham S, Gautrey A, Khan N, Richardson PS, Sheehan JK 7
(2001) Identification of a nonmucin glycoprotein (gp-340) from a purified respiratory 8
mucin preparation: evidence for an association involving the MUC5B mucin. 9
Glycobiology 11: 969-977 10
3. Schulz BL, Oxley D, Packer NH, Karlsson NG (2002) Identification of two highly 11
sialylated human tear-fluid DMBT1 isoforms: the major high-molecular-mass 12
glycoproteins in human tears. Biochem J 366: 511-520 13
4. Reichhardt MP, Jarva H, de Been M, Rodriguez JM, Jimenez Quintana E, Loimaranta V, 14
Meindert de Vos W, Meri S (2014) The Salivary Scavenger and Agglutinin in Early Life: 15
Diverse Roles in Amniotic Fluid and in the Infant Intestine. J Immunol 16
5. Madsen J, Mollenhauer J, Holmskov U (2010) Review: Gp-340/DMBT1 in mucosal 17
innate immunity. Innate Immun 16: 160-167 18
6. Reichhardt MP, Meri S (2016) SALSA: A Regulator of the Early Steps of Complement 19
Activation on Mucosal Surfaces. Front Immunol 7: 85 20
7. Reichhardt MP, Holmskov U, Meri S (2017) SALSA-A dance on a slippery floor with 21
changing partners. Mol Immunol 89: 100-110 22
8. Prakobphol A, Xu F, Hoang VM, Larsson T, Bergstrom J, Johansson I, Frangsmyr L, 23
Holmskov U, Leffler H, Nilsson C et al. (2000) Salivary agglutinin, which binds 24
Streptococcus mutans and Helicobacter pylori, is the lung scavenger receptor cysteine-25
rich protein gp-340. J Biol Chem 275: 39860-39866 26
9. Hartshorn KL, White MR, Mogues T, Ligtenberg T, Crouch E, Holmskov U (2003) 27
Lung and salivary scavenger receptor glycoprotein-340 contribute to the host defense 28
against influenza A viruses. Am J Physiol Lung Cell Mol Physiol 285: L1066-76 29
10. Loimaranta V, Jakubovics NS, Hytonen J, Finne J, Jenkinson HF, Stromberg N (2005) 30
Fluid- or surface-phase human salivary scavenger protein gp340 exposes different 31
bacterial recognition properties. Infect Immun 73: 2245-2252 32
11. Rosenstiel P, Sina C, End C, Renner M, Lyer S, Till A, Hellmig S, Nikolaus S, Folsch 33
UR, Helmke B et al. (2007) Regulation of DMBT1 via NOD2 and TLR4 in intestinal 34
epithelial cells modulates bacterial recognition and invasion. J Immunol 178: 8203-8211 35
12. Rundegren J, Arnold RR (1987) Differentiation and interaction of secretory 36
immunoglobulin A and a calcium-dependent parotid agglutinin for several bacterial 37
strains. Infect Immun 55: 288-292 38
13. Boackle RJ, Connor MH, Vesely J (1993) High molecular weight non-immunoglobulin 39
salivary agglutinins (NIA) bind C1Q globular heads and have the potential to activate the 40
first complement component. Mol Immunol 30: 309-319 41
14. Tino MJ, Wright JR (1999) Glycoprotein-340 binds surfactant protein-A (SP-A) and 42
stimulates alveolar macrophage migration in an SP-A-independent manner. Am J Respir 43
Cell Mol Biol 20: 759-768 44
15. Oho T, Bikker FJ, Nieuw Amerongen AV, Groenink J (2004) A peptide domain of 45
bovine milk lactoferrin inhibits the interaction between streptococcal surface protein 46
antigen and a salivary agglutinin peptide domain. Infect Immun 72: 6181-6184 47
16. Leito JT, Ligtenberg AJ, van Houdt M, van den Berg TK, Wouters D (2011) The 48
bacteria binding glycoprotein salivary agglutinin (SAG/gp340) activates complement via 49
the lectin pathway. Mol Immunol 49: 185-190 50
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
19
17. Reichhardt MP, Loimaranta V, Thiel S, Finne J, Meri S, Jarva H (2012) The salivary 1
scavenger and agglutinin binds MBL and regulates the lectin pathway of complement in 2
solution and on surfaces. Front Immunol 3: 205 3
18. Madsen J, Sorensen GL, Nielsen O, Tornoe I, Thim L, Fenger C, Mollenhauer J, 4
Holmskov U (2013) A variant form of the human deleted in malignant brain tumor 1 5
(DMBT1) gene shows increased expression in inflammatory bowel diseases and interacts 6
with dimeric trefoil factor 3 (TFF3). PLoS One 8: e64441 7
19. Holmskov U, Mollenhauer J, Madsen J, Vitved L, Gronlund J, Tornoe I, Kliem A, Reid 8
KB, Poustka A, Skjodt K (1999) Cloning of gp-340, a putative opsonin receptor for lung 9
surfactant protein D. Proc Natl Acad Sci U S A 96: 10794-10799 10
20. Mollenhauer J, Holmskov U, Wiemann S, Krebs I, Herbertz S, Madsen J, Kioschis P, 11
Coy JF, Poustka A (1999) The genomic structure of the DMBT1 gene: evidence for a 12
region with susceptibility to genomic instability. Oncogene 18: 6233-6240 13
21. Mollenhauer J, Wiemann S, Scheurlen W, Korn B, Hayashi Y, Wilgenbus KK, von 14
Deimling A, Poustka A (1997) DMBT1, a new member of the SRCR superfamily, on 15
chromosome 10q25.3-26.1 is deleted in malignant brain tumours. Nat Genet 17: 32-39 16
22. Inohara H, Akahani S, Koths K, Raz A (1996) Interactions between galectin-3 and Mac-17
2-binding protein mediate cell-cell adhesion. Cancer Res 56: 4530-4534 18
23. Kristiansen M, Graversen JH, Jacobsen C, Sonne O, Hoffman HJ, Law SK, Moestrup SK 19
(2001) Identification of the haemoglobin scavenger receptor. Nature 409: 198-201 20
24. Santiago-Garcia J, Kodama T, Pitas RE (2003) The class A scavenger receptor binds to 21
proteoglycans and mediates adhesion of macrophages to the extracellular matrix. J Biol 22
Chem 278: 6942-6946 23
25. Sarrias MR, Rosello S, Sanchez-Barbero F, Sierra JM, Vila J, Yelamos J, Vives J, Casals 24
C, Lozano F (2005) A role for human Sp alpha as a pattern recognition receptor. J Biol 25
Chem 280: 35391-35398 26
26. Ojala JR, Pikkarainen T, Tuuttila A, Sandalova T, Tryggvason K (2007) Crystal 27
structure of the cysteine-rich domain of scavenger receptor MARCO reveals the presence 28
of a basic and an acidic cluster that both contribute to ligand recognition. J Biol Chem 29
282: 16654-16666 30
27. Chappell PE, Garner LI, Yan J, Metcalfe C, Hatherley D, Johnson S, Robinson CV, Lea 31
SM, Brown MH (2015) Structures of CD6 and Its Ligand CD166 Give Insight into Their 32
Interaction. Structure 23: 1426-1436 33
28. Winn MD, Ballard CC, Cowtan KD, Dodson EJ, Emsley P, Evans PR, Keegan RM, 34
Krissinel EB, Leslie AG, McCoy A et al. (2011) Overview of the CCP4 suite and current 35
developments. Acta Crystallogr D Biol Crystallogr 67: 235-242 36
29. Emsley P, Lohkamp B, Scott WG, Cowtan K (2010) Features and development of Coot. 37
Acta Crystallogr D Biol Crystallogr 66: 486-501 38
30. Adams PD, Baker D, Brunger AT, Das R, DiMaio F, Read RJ, Richardson DC, 39
Richardson JS, Terwilliger TC (2013) Advances, interactions, and future developments 40
in the CNS, Phenix, and Rosetta structural biology software systems. Annu Rev Biophys 41
42: 265-287 42
31. Pidcock E, Moore GR (2001) Structural characteristics of protein binding sites for 43
calcium and lanthanide ions. J Biol Inorg Chem 6: 479-489 44
32. Dokmanic I, Sikic M, Tomic S (2008) Metals in proteins: correlation between the metal-45
ion type, coordination number and the amino-acid residues involved in the coordination. 46
Acta Crystallogr D Biol Crystallogr 64: 257-263 47
33. Haukioja A, Loimaranta V, Tenovuo J (2008) Probiotic bacteria affect the composition 48
of salivary pellicle and streptococcal adhesion in vitro. Oral Microbiol Immunol 23: 336-49
343 50
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
20
34. End C, Bikker F, Renner M, Bergmann G, Lyer S, Blaich S, Hudler M, Helmke B, 1
Gassler N, Autschbach F et al. (2009) DMBT1 functions as pattern-recognition molecule 2
for poly-sulfated and poly-phosphorylated ligands. Eur J Immunol 39: 833-842 3
35. Loimaranta V, Hytonen J, Pulliainen AT, Sharma A, Tenovuo J, Stromberg N, Finne J 4
(2009) Leucine-rich repeats of bacterial surface proteins serve as common pattern 5
recognition motifs of human scavenger receptor gp340. J Biol Chem 284: 18614-18623 6
36. Zhou YF, Eng ET, Zhu J, Lu C, Walz T, Springer TA (2012) Sequence and structure 7
relationships within von Willebrand factor. Blood 120: 449-458 8
37. Holm L, Laakso LM (2016) Dali server update. Nucleic Acids Res 44: W351-5 9
38. Canciani A, Catucci G, Forneris F (2019) Structural characterization of the third 10
scavenger receptor cysteine-rich domain of murine neurotrypsin. Protein Sci 28: 746-755 11
39. Nielsen MJ, Andersen CB, Moestrup SK (2013) CD163 binding to haptoglobin-12
hemoglobin complexes involves a dual-point electrostatic receptor-ligand pairing. J Biol 13
Chem 288: 18834-18841 14
40. Sarrias MR, Farnos M, Mota R, Sanchez-Barbero F, Ibanez A, Gimferrer I, Vera J, 15
Fenutria R, Casals C, Yelamos J et al. (2007) CD6 binds to pathogen-associated 16
molecular patterns and protects from LPS-induced septic shock. Proc Natl Acad Sci U S 17
A 104: 11724-11729 18
41. Vera J, Fenutria R, Canadas O, Figueras M, Mota R, Sarrias MR, Williams DL, Casals 19
C, Yelamos J, Lozano F (2009) The CD5 ectodomain interacts with conserved fungal 20
cell wall components and protects from zymosan-induced septic shock-like syndrome. 21
Proc Natl Acad Sci U S A 106: 1506-1511 22
42. Nonaka M, Ma BY, Imaeda H, Kawabe K, Kawasaki N, Hodohara K, Kawasaki N, 23
Andoh A, Fujiyama Y, Kawasaki T (2011) Dendritic cell-specific intercellular adhesion 24
molecule 3-grabbing non-integrin (DC-SIGN) recognizes a novel ligand, Mac-2-binding 25
protein, characteristically expressed on human colorectal carcinomas. J Biol Chem 286: 26
22403-22413 27
43. Patel DD, Wee SF, Whichard LP, Bowen MA, Pesando JM, Aruffo A, Haynes BF 28
(1995) Identification and characterization of a 100-kD ligand for CD6 on human thymic 29
epithelial cells. J Exp Med 181: 1563-1568 30
44. Ashkenazy H, Abadi S, Martz E, Chay O, Mayrose I, Pupko T, Ben-Tal N (2016) 31
ConSurf 2016: an improved methodology to estimate and visualize evolutionary 32
conservation in macromolecules. Nucleic Acids Res 44: W344-50 33
45. Ligtenberg AJ, Karlsson NG, Veerman EC (2010) Deleted in malignant brain tumors-1 34
protein (DMBT1): a pattern recognition receptor with multiple binding sites. Int J Mol 35
Sci 11: 5212-5233 36
46. Bikker FJ, Ligtenberg AJ, Nazmi K, Veerman EC, van't Hof W, Bolscher JG, Poustka A, 37
Nieuw Amerongen AV, Mollenhauer J (2002) Identification of the bacteria-binding 38
peptide domain on salivary agglutinin (gp-340/DMBT1), a member of the scavenger 39
receptor cysteine-rich superfamily. J Biol Chem 277: 32109-32115 40
47. Bikker FJ, Ligtenberg AJ, End C, Renner M, Blaich S, Lyer S, Wittig R, van't Hof W, 41
Veerman EC, Nazmi K et al. (2004) Bacteria binding by DMBT1/SAG/gp-340 is 42
confined to the VEVLXXXXW motif in its scavenger receptor cysteine-rich domains. J 43
Biol Chem 279: 47699-47703 44
48. Leito JT, Ligtenberg AJ, Nazmi K, de Blieck-Hogervorst JM, Veerman EC, Nieuw 45
Amerongen AV (2008) A common binding motif for various bacteria of the bacteria-46
binding peptide SRCRP2 of DMBT1/gp-340/salivary agglutinin. Biol Chem 389: 1193-47
1200 48
49. Young A, Rykke M, Smistad G, Rolla G (1997) On the role of human salivary micelle-49
like globules in bacterial agglutination. Eur J Oral Sci 105: 485-494 50
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
21
50. Oho T, Yu H, Yamashita Y, Koga T (1998) Binding of salivary glycoprotein-secretory 1
immunoglobulin A complex to the surface protein antigen of Streptococcus mutans. 2
Infect Immun 66: 115-121 3
51. Crouch EC (2000) Surfactant protein-D and pulmonary host defense. Respir Res 1: 93-4
108 5
52. White MR, Crouch E, van Eijk M, Hartshorn M, Pemberton L, Tornoe I, Holmskov U, 6
Hartshorn KL (2005) Cooperative anti-influenza activities of respiratory innate immune 7
proteins and neuraminidase inhibitor. Am J Physiol Lung Cell Mol Physiol 288: L831-40 8
53. Jonasson A, Eriksson C, Jenkinson HF, Kallestal C, Johansson I, Stromberg N (2007) 9
Innate immunity glycoprotein gp-340 variants may modulate human susceptibility to 10
dental caries. BMC Infect Dis 7: 57 11
54. Stoddard E, Cannon G, Ni H, Kariko K, Capodici J, Malamud D, Weissman D (2007) 12
gp340 expressed on human genital epithelia binds HIV-1 envelope protein and facilitates 13
viral transmission. J Immunol 179: 3126-3132 14
55. Sarrias MR, Gronlund J, Padilla O, Madsen J, Holmskov U, Lozano F (2004) The 15
Scavenger Receptor Cysteine-Rich (SRCR) domain: an ancient and highly conserved 16
protein module of the innate immune system. Crit Rev Immunol 24: 1-37 17
56. Roversi P, Johnson S, Caesar JJ, McLean F, Leath KJ, Tsiftsoglou SA, Morgan BP, 18
Harris CL, Sim RB, Lea SM (2011) Structural basis for complement factor I control and 19
its disease-associated sequence polymorphisms. Proc Natl Acad Sci U S A 108: 12839-20
12844 21
57. Koschubs T, Dengl S, Durr H, Kaluza K, Georges G, Hartl C, Jennewein S, 22
Lanzendorfer M, Auer J, Stern A et al. (2012) Allosteric antibody inhibition of human 23
hepsin protease. Biochem J 442: 483-494 24
58. Burgueno-Bucio E, Mier-Aguilar CA, Soldevila G (2019) The multiple faces of CD5. J 25
Leukoc Biol 105: 891-904 26
27
.CC-BY 4.0 International licenseIt is made available under a
(which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint. http://dx.doi.org/10.1101/709915doi: bioRxiv preprint first posted online Jul. 21, 2019;
... Realistically, these are likely to mediate protein-target interactions, and/or contribute to Ca 2+ binding (Martinez et al., 2011;Canciani et al., 2019). Interestingly, in other proteins of the SRCR domain-bearing superfamily several domains of this type have been attributed with a Ca2 + dependant ligand recognition/binding function (Sarrias et al., 2005(Sarrias et al., , 2007Ojala et al., 2007;Reichhardt et al., 2019). While the limited molecular data on NT makes it currently difficult to determine whether this role is shared by NT's SRCR domains, the possibility is considerably intriguing. ...
Article
Full-text available
Synapse formation is a very elaborate process dependent upon accurate coordination of pre and post-synaptic specialization, requiring multiple steps and a variety of receptors and signaling molecules. Due to its relative structural simplicity and the ease in manipulation and observation, the neuromuscular synapse or neuromuscular junction (NMJ)—the connection between motor neurons and skeletal muscle—represents the archetype junction system for studying synapse formation and conservation. This junction is essential for survival, as it controls our ability to move and breath. NMJ formation requires coordinated interactions between motor neurons and muscle fibers, which ultimately result in the formation of a highly specialized post-synaptic architecture and a highly differentiated nerve terminal. Furthermore, to ensure a fast and reliable synaptic transmission following neurotransmitter release, ligand-gated channels (acetylcholine receptors, AChRs) are clustered on the post-synaptic muscle cell at high concentrations in sites opposite the presynaptic active zone, supporting a direct role for nerves in the organization of the post-synaptic membrane architecture. This organized clustering process, essential for NMJ formation and for life, relies on key signaling molecules and receptors and is regulated by soluble extracellular molecules localized within the synaptic cleft. Notably, several mutations as well as auto-antibodies against components of these signaling complexes have been related to neuromuscular disorders. The recent years have witnessed strong progress in the understanding of molecular identities, architectures, and functions of NMJ macromolecules. Among these, prominent roles have been proposed for neural variants of the proteoglycan agrin, its receptor at NMJs composed of the lipoprotein receptor-related protein 4 (LRP4) and the muscle-specific kinase (MuSK), as well as the regulatory soluble synapse-specific protease Neurotrypsin. In this review we summarize the current state of the art regarding molecular structures and (agrin-dependent) canonical, as well as (agrin-independent) non-canonical, MuSK signaling mechanisms that underscore the formation of neuromuscular junctions, with the aim of providing a broad perspective to further stimulate molecular, cellular and tissue biology investigations on this fundamental intercellular contact.
Article
Full-text available
Since its discovery, over 30 years ago, CD5 has been used as a marker to identify T cells, B1‐a cells, and B cell chronic lymphocytic leukemia cells. Throughout the years, many studies have described the functional relevance of CD5 as a modulator of T and B cell receptor signaling. However, it has not been until recent years that CD5 has emerged as a functional receptor in other areas of the immune system. Here, we review some of the most important aspects of CD5 as a modulator of TCR and BCR signaling, cell survival receptor both in T and B cells during health and disease, as well as the newly discovered roles of this receptor in thymocyte selection, T cell effector differentiation, and immune tolerance. CD5 was found to promote T cell survival by protecting autoreactive T cell from activation‐induced cell death, to promote de novo induction of regulatory T cells in the periphery, to modulate Th17 and Th2 differentiation, and to modulate immune responses by modulating dendritic cell functions. CD5 is overexpressed in Tregs and Bregs, which are fundamental to maintain immune homeostasis. The newly established roles of CD5 in modulating different aspects of immune responses identify this receptor as an immune checkpoint modulator, and therefore it could be used as a target for immune intervention in different pathologies such as cancer, autoimmune diseases or infections. Review on how CD5 regulates the balance between immunity and tolerance by modulating T/B cell signaling, T/B cell survival and T effector/Treg differentiation
Article
Full-text available
The degree of evolutionary conservation of an amino acid in a protein or a nucleic acid in DNA/RNA reflects a balance between its natural tendency to mutate and the overall need to retain the structural integrity and function of the macromolecule. The ConSurf web server (http://consurf.tau.ac.il), established over 15 years ago, analyses the evolutionary pattern of the amino/nucleic acids of the macromolecule to reveal regions that are important for structure and/or function. Starting from a query sequence or structure, the server automatically collects homologues, infers their multiple sequence alignment and reconstructs a phylogenetic tree that reflects their evolutionary relations. These data are then used, within a probabilistic framework, to estimate the evolutionary rates of each sequence position. Here we introduce several new features into ConSurf, including automatic selection of the best evolutionary model used to infer the rates, the ability to homology-model query proteins, prediction of the secondary structure of query RNA molecules from sequence, the ability to view the biological assembly of a query (in addition to the single chain), mapping of the conservation grades onto 2D RNA models and an advanced view of the phylogenetic tree that enables interactively rerunning ConSurf with the taxa of a sub-tree.
Article
Full-text available
The Dali server (http://ekhidna2.biocenter.helsinki.fi/dali) is a network service for comparing protein structures in 3D. In favourable cases, comparing 3D structures may reveal biologically interesting similarities that are not detectable by comparing sequences. The Dali server has been running in various places for over 20 years and is used routinely by crystallographers on newly solved structures. The latest update of the server provides enhanced analytics for the study of sequence and structure conservation. The server performs three types of structure comparisons: (i) Protein Data Bank (PDB) search compares one query structure against those in the PDB and returns a list of similar structures; (ii) pairwise comparison compares one query structure against a list of structures specified by the user; and (iii) all against all structure comparison returns a structural similarity matrix, a dendrogram and a multidimensional scaling projection of a set of structures specified by the user. Structural superimpositions are visualized using the Java-free WebGL viewer PV. The structural alignment view is enhanced by sequence similarity searches against Uniprot. The combined structure-sequence alignment information is compressed to a stack of aligned sequence logos. In the stack, each structure is structurally aligned to the query protein and represented by a sequence logo.
Article
Full-text available
Formation of the haptoglobin (Hp)-hemoglobin (Hb) complex in human plasma leads to a high affinity recognition by the endocytic macrophage receptor CD163. A fast segregation of Hp-Hb from CD163 occurs at endosomal conditions (pH <6.5). The ligand binding site of CD163 has previously been shown to involve the scavenger receptor cysteine-rich (SRCR) domain 3. This domain and the adjacent SRCR domain 2 of CD163 contain a consensus motif for a calcium-coordinated acidic amino acid triad cluster as originally identified in the SRCR domain of the scavenger receptor MARCO. Here we show that site-directed mutagenesis in each of these acidic triads of SRCR domains 2 and 3 abrogates the high affinity binding of recombinant CD163 to Hp-Hb. In the ligand, Hp Arg-252 and Lys-262, both present in a previously identified CD163 binding loop of Hp, were revealed as essential residues for the high affinity receptor binding. These findings are in accordance with pairing of the calcium-coordinated acidic clusters in SRCR domains 2 and 3 with the two basic Arg/Lys residues in the Hp loop. Such a two-point electrostatic pairing is mechanistically similar to the pH-sensitive pairings disclosed in crystal structures of ligands in complex with tandem LDL receptor repeats or tandem CUB domains in other endocytic receptors. Background: CD163 mediates endocytosis of haptoglobin-hemoglobin complexes formed upon intravascular hemolysis. Results: Acidic CD163 residues and basic haptoglobin residues mediate receptor-ligand binding in humans. Conclusion: A two-point electrostatic pairing between Ca²⁺-coordinated acidic clusters in receptors and basic ligand residues explains high-affinity CD163-(haptoglobin-hemoglobin) binding. Significance: The data add further evidence that Ca²⁺-dependent two-point electrostatic pairing is a common mechanism for coupling ligands to endocytic receptors.
Article
Full-text available
Hepsin is a type II transmembrane serine protease that is expressed in several human tissues. Overexpression of hepsin has been found to correlate with tumour progression and metastasis, which is so far best studied for prostate cancer, where more than 90% of such tumours show this characteristic. To enable improved future patient treatment, we have developed a monoclonal humanized antibody that selectively inhibits human hepsin and does not inhibit other related proteases. We found that our antibody, hH35, potently inhibits hepsin enzymatic activity at nanomolar concentrations. Kinetic characterization revealed non-linear, slow, tight-binding inhibition. This correlates with the crystal structure we obtained for the human hepsin-hH35 antibody Fab fragment complex, which showed that the antibody binds hepsin around α3-helix, located far from the active centre. The unique allosteric mode of inhibition of hH35 is distinct from the recently described HGFA (hepatocyte growth factor activator) allosteric antibody inhibition. We further explain how a small change in the antibody design induces dramatic structural rearrangements in the hepsin antigen upon binding, leading to complete enzyme inactivation.
Article
Full-text available
The complement system is a key component of innate and adaptive immune responses. Complement regulation is critical for prevention and control of disease. We have determined the crystal structure of the complement regulatory enzyme human factor I (fI). FI is in a proteolytically inactive form, demonstrating that it circulates in a zymogen-like state despite being fully processed to the mature sequence. Mapping of functional data from mutants of fI onto the structure suggests that this inactive form is maintained by the noncatalytic heavy-chain allosterically modulating activity of the light chain. Once the ternary complex of fI, a cofactor and a substrate is formed, the allosteric inhibition is released, and fI is oriented for cleavage. In addition to explaining how circulating fI is limited to cleaving only C3b/C4b, our model explains the molecular basis of disease-associated polymorphisms in fI and its cofactors.
Article
Full-text available
Deleted in Malignant Brain Tumors-1 protein (DMBT1), salivary agglutinin (DMBT1(SAG)), and lung glycoprotein-340 (DMBT1(GP340)) are three names for glycoproteins encoded by the same DMBT1 gene. All these proteins belong to the scavenger receptor cysteine-rich (SRCR) superfamily of proteins: a superfamily of secreted or membrane-bound proteins with SRCR domains that are highly conserved down to sponges, the most ancient metazoa. In addition to SRCR domains, all DMBT1s contain two CUB domains and one zona pellucida domain. The SRCR domains play a role in the function of DMBT1s, which is the binding of a broad range of pathogens including cariogenic streptococci, Helicobacter pylori and HIV. Mucosal defense proteins like IgA, surfactant proteins and lactoferrin also bind to DMBT1s through their SRCR domains. The binding motif on the SRCR domains comprises an 11-mer peptide in which a few amino acids are essential for binding (GRVEVLYRGSW). Adjacent to each individual SRCR domain are glycosylation domains, where the attached carbohydrate chains play a role in the binding of influenza A virus and Helicobacter pylori. The composition of the carbohydrate chains is not only donor specific, but also varies between different organs. These data demonstrate a role for DMBT1s as pattern recognition molecules containing various peptide and carbohydrate binding motifs.
Article
Full-text available
Dendritic cell (DC)-specific intercellular adhesion molecule-3-grabbing non-integrin (DC-SIGN) is a type II transmembrane C-type lectin expressed on DCs such as myeloid DCs and monocyte-derived DCs (MoDCs). Recently, we have reported that DC-SIGN interacts with carcinoembryonic antigen (CEA) expressed on colorectal carcinoma cells. CEA is one of the most widely used tumor markers for gastrointestinal cancers such as colorectal cancer. On the other hand, other groups have reported that the level of Mac-2-binding protein (Mac-2BP) increases in patients with pancreatic, breast, and lung cancers, virus infections such as human immunodeficiency virus and hepatitis C virus, and autoimmune diseases. Here, we first identified Mac-2BP expressed on several colorectal carcinoma cell lines as a novel DC-SIGN ligand through affinity chromatography and mass spectrometry. Interestingly, we found that DC-SIGN selectively recognizes Mac-2BP derived from some colorectal carcinomas but not from the other ones. Furthermore, we found that the α1-3,4-fucose moieties of Le glycans expressed on DC-SIGN-binding Mac-2BP were important for recognition. DC-SIGN-dependent cellular interactions between immature MoDCs and colorectal carcinoma cells significantly inhibited MoDC functional maturation, suggesting that Mac-2BP may provide a tolerogenic microenvironment for colorectal carcinoma cells through DC-SIGN-dependent recognition. Importantly, Mac-2BP was detected as a predominant DC-SIGN ligand expressed on some primary colorectal cancer tissues from certain parts of patients in comparison with CEA from other parts, suggesting that DC-SIGN-binding Mac-2BP bearing tumor-associated Le glycans may become a novel potential colorectal cancer biomarker for some patients instead of CEA.
Article
Full-text available
Scavenger receptors are innate immune molecules recognizing and inducing the clearance of non-host as well as modified host molecules. To recognize a wide pattern of invading microbes, many scavenger receptors bind to common pathogen-associated molecular patterns, such as lipopolysaccharides and lipoteichoic acids. Similarly, the gp340/DMBT1 protein, a member of the human scavenger receptor cysteine-rich protein family, displays a wide ligand repertoire. The peptide motif VEVLXXXXW derived from its scavenger receptor cysteine-rich domains is involved in some of these interactions, but most of the recognition mechanisms are unknown. In this study, we used mass spectrometry sequencing, gene inactivation, and recombinant proteins to identify Streptococcus pyogenes protein Spy0843 as a recognition receptor of gp340. Antibodies against Spy0843 are shown to protect against S. pyogenes infection, but no function or host receptor have been identified for the protein. Spy0843 belongs to the leucine-rich repeat (Lrr) family of eukaryotic and prokaryotic proteins. Experiments with truncated forms of the recombinant proteins confirmed that the Lrr region is needed in the binding of Spy0843 to gp340. The same motif of two other Lrr proteins, LrrG from the Gram-positive S. agalactiae and BspA from the Gram-negative Tannerella forsythia, also mediated binding to gp340. Moreover, inhibition of Spy0843 binding occurred with peptides containing the VEVLXXXXW motif, but also peptides devoid of the XXXXW motif inhibited binding of Lrr proteins. These results thus suggest that the conserved Lrr motif in bacterial proteins serves as a novel pattern recognition motif for unique core peptides of human scavenger receptor gp340.
Article
The hypothesis to be tested in this in –vitro study was that the salivary micelle like globules (SMGs) have a rôle in the agglutination of some oral bacteria. An attempt to determine the mechanisms for the interactions involved was also carried out. 4 laboratory and 4 native streptococci strains were tested. Human whole (HWS) and parotid (HPS) saliva was collected from 4 subjects, and SMGs were isolated from both salivas, and agglutination was recorded in the various bacterial suspensions over time. HPS, HWS and SMGs isolated from HPS and HWS caused typical agglutination patterns for the mutans strains. Salivary supernatants (without SMGs) caused a much delayed or no agglutination. Electron microscopy showed SMG-like structures on the surface of the agglutinated bacteria. Addition of pyrophosphate to HPS prevented agglutination, whereas guanidine HC1 prevented normal agglutination of a sanguis strain, and urea had no obvious effect. Together, these results indicate that the SMGs are important in the agglutination of streptococci, and that both calcium-dependent, electrostatic and hydrophobia interactions may be involved.