Content uploaded by Wolfgang Hans
Author content
All content in this area was uploaded by Wolfgang Hans
Content may be subject to copyright.
Available via license: CC BY 2.0
Content may be subject to copyright.
This Provisional PDF corresponds to the article as it appeared upon acceptance. Copyedited and
fully formatted PDF and full text (HTML) versions will be made available soon.
A comparative phenotypic and genomic analysis of C57BL/6J and C57BL/6N
mouse strains
Genome Biology 2013, 14:R82 doi:10.1186/gb-2013-14-7-r82
Michelle M Simon (m.simon@har.mrc.ac.uk)
Simon Greenaway (s.greenaway@har.mrc.ac.uk)
Jacqueline K White (jkw@sanger.ac.uk)
Helmut Fuchs (hfuchs@helmholtz-muenchen.de)
Valérie Gailus-Durner (gailus@helmholtz-muenchen.de)
Tania Sorg (tsorg@igbmc.fr)
Kim Wong (kw10@sanger.ac.uk)
Elodie Bedu (bedu@igbmc.fr)
Elizabeth J Cartwright (elizabeth.j.cartwright@manchester.ac.uk)
Romain Dacquin (romain.dacquin@etu.univ-lyon1.fr)
Sophia Djebali (sophia.djebali@inserm.fr)
Jeanne Estabel (je2@sanger.ac.uk)
Jochen Graw (graw@helmholtz-muenchen.de)
Neil J Ingham (neil.ingham@kcl.ac.uk)
Ian J Jackson (ian.jackson@igmm.ed.ac.uk)
Andreas Lengeling (andreas.lengeling@roslin.ed.ac.uk)
Silvia Mandillo (smandillo@ibc.cnr.it)
Jacqueline Marvel (jacqueline.marvel@inserm.fr)
Hamid Meziane (meziane@igbmc.fr)
Frédéric Preitner (Frederic.Preitner@unil.ch)
Oliver Puk (oliver.puk@helmholtz-muenchen.de)
Michel Roux (mjroux@igbmc.fr)
David J Adams (da1@sanger.ac.uk)
Sarah Atkins (s.atkins@har.mrc.ac.uk)
Abdel Ayadi (ayadi@igbmc.fr)
Lore Becker (lore.becker@helmholtz-muenchen.de)
Andrew Blake (a.blake@har.mrc.ac.uk)
Debra Brooker (d.brooker@har.mrc.ac.uk)
Heather Cater (h.cater@har.mrc.ac.uk)
Marie-France Champy (champy@igbmc.fr)
Roy Combe (combe@igbmc.fr)
Petr Danecek (pd3@sanger.ac.uk)
Armida di Fenza (a.difenza@har.mrc.ac.uk)
Hilary Gates (h.gates@har.mrc.ac.uk)
Anna-Karin Gerdin (akg@sanger.ac.uk)
Elisabetta Golini (egolini@ibc.cnr.it)
John M Hancock (jmhancock@gmail.com)
Wolfgang Hans (wolfgang.hans@helmholtz-muenchen.de)
Sabine M Hölter (hoelter@helmholtz-muenchen.de)
Genome Biology
© 2013 Simon et al.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Tertius Hough (t.hough@har.mrc.ac.uk)
Pierre Jurdic (pjurdic@ens-lyon.fr)
Thomas M Keane (tk2@sanger.ac.uk)
Hugh Morgan (h.morgan@har.mrc.ac.uk)
Werner Müller (werner.muller@manchester.ac.uk)
Frauke Neff (frauke.neff@helmholtz-muenchen.de)
George Nicholson (george.nicholson@stats.ox.ac.uk)
Bastian Pasche (Bastian.Pasche@helmholtz-hzi.de)
Laura-Anne Roberson (lr4@sanger.ac.uk)
Jan Rozman (jan.rozman@helmholtz-muenchen.de)
Mark Sanderson (ms14@sanger.ac.uk)
Luis Santos (l.santos@har.mrc.ac.uk)
Mohammed Selloum (selloum@igbmc.fr)
Carl Shannon (cs7@sanger.ac.uk)
Anne Southwell (a.southwell@har.mrc.ac.uk)
Glauco P Tocchini-Valentini (tocchini@ibc.cnr.it)
Valerie E Vancollie (vv2@sanger.ac.uk)
Sara Wells (s.wells@har.mr.ac.uk)
Henrik Westerberg (h.westerberg@har.mrc.ac.uk)
Wolfgang Wurst (wurst@helmholtz-muenchen.de)
Min Zi (min.zi@manchester.ac.uk)
Binnaz Yalcin (Binnaz.Yalcin@well.ox.ac.uk)
Ramiro Ramirez-Solis (rrs@sanger.ac.uk)
Karen P Steel (karen.steel@kcl.ac.uk)
Ann-Marie Mallon (a.mallon@har.mrc.ac.uk)
Martin Hrabě de Angelis (hrabe@helmholtz-muenchen.de)
Yann Herault (herault@igbmc.fr)
Steve DM Brown (s.brown@har.mrc.ac.uk)
ISSN 1465-6906
Article type Research
Submission date 18 March 2013
Acceptance date 28 June 2013
Publication date 31 July 2013
Article URL http://genomebiology.com/2013/14/7/R82
This peer-reviewed article can be downloaded, printed and distributed freely for any purposes (see
copyright notice below).
Articles in Genome Biology are listed in PubMed and archived at PubMed Central.
Genome Biology
© 2013 Simon et al.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
For information about publishing your research in Genome Biology go to
http://genomebiology.com/authors/instructions/
Genome Biology
© 2013 Simon et al.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1
A comparative phenotypic and genomic analysis of
C57BL/6J and C57BL/6N mouse strains
Michelle M Simon
1
, Simon Greenaway
1
, Jacqueline K White
2
, Helmut Fuchs
3a
,
Valérie Gailus-Durner
3a
, Tania Sorg
4
, Kim Wong
2
, Elodie Bedu
4
, Elizabeth J
Cartwright
5a
, Romain Dacquin
6
, Sophia Djebali
7
, Jeanne Estabel
2
, Jochen
Graw
3b
, Neil J Ingham
2
, Ian J Jackson
8
, Andreas Lengeling
9
, Silvia
Mandillo
10
, Jacqueline Marvel
6
, Hamid Meziane
4
, Frédéric Preitner
11
, Oliver
Puk
3b
, Michel Roux
4
, David J Adams
2
, Sarah Atkins
1
, Abdel Ayadi
4
, Lore
Becker
3a
, Andrew Blake
1
, Debra Brooker
1
, Heather Cater
1
, Marie-France
Champy
4
, Roy Combe
4
, Petr Danecek
2
, Armida di Fenza
1
, Hilary Gates, Anna-
Karin Gerdin
2
, Elisabetta Golini
10
, John M Hancock
1
, Wolfgang Hans
3a
, Sabine
M Hölter
3a
, Tertius Hough
1
, Pierre Jurdic
6
, Thomas M Keane
2
, Hugh Morgan
1
,
Werner Müller
5b
, Frauke Neff
3c
, George Nicholson
1
, Bastian Pasche
12
, Laura-
Anne Roberson
2
, Jan Rozman
3a
, Mark Sanderson
2
, Luis Santos
1
, Mohammed
Selloum
4
, Carl Shannon
2
, Anne Southwell
1
,Glauco P Tocchini-Valentini
10
,
Valerie E Vancollie
2
, Sara Wells
1
, Henrik Westerberg
1
, Wolfgang
Wurst
3b,15,16,17
, Min Zi
5a
, Binnaz Yalcin
13, 14
, Ramiro Ramirez-Solis
2
, Karen P
Steel
2
, Ann-Marie Mallon
1
, Martin Hrabě de Angelis
3a
, Yann Herault
4
and
Steve DM Brown
1
*
!"#
$
%&%'(&%)*!+
#
,
-./012(0
1)34'56789
,4
-./012(0
"234'56789
,
-./012(0
:'34'56789
9
(;(<(:13(3(11*(*3(31
=879!9(>?00)=
6
=020)0
3,@:%#
64
=0020)03,@:%#
8
(A;=,$?,9+2%8@!!7
=
7
13A6!+2%8@!!7=
5
(2014'&
14'19$ #
@
(0("2%(2014'1*
B14'1$6@#
!
'3-?*'34'(B1C,$
!!!6(
"0(0-0(0*'
,5$9
$
4=020
('220!6-
,
(0*'879!9(>=
9
0('220?!6-
6
0"2%2D/+C$
5!,,,
2
8
):>(0:#$5!5!9
7
".0/3'21>>'995!,,8
* '+E2"CC*?C4FCCC>
1E
C?CFCCC>G?C'FCCC>G
H;#C&?I>F'CC>G=?0F-?CG
BA?"?'F-?CG%'?'F'4C0G#
&'?>!F'CC>G1*?4F'4C0G1-4HC'?
-4CIC'FCC>G";?C;FC2?
C0G"I4?CI4FC0GH14?I$F'CC>GH
?'F-?CG3HC('?C'F>CC>G(HCH>?
CI>F'CCC>G+''?C''FCCC>G2
?F4CCGH;2?H;C2FC0G
-?-F'4C0G=AA:?=C:FCG2:>?
2C>F-?CG)?I)F'4C0G"2HC+?
F'CC>G+>?C>FCCC>G+4+?F'4C0G
*>?C4>F-?CG+*>?C4>FCCC>G"4
*>?C4>FCCC>G?CFCCC>G?=
?F'4C0G4?4F'4C0G:">?,F'CC>G
+=-?C0-FCCC>G?C'FCCC>G+?#
?>'F'CC>G14?'F4CCGHC>?
I>F'CG&0''?&0''CF-?CG4C
J?F-?CG%'?C'FCCC>G:
H?IF?C0G%C#?>$F'CC>G''?
C'FCCC>G&/?CFCC>G=>300?
=>C00F-?CG'3?'CFC)CC>G
*:?*C:F-?-CG?+4?9F'CC>G
H-?IC-F-?CG>?9F'CC>G
?CFCCC>G?F'4C0G?
7F'CC>G:C%?B?F4CCGB1CB?
22$F'CC>G&?CFCCC>G>&4'?
C4'FCCC>G&0''&?F-?CG.?
C-FCC>G*-K?*-CKFC)CC>G-??
F'CC>G#:C?#CF>CC>G+??
CFCCC>G4L+'?4F-?CGK
?F'4C0G2"CC*?C4FCCC>M
3
Abstract
Background
%467*<8H''
4'0C'
2%(#>(#'
67*<83'0'C4
'C0
2'00
000''C
Results
&>';067*<8H67*<830
3:200''2C
&,93:$'67*<8H67*<83'
;622'C(
204-'
1:'44'2
4'C&0'
)4000
C&2'0004
40'
442C
Conclusions
067*<8H67*<83'0
002)20
00C2;20
20'000424
C
Keywords: 4G;2G'G'
>>?G67*<8
4
Background
%20241
4(#>(#NO
00'C%1
2406!!!'2'2
'00'04;
C%(#4'4(
:'(:2)02'
44'6!!!0
20'0N$OC
+(#24''67*<831NOC
2'0(#0(:'
467*<83''4>'C
%067*<830I0''4'
2'467*<834
2400'C(
440'242
'67*<8H'200
44N,9ON6O'>N8O
42N7OC2'402
4067*<8H4>'C+;67*<8H
200;0'
N5@OC%'0'043H4'
44'
44'
00'0'C
5
%467*<84H>40
67*@95=$9C(@6=,$
3(03('67*<83C%
67*<8344=60'00
67*<83%=@@N!OC%67*<8H
67*<83240$$!'C10
'2467*<8H67*<834'09$7
3:0$3:!C5P4N>CO
0''C
($!)20'274'
)'402'68C7'
3:5C5!C$52B
44?2NOC(
;20
24C(2
''40;2'
04NOC00Q%04
0'2>40
2C('00Q%)42C
200Q%02'00
'00'2'0;0B
C+'20B'
I00>B
'2'0''0NOC%>
6
0'''2
-'400C
&0'0
67*<8367*<8H'''00
C&)004
';C('?';0
67*<8H'4*20
'?;02'3:B'
67*<8367*<8H0''2
0';C'40)2
'2B'''0';2
'07'I02C(
2>22)
4'2'
4C
Results
Genome sequence comparisons of C57BL/6N and C57BL/6J mice – SNPs
and small indels
&-?'067*<83 0'
67*<8H07IN!OC2000'
23:B4'
'24>00'
2;'C+>0'';
020''
7
';067*<8H'4*(C%4
040;C(
204'00<220
2G40''2
0')220'02'
'20;C%2
40';'24'02
C
%03:00'67*<8H67*<83
?'07:INOC&
2'+%>+%#N$O085$$!
2'67*<8H67*<83C'?
';067*<8H'4*(N,O4
02;'42'2
*67*<8H;C%0
2'0'2C%'0
R!C5-'24R,S6!C%
'0!7@9224I0
C
';:;'';'2'
240?'278$3:8@
C+'0067*<8H0
67*<830'C&
22067*<8H67*<83
'4244C"'
8
2,8,240
'-'':0C=
'685$,80244+0
%4C
''3?3:+2N96O'
'0)C%02;2
467*<8H67*<830,9'3:$'
98?'3:69?'C'2,$
3:'$00%4
C&0 2)Zp272
3H42084
;NO#C$!C
Genome sequence comparisons of C57BL/6N and C57BL/6J mice –
structural variants (SVs)
+''?'07
:INO400N8O0
66B467*<8H67*<83C+4N7O2
??'66B7;
40NO*H;'N,OC*'
4506600)
97!0?'C:'?
4;'520
,504467*<8H
67*<83400C+9,2
9
2B00'67*<8H67*<83%4$
'02C
=009,B2'%4$'$2
?''0'$200''0
'Vmn2r65B$86GNnt
'00'Cyp2a22:96!
0$40$$C0002>
4NntN5OG'92
024'04C
'')0'09,B
467*<8H67*<830$720
6?B244
B3%%4$C>422
67*<8H67*<83%4$C
Comprehensive phenotypic assessment of C57BL/6N and C57BL/6J mice
('1"1"(
20
67*<83%67*<8HC1"(0N@O
'44'06!!>>
'01'1
#>#:I(#'C00
1:0
'''$!'0
0@6>+0$='C%00'
10
'::40
1:4N$!O. ";9,'
981:4N@OC+
0>24')24
067*<83%C&2>2'
067*<8H67*<83%0
0H32C
=3H'?24'4
1:1'0-
C";000@0$!
00)'0?2'=+
+0$='C%1:24
'-1"(2
000);
1:C%04'-2
004C24'?2
0042'0
000'4
4'-24C"03
H041:4
4I0C(
43H04
C040
)44
2240040
+0,='$?C&
11
02
''?)2
004C203H
24''0
004NOC(
03H'1:'
01"(2
'00''0
0)0400
2'1:C
(-'2000'
'00043H,
C&0$7='
+0,='$C(2004
04C&
20
420='4+
0,='$40C2
002'0032H00
424C(20
24200C
&040''0
00$='$+0,='$4
'0C&0
4'0'?C
%0'4042
12
'000'
>402203<H00+
0,='$'C
Dysmorphology and Ophthalmology
&020I00'043
H' ?0>C240
'0040C+0
'20'2>N$O2
203H!C,9<'@6P(!C,!6?!C,$,0
*83T5@2C!C,@@<'@6P(!C,@9?!C9!90*8HT
$5R!C!!?C%000
;2'0'
4HE6C!PU!C6PT!G3E6C$U!C6PT!C&
0>42003'0;4
H=',+C%>0Cbr1
rd8
3
4C$!$N$$O'0>
422240>-000
=',+C='0
N$,O240224''4,
702,50=',*G'22
'44C%4042
'0'R!C!!H3='C,C
13
Cardiovascular
3?2241(V!!$
H'0'3''0
00004244)4C
242'0'3'
04'03
H4G0'Q%
2C&0'-4'1(V!$!
'032H$0
04C=0
04'004
020043HC
Metabolism
=0?01(V!!,42
0043H0
$
$
CH')'')
43'>
0C('0
')H23''C%
442H0
>H3''?0'
C%0020?0
1(V!!,2+0,
='$'C0'(:%%
1(V!!9'H23C%
42'4>0
14
Nnt '0HN5O4'
04C
"1 +"1' ?+444
1(V!!63204
4-'C="1 +
H23C(0*
"*"'HC20'
>"1 +C&
>µ%0='C90
>442'43
HC(02?
24>C=040
>02004='
9C
Neurological, behavioural and sensory
%I0043H2
01(V!!7='$''24
2H20)HC
%0042
03HN$9OC(''0000
C'2
32H'004
0C+0000''0
00C%1::0
;0-4
15
00''0'
G;G40
2'>200
42N$6OC2024
00'424C&200
'4'4)004C%'
4>03042'
??:+)N$8OC&
'000?04243
H40002>2
C('I0'32H
042'0C(0
00'4
0''
042422
C
&'<>)3H
='C6C&0'00043H4
0'?>'>C
2>'0'3
C0(:+#*('
:+'1(V!!5='9
H'02
0'02?0'
42C
16
&4004C"00'?
'1(V!!@42H'34
0000'00
04'?'0044'?'4
='4C'1(V!!2'000
0'403
000C&0)43H
4)''029
='6C&00H2>0
$0032''000
00,R!C!6C20$
9''000043
HC%'2
030044
'0'0C
&420432H
00C=003H&
-C32'0
0'H='8C
)'0222'
)0'C242'0
004C
1)0?41(V!='
220'0
00C+'!*
17
'?::?::9U1:
3H'000
C420?4
0043H?4::$::,'4
43HC2'?
4'000='
4+0,='$4040042
C%42'0400
'H3'+*
4000C
Clinical Chemistry
1)200
0'C%40
:1(V!$0'2'0
0:$1(V!60?0C"0
'
'0'0H23=''
)?C"00?00
'200'
24'3H='4C24
'2>4004''
0C"0
'00
><)-I='C+42
>0H
2'234Nnt
18
00'0?'H00
0'C24'
H3<
'000='4G0)
00C%'0'3
+:>'0'HC0
040='$C"00
00'C
Haematology
B'0:$
1(V!8C+40'0'4
'4&*4
*2B'4
='4C420
'4='$C(0
'00C%
400'0'
'C
Immune function and allergy
&2'40'
Listeria monocytogenesH3C=0H3
4Listeria monocytogenes0G2)00
Listeria43HC03
0Listeria90
19
HC%?03
,0H='7C
&3H0"3=*04-?
2C'0000
40H'C
34004
C(2'0203#22
'003#24(?$4(?$
H3G''00='
5C
Discussion
&0'00043H
2'40'442C
%0'2440'
004224>
204>'4C%
00043H;0
'000'4>'C
&067*<831024
'?NO0'0004
3H0'0
(#;0)04
'0067*<8HC
20
:0043H>40
4200'';'C&2'
20,83:00'';'
09,B4'C(206
'';2B''0C
+02>2;
'B2'
;C2'20'2
00>''2
04000'43H
C
&24''
0>>004243HC
=)24::'2
40>>0'3:
B43HC('2
422
'4243HC=0'4
)24-:
24C='3:90,8
24:C=BB2'0:07
06C(:2420>>4
00C=200
0-'>>'43
H0'42
00C=4>4
21
0-'240
0-'C
%40C=2
0000;2
43HC&3:2H3
002'4CB'40
''000004
000)''''00C
>>24'20'4>'0
)02C322
2442
'43H'22
3H>000>4
24>>C
093:26Crb1
Pdzk1PmchAdcy5Nlrp12:20
>41"(4'%4
,CCrb1rd84?'
C*0)'4'
'N$7OC%Crb1
rd8
'$>04
4;'4'0
'C&4223
'023H'
>C20'000
22
40243HC(
4rd803421
N$$OC=Pdzk1'2
>>44243HC=0
PmchAdcy5Nlrp1220400C
Pmch>>''4'
)'C322
'C3)'
4'''2'HC
Adcy5>>?22
24>C3222
Adcy5C34042'00'
2'240C2
4200
0''00'4'42
42C*''
0'03C=4
2PmchAdcy5:2:B100+
'200
0N$5OC3:$>40
N$@O3*3"02
CNlrp12>>00
2N,!OCH22'
Nlrp12?'0'C
20H"3=*?
2'''Nlpr12222
23
'00C34
C:2
''C
=B:,Chl1RptorNnt:2
'1"(%4,CChl1>>
4''4
24I>'CChl1 (31H
C23>'&
-H''00
34)4242
0CRptor>>'404
'0G2'
''G)'G
2CRptor %+HCH0
0''C
2404243H
00Rptor''
H)'23C=H
24''NntN5O
'0'
032HC(''2)'
000Nnt'000
'42Nnt4
C0)42Rptor>>
2'0H%+
Rptor''C%0
24
2'000Nnt''2
%+Rptor00'0C
Conclusions
=0'
40004'24
24C=02>
20;24G
67*<8367*8HC';2
226'2,9'3:$6B
00'67*<8367*<8HC
+220
40440'0
00C&04'2
42')0
0044
>00'2C%2)0
>>'
43H2C%
0420''4>'0>>
''00002
243HC(03:B4
00C20'''402
02'4)'>'42
''3<HC23<H
25
400''
0043H)0
;0'2I00
0C
26
Materials and Methods
Sequencing and genomic analyses
=67*<8H67*<830;'
2#C$!NOC
SNP and small indel identification
%?'067*<83'0'67*<8H?
>@<3*(,7NO03:
00'4?C%;2'
+%>+%#N$O0C&
0''400240
2C3:*H'R!C5
'0R,S6!200C
%67*<83*+'0'420'
C+020+2N9O
<3?3:N6OC'0'
2??002
4'C%''2
40?'20;2C
SNP and small indel validation
&')0078$3:8@
'"1(31C'-4
C&:1 "0;
++K00''2'"3+0
67*<840067*<8H067*<83H
27
67*<83%%K:10C%''
>00C(
-''67*<8H67*<83
00C
(;:;'';'
02C&'0$$3:!C:
':;'W+"''-
10&C%:0'Q'%;)C
+%*%0;';'
:Q@8:;C(00"3+
'3:00C
67*<83'024INOC'
3:24043:N,,$OC
SV identification
'4004N8O
066'?2B467*<8H
67*<83C&2?;'0
66;'>;N,,O097!
0?'C+'5
:'?4;'44C
28
SV validation
:'':,0&144'
C:''-0
2C%:
%;40Q'C%0
24N,9OC'':>Q'0'
'S$>4C:''0;0'
00:C:0@8?
0,!X0
$
;C+;'
+*(,7!!;44'
:1"<:+:C:'?4;'5
02,5044
67*<8H67*<83400C%2
240"40B+2"B
("E$!9N,6OC
Predicted effects of sequence variants
:00042
0':2:B100+-N$5OC
::'0''B3:40
%"4"N,8OC
Phenotyping
Phenotyping platforms in EMPReSSslim
29
+:0'41:4
N,7OC+'')'
=+('>C4;40
'4
4Phenotype Data Analysis4C
67*<83%'C'
40N,7O.
Data capture by Europhenome
"'01:40
4(0'(4>424
'004'C%
01:0:">''
:")0 4>'' 04 C
%)2-2
2I24N,5O0)C%0
4)4IC%4
02'1:'
0-1:4N,7OC(00
00=%:
C
1Y=%:'>41:
0C%'20'
1:0>0')'
1:C%022
30
00 '0401:%>C(02
01:4C"4
2044'00=%:
C%2
2C:40N,@OC
Phenotype data analysis
1
N@OC(03H'0
40)01
4C(00
00'00C&
4'0'4C+)0
0(:%%-(Z0Y
'0Z?>22YZYC%0
-$!$$0$!,067*<8367*<8H2C%
'2E$!$$C
4'067*<8H67*<83240
0'0C+0
'[-200-?2
004;''
0C
'=[1)%0$)$'4?
;%?2C3?&%
?400
4C%
31
>22''0
4'22420C
(?0'0
0<<)4?2
0''02004
'>C&67*<832'
67*<8H'0C('
024
004C
(02'00
4'?';N9!O
4402',
C+;40
;0C
2440402
4402''2C
0444244
>'000'C
+00,'
='-
!C!790C%420!C$,'
2?C&
04='$C+
-42!C,550
42!C!$''
32
'2'0C%0'
$4='4'2440!C5,
42!C5C
Abbreviations
1:E1:'0G1E
14G1E1'IG
=+E=?+2'G(#E(#>
G(:E(:'G(:%%E
(%%G#:E#>:IG:E
:G3:E'3:G:E
':
Competing interests
%200C
Authors’ contributions
H#&=B?"%#&1*1H"H1H3H((HH+H
=:::H*K?#:+?+K"*4;
2')C
"H+*:H#&=B?"%1*1H"H1H3H((HH+H=::
:H#:K"+++*"*?=+=+?#1&%
&=3?+H+%?BB1B&&&.0
)C
33
"H+*:+?"*+*:"H%#3&0
C
+20C
Description of additional data files
+0E%43:24
+0$E='1:'C
+0,E='$?''
004'C
Acknowledgements
%1"(I0414
?%?$!!8?!,755?%?$!!7?!,79964
&%'4!@5!6C&'0>'4
041"(IG40N@OC&>
?2'&%0
20)00';C*-K
41*'?%=+'3C%
(4='43+'0
'002:13(3+3?!?(3*?!7C
34
References
1. Skarnes WC, Rosen B, West AP, Koutsourakis M, Bushell W, Iyer V, Mujica AO,
Thomas M, Harrow J, Cox T, Jackson D, Severin J, Biggs P, Fu J, Nefedov M, de Jong PJ,
Stewart AF, Bradley A: A conditional knockout resource for the genome-wide
study of mouse gene function. Nature 2011, 474:337-342.
2. Brown SD, Moore MW: Towards an encyclopaedia of mammalian gene
function: the International Mouse Phenotyping Consortium. Dis Model Mech
2012, 5:289-292.
3. Carneiro AM, Airey DC, Thompson B, Zhu CB, Lu L, Chesler EJ, Erikson KM,
Blakely RD: Functional coding variation in recombinant inbred mouse lines
reveals multiple serotonin transporter-associated phenotypes. Proc Natl Acad
Sci U S A 2009, 106:2047-2052.
4. Williams RW, Gu J, Qi S, Lu L: The genetic structure of recombinant inbred
mice: high-resolution consensus maps for complex trait analysis. Genome Biol
2001, 2:RESEARCH0046.
5. Gregorova S, Divina P, Storchova R, Trachtulec Z, Fotopulosova V, Svenson
KL, Donahue LR, Paigen B, Forejt J: Mouse consomic strains: exploiting genetic
divergence between Mus m. musculus and Mus m. domesticus subspecies.
Genome Res 2008, 18:509-515.
6. Valdar W, Solberg LC, Gauguier D, Burnett S, Klenerman P, Cookson WO,
Taylor MS, Rawlins JN, Mott R, Flint J: Genome-wide genetic association of
complex traits in heterogeneous stock mice. Nat Genet 2006, 38:879-887.
7. Churchill GA, Airey DC, Allayee H, Angel JM, Attie AD, Beatty J, Beavis WD,
Belknap JK, Bennett B, Berrettini W, Bleich A, Bogue M, Broman KW, Buck KJ,
Buckler E, Burmeister M, Chesler EJ, Cheverud JM, Clapcote S, Cook MN, Cox RD,
Crabbe JC, Crusio WE, Darvasi A, Deschepper CF, Doerge RW, Farber CR, Forejt J,
Gaile D, Garlow SJ et al: The Collaborative Cross, a community resource for the
genetic analysis of complex traits. Nat Genet 2004, 36:1133-1137.
8. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P,
Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J,
Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby
M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD,
Carninci P et al: Initial sequencing and comparative analysis of the mouse
genome. Nature 2002, 420:520-562.
9. Church DM, Goodstadt L, Hillier LW, Zody MC, Goldstein S, She X, Bult CJ,
Agarwala R, Cherry JL, DiCuccio M, Hlavina W, Kapustin Y, Meric P, Maglott D, Birtle
Z, Marques AC, Graves T, Zhou S, Teague B, Potamousis K, Churas C, Place M,
Herschleb J, Runnheim R, Forrest D, Amos-Landgraf J, Schwartz DC, Cheng Z,
35
Lindblad-Toh K, Eichler EE et al: Lineage-specific biology revealed by a finished
genome assembly of the mouse. PLoS Biol 2009, 7:e1000112.
10. Mekada K, Abe K, Murakami A, Nakamura S, Nakata H, Moriwaki K, Obata Y,
Yoshiki A: Genetic differences among C57BL/6 substrains. Exp Anim 2009,
58:141-149.
11. Keane TM, Goodstadt L, Danecek P, White MA, Wong K, Yalcin B, Heger A,
Agam A, Slater G, Goodson M, Furlotte NA, Eskin E, Nellaker C, Whitley H, Cleak J,
Janowitz D, Hernandez-Pliego P, Edwards A, Belgard TG, Oliver PL, McIntyre RE,
Bhomra A, Nicod J, Gan X, Yuan W, van der Weyden L, Steward CA, Bala S, Stalker J,
Mott R et al: Mouse genomic variation and its effect on phenotypes and gene
regulation. Nature 2011, 477:289-294.
12. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis
AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM,
Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ: A framework for
variation discovery and genotyping using next-generation DNA sequencing
data. Nat Genet 2011, 43:491-498.
13. Gnerre S, Maccallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, Sharpe
T, Hall G, Shea TP, Sykes S, Berlin AM, Aird D, Costello M, Daza R, Williams L, Nicol R,
Gnirke A, Nusbaum C, Lander ES, Jaffe DB: High-quality draft assemblies of
mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci
U S A 2011, 108:1513-1518.
14. Wang K, Li M, Hakonarson H: ANNOVAR: functional annotation of genetic
variants from high-throughput sequencing data. Nucleic Acids Res 2010,
38:e164.
15. Grant JR, Arantes AS, Liao X, Stothard P: In-depth annotation of SNPs
arising from resequencing projects using NGS-SNP. Bioinformatics 2011,
27:2300-2301.
16. Wong K, Keane TM, Stalker J, Adams DJ: Enhanced structural variant and
breakpoint detection using SVMerge by integration of multiple detection
methods and local assembly. Genome Biol 2010, 11:R128.
17. Yalcin B, Wong K, Bhomra A, Goodson M, Keane TM, Adams DJ, Flint J: The
fine-scale architecture of structural variants in 17 mouse genomes. Genome
Biol 2012, 13:R18.
18. Freeman H, Shimomura K, Horner E, Cox RD, Ashcroft FM: Nicotinamide
nucleotide transhydrogenase: a key role in insulin secretion. Cell metabolism
2006, 3:35-45.
36
19. Morgan H, Beck T, Blake A, Gates H, Adams N, Debouzy G, Leblanc S, Lengger
C, Maier H, Melvin D, Meziane H, Richardson D, Wells S, White J, Wood J, de Angelis
MH, Brown SD, Hancock JM, Mallon AM: EuroPhenome: a repository for high-
throughput mouse phenotyping data. Nucleic Acids Res 2010, 38:D577-585.
20. Gates H, Mallon AM, Brown SD: High-throughput mouse phenotyping.
Methods 2011, 53:394-404.
21. Prusky GT, Alam NM, Beekman S, Douglas RM: Rapid quantification of
adult and developing mouse spatial vision using a virtual optomotor system.
Invest Ophthalmol Vis Sci 2004, 45:4611-4616.
22. Mattapallil MJ, Wawrousek EF, Chan CC, Zhao H, Roychoudhury J, Ferguson
TA, Caspi RR: The Rd8 mutation of the Crb1 gene is present in vendor lines of
C57BL/6N mice and embryonic stem cells, and confounds ocular induced
mutant phenotypes. Invest Ophthalmol Vis Sci 2012, 53:2921-2927.
23. Paques M, Guyomard JL, Simonutti M, Roux MJ, Picaud S, Legargasson JF,
Sahel JA: Panretinal, high-resolution color photography of the mouse fundus.
Invest Ophthalmol Vis Sci 2007, 48:2769-2774.
24. Matsuo N, Takao K, Nakanishi K, Yamasaki N, Tanda K, Miyakawa T:
Behavioral profiles of three C57BL/6 substrains. Front Behav Neurosci 2010,
4:29.
25. Tucci V, Lad HV, Parker A, Polley S, Brown SD, Nolan PM: Gene-environment
interactions differentially affect mouse strain behavioral parameters. Mamm
Genome 2006, 17:1113-1120.
26. Grenham S, Clarke G, Cryan JF, Dinan TG: Brain-gut-microbe
communication in health and disease. Front Physiol 2011, 2:94.
27. Mehalow AK, Kameya S, Smith RS, Hawes NL, Denegre JM, Young JA, Bechtold
L, Haider NB, Tepass U, Heckenlively JR, Chang B, Naggert JK, Nishina PM: CRB1 is
essential for external limiting membrane integrity and photoreceptor
morphogenesis in the mammalian retina. Human molecular genetics 2003,
12:2179-2189.
28. Choi Y, Sims GE, Murphy S, Miller JR, Chan AP: Predicting the functional
effect of amino Acid substitutions and indels. PLoS One 2012, 7:e46688.
29. Jeru I, Duquesnoy P, Fernandes-Alnemri T, Cochet E, Yu JW, Lackmy-Port-Lis
M, Grimprel E, Landman-Parker J, Hentgen V, Marlin S, McElreavey K, Sarkisian T,
Grateau G, Alnemri ES, Amselem S: Mutations in NALP12 cause hereditary
periodic fever syndromes. Proc Natl Acad Sci U S A 2008, 105:1614-1619.
37
30. Arthur JC, Lich JD, Ye Z, Allen IC, Gris D, Wilson JE, Schneider M, Roney KE,
O'Connor BP, Moore CB, Morrison A, Sutterwala FS, Bertin J, Koller BH, Liu Z, Ting
JP: Cutting edge: NLRP12 controls dendritic and myeloid cell migration to
affect contact hypersensitivity. J Immunol 2010, 185:4515-4519.
31. SNP & Indel Data:
http://www.ncbi.nlm.nih.gov/projects/SNP/snp_viewTable.cgi?handle=MRCHARW
ELL
32. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K:
dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 2001, 29:308-
311.
33. Manske HM, Kwiatkowski DP: LookSeq: a browser-based viewer for deep
sequencing data. Genome Res 2009, 19:2125-2132.
34. Yalcin B, Willis-Owen SA, Fullerton J, Meesaq A, Deacon RM, Rawlins JN,
Copley RR, Morris AP, Flint J, Mott R: Genetic dissection of a behavioral
quantitative trait locus shows that Rgs2 modulates anxiety in mice. Nat Genet
2004, 36:1197-1202.
35. Structural Variant Data: http://www.ebi.ac.uk/dgva/page.php
36. Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE: The Mouse Genome
Database (MGD): comprehensive resource for genetics and genomics of the
laboratory mouse. Nucleic Acids Res 2012, 40:D881-886.
37. Mallon AM, Blake A, Hancock JM: EuroPhenome and EMPReSS: online
mouse phenotyping resource. Nucleic Acids Res 2008, 36:D715-718.
38. Europhenome Java Library:
http://sourceforge.net/projects/europhenome/
39. Phenotype Data: http://www.har.mrc.ac.uk/nj
40. Efron B: 1977 Rietz Lecture - Bootstrap Methods - Another Look at the
Jackknife. Ann Stat 1979, 7:1-26.
38
Figure 1. ''000
467*<8367*<8H0000
\-.(;(
&%'(&%(C:'
01:N,7OC'02
000'0>C'000
0'4CA):
'00043H,CB):
'00043H$4
20C
Figure 2. ''000
467*<8367*<8H0000
\-.(;(
&%'(&%(\0
'000$4
0C:'
01:C'02000'
0>C
Figure 3C'000467*<8367*<8H
C(A)&0>4067*<8H0
0;2067*<832'0
2467*<830'C"'
67*<83008@C$PT7!
U,90(99C8PT96U650$,C!P
T59U@90&%(00>
39
67*<8H(E$@GE76U760G&%(E,9U$5
0C(B)*240004
4,07'6
40'067*<8HC(C)Q002
67*<8367*<8HT9!T7!
2C%4029C5]!C067*<83T
$$2C6C,]!C067*<8HT,5C*00
'0R!C!!?&>%C
Figure 4. X%00244
9?>?67*<8H267*<83(A)0(B)C4
09?>?00'4
(C)C0
)404>2
4249?>?67*<8H67*<83(D)C
+442E*CBC<%CBC42<2G%4C3C44G
%4CC%4'G?"C2GCC(C
)!0,0G"C+C'0G
C:CGC%C>G":")GC
C
Figure 5. '>C*(A)P0
>(B)40'<>(C)
67*<8HT!67*<83T@'5?!>C"
]1MR!C!6?C(D)'020C
4]100'
099!CCC,!!067*<8HT!67*<83
40
T!'@?>CMR!C!6MMR!C!!6MMMR!C!!!H2C3?
GMR!C!6MMR!C!!6MMMR!C!!!2C"=Y"C
Figure 6. &-CA)=-%''2C
4]100
67*<8HT!67*<83T!'8?$!>CMMR!C!!6
MMMR!C!!!6H2C3?GMMR!C!!6MMMR!C!!!2C"=Y"CB)
:4C*P;6'4
C"2$6PCMR!C!6MMR!C!!6MMMR!C!!!6
2C;?CCC2>067*<8H67*<83
'4C"00C
Figure 7.0467*<8H67*<83
4CA)#?222000
67*<8H67*<830C2C0$)!
9
?0'
00Listeria monocytogenes1"CB)*20
67*<8H67*<830C2C0$)!
9
0L. monocytogenes
1"C'0->0
4'CC)020>8(?80
4!(:?!>'$$467*<8H
67*<83(B)C0?0>
>4'>
$!?:):(2';!!Q'C'0
000EMR!C!6MMR!C!?&?C
*>4467*<8H4C&4467*<83
4C
41
Figure 8. A)03#3#2067*<8H
*8H267*<83*83%0C
3#067*<8H67*<83
)'CU<?"0(=3^2
'",?3#CU3#40C
B?02C
067*<8H67*<83-40$6
_0!C6P"3=*2>C%'4
06_0!C6P"3=*06"3=*C'
2?C1>
950'C20,))
'CMR!C!6GMMR!C!!6?&
42
Figure S1. 1:'C%$!
'0C"0=+04
;0C
Figure S2. A-D) ='='$40
67*<8367*<8H0E-H)
='='$'00-42
CA,E):'0004
3H,CB,F):'0
0043H$420
CC,G):042'0
00CD,H):'0
00$40C
43
Table 1. Coding SNPs and small indels identified between C57BL/6N and
C57BL/6J
Chr
Position
B6J
Base
B6N
Base
Strain
Gene Name
B6J
Amino Acid
B
6N
Amino Acid
Nonsense Polymorphism
13 65023280 C T B6N
Spata31
Arginine * (STOP)
Missense Polymorphisms
1 59904011 G A B6N
Bmpr2
Arginine Glutamine
3 95538799 T C B6J
Ecm1
Isoleucine Valine
3 96658480 A G B6J
Pdzk1
Asparagine Aspartic acid
4 21800831 C G B6J
Sfrs18
Arginine Glycine
4 137777588 C T B6N
Hp1bp3
Leucine Phenylalanine
4 140354038 A G B6N
Padi3
Leucine Proline
4 148318468 T C B6J
Casz1
Leucine Proline
5 90204376 C T B6N
Adamts3
Valine Isoleucine
5 97187161 T C B6J
Fras1
Leucine Proline
5 113191741 C T B6N
Myo18b
Arginine Histidine
6 39350455 T A B6J
Mkrn1
Asparagine Tyrosine
7 3222538 T C B6J
Nlrp12
Lysine Arginine
7 63386662 G A B6J
Herc2
Glycine Aspartic acid
7 86256240 A C B6J
Acan
Histidine Proline
7 110121823 C T B6N
Olfr577
Valine Isoleucine
7 127278693 G A B6N+Spretus
Zp2
Alanine Valine
7 129311164 C T B6N
Plk1
Arginine Tryptophan
9 24935069 C G B6N
Herpud2
Valine Leucine
10 66700922 T C B6J
Jmjd1c
Leucine Proline
10 78632222 A G B6N
Vmn2r80
Asparagine Serine
10 87554578 T C B6N
Pmch
Isoleucine Threonine
11 46036117 G A B6N
Cyfip2
Serine Phenylalanine
11 90341985 C T B6N
Stxb4
Alanine Threonine
13 21560172 A G B6J
Nkapl
Glycine Arginine
13 73465884 A G B6J
Ndufs6
Valine Alanine
13 93833534 C G B6J
Cmya5
Alanine Proline
14 70986011 G T B6N
Fam160b2
Serine Arginine
15 11266138 G T B6N
Adamts12
Cysteine Phenylalanine
15 77468437 A C B6J
Apol11b
Isoleucine Arginine
16 35291630 G A B6N
Adcy5
Valine Methionine
17 47537359 T C B6J
Guca1a
Isoleucine Valine
X 131227581 C A B6N
Armcx4
Alanine Aspartic acid
Splice Site Polymorphism
5 54280548 A G
B6J
Tbc1d19
- -
Frameshift
1bp
-
D
eletions
1 141133664 G -
B6N
Crb1
- -
9 65127938 G -
B6J
Cilp
- -
44
Table 2. Structural variants between C57BL/6N and C57BL/6J
Chr SV start SV stop Ancestral event Strain Gene Overlap
1 149518394 149524878 LINE Ins B6J
2 7325700 7330977 IAP Ins B6J
2 70619835 70620080 SINE Ins B6J
Tlk1
intron
3 77975065 77977953 Del B6N
3 5049018 5055845 LINE Ins B6J
3 60336036 60336037 Del (large) B6J
Mbnl1
intron
3 41885819 41887255 LINE Ins B6J
3 18484710 18484889 Del B6N
4 101954274 101954395 Del B6N
Pde4b
intron
4 116051393 116051799 MaLR Ins B6J
Mast2
intron
5 46376307 46377852 LINE Ins B6J
5 90356490 90356491 Del (~300 bp) B6J
5 146248861 146261885 Ins B6J+others
6 18112291 18119019 LINE Ins B6J
6 62964974 62972907 LINE Ins B6J
6 86478779 86479400 Ins B6J
6 103669536 103676487 LINE Ins B6J
Chl1
intron
6 104207081 104214434 LINE Ins B6J
7 92095990 92096149 Del B6N
Vmn2r65
exon
7 27636128 27748456 Ins B6J
Cyp2a22
entire
7 100892501 100899058 LINE Ins B6J
7 139306094 139307981 MaLR Ins B6J
Cpxm2
intron
8 16716381 16716382 Del (large) B6J
Csmd1
intron
9 25674550 25674770 SINE Ins B6J
9 58544415 58546304 MaLR Ins B6J
2410076I21Rik
intron
10 3039196 3039197 Del (large) B6J
10 29339441 29345955 LINE Ins B6J
10 32536420 32543464 LINE Ins B6J
Nkain2
intron
10 49543303 49550645 LINE Ins B6J
11 104906390 104906621 Del B6N
11 119560391 119566827 MTA Ins B6J
Rptor
intron
12 42023964 42032747 Del B6N
Immp2l
intron
13 71224557 71231011 MTA Ins B6J
13 120164268 120164269 Del (large) B6J
Nnt
exon
14 112825585 112832341 LINE Ins B6J
15 49554596 49554597 Ins (large) B6N
15 31106173 31106382 VNTR
16 6115804 6138105 Del B6N
17 60286367 60286368 Ins (~2000 bp) B6N
18 4809271 4809272 Del (~1200 bp) B6J
19 12863187 12863188 Del (~1800 bp) B6J
Zfp91
intron
X 15697909 15697910 Del (~400 bp) B6J
X 95155499 95163160 LINE Ins B6J
Start and stop coordinates are given for MGSCv37 of the mouse reference genome. Del, deletion; IAP,
intracisternal A particle; Ins, insertion; LINE, long interspersed nuclear elements; MaLR, mammalian
apparent LTR-retrotransposon; MTA, member of transcript retrotransposon; SINE, short interspersed
nuclear elements; VNTR, variable number tandem repeat.
45
Table 3. Comparison of predicted effects of SNPs and SVs that might contribute to the phenotypic
differences between C57BL/6N and C57BL/6J (see text). We have identified variant genes that show
homozygote knockout phenotypes with associated MP terms that were assessed in the phenotyping
pipeline, and compared these phenotypes to those observed between N and J.
Protein
coding
gene
C57BL/6J
Amino
Acid
C57BL/6N
Amino Acid
SNP is
Private
to
PROVEAN
Prediction
1
MP Terms
B6J
vs
B6N
2
B6N vs
B6J
2
Adcy5 Valine (V) Methionine
(M)
B6N TOLERATED
(-1.712)
impaired
coordination_MP:0001405
NR P
hypoactivity_MP:0001402 NR P
Pmch Isoleucine
(I)
Threonine
(T)
B6N TOLERATED
(0.493)
decreased circulating glucose
level_MP:0005560
NR A
abnormal glucose
tolerance_MP:0005291
NR A
increased oxygen
consumption_MP:0005289
NR P
Pdzk1 Asparagine
(N)
Aspartic Acid
(D)
B6J TOLERATED
(0.95)
increased circulating
cholesterol level_MP:0001556
A NR
Nlrp12 Lysine
(K)
Arginine
(A)
B6J TOLERATED
(0.781)
abnormal type IV
hypersensitivity
reaction_MP:0002534
P NR
Crb1 - - B6N - photosensitivity_MP:0001999 NR P
abnormal ocular fundus
morphology_MP:0002864
NR P
retinal
degeneration_MP:0001326
NR P
abnormal retina
morphology_MP:0001325
NR P
abnormal retinal photoreceptor
layer_MP:0003728
NR P
Chl1 - - B6J - abnormal learning/ memory_
MP:0001449
A NR
abnormal spatial working
memory_MP:0008428
A NR
Rptor - - B6J - increased lean body
mass_MP:0003960
P NR
increased oxygen
consumption_MP:0005289
A NR
hypoactivity_MP:0001402 A NR
decreased circulating glucose
level_MP:0005560
P NR
improved glucose
tolerance_MP:0005292
A NR
Nnt - - B6J - impaired glucose
tolerance_MP:0005293
P NR
1
Threshold for intolerance is -2.3
2
These columns indicate the direction of the phenotype effect that might be observed given the assignment of a SNP or SV as
private to B6J or B6N. Only one direction will be relevant and comparable to the effects of the knockout mutation.
Key:
NR
Not relevant
P
Phenotype present
A
Phenotype ant
46
Figure 1
Figure 2
Figure 3
Figure 5
Figure 6
Figure 7
Additional files provided with this submission:
Additional file 1: TableS1.pdf, 225K
http://genomebiology.com/imedia/8046326471045638/supp1.pdf
Additional file 2: Supplementary_Figure_1.pdf, 544K
http://genomebiology.com/imedia/1086014387104073/supp2.pdf
Additional file 3: Supplementary_Figure_2.pdf, 2994K
http://genomebiology.com/imedia/1777967506104073/supp3.pdf