Ksenia Krasheninnikova, Wellcome Sanger Institute · Darwin Tree of Life project
Publications: 55 · Reads: 13,297 · Citations: 1,497
Publications (55)
We present a genome assembly from an individual male Anopheles nili (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequence is 195 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondri...
We present a genome assembly from an individual female Anopheles marshallii (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae) from Lopé, Gabon. The genome sequence is 225.7 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome...
We present a genome assembly from an individual female Anopheles coustani (the African malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae) from Lopé, Gabon. The genome sequence is 270 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled for both species. The complete mitochondrial...
We present a genome assembly from an individual female Anopheles maculipalpis (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae). The genome sequence is 224 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also assembl...
We present a genome assembly from an individual female Anopheles gambiae (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), Ifakara strain. The genome sequence is 264 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was a...
We present a genome assembly from an individual male Anopheles moucheti (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequence is 271 megabases in span. The majority of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete...
Molluscs are a highly speciose phylum that exhibits an astonishing array of colours and patterns, yet relatively little progress has been made in identifying the underlying genes that determine phenotypic variation. One prominent example is the land snail Cepaea nemoralis for which classical genetic studies have shown that around nine loci, several...
Background
PacBio high fidelity (HiFi) sequencing reads are both long (15–20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly leading to more accurate and contiguous genomes. In eukaryotes the mitochondrial genome is sequenced alongside the nuclear genome often at very high coverage. A dedicated...
We present a genome assembly from an individual female Anopheles funestus (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae). The genome sequence is 251 megabases in span. The majority of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also ass...
Pusa sibirica, the Baikal seal, is the only extant, exclusively freshwater pinniped species. An open question is how and when it reached its current habitat, the rift lake Baikal, more than three thousand kilometers from the Arctic Ocean. To explore the demographic history and genetic diversity of this species, we generated a de novo chr...
The Puma lineage within the family Felidae consists of three species that last shared a common ancestor around 4.9 million years ago. Whole-genome sequences of two species from the lineage were previously reported: the cheetah (Acinonyx jubatus) and the mountain lion (Puma concolor). The present report describes a whole-genome assembly of the remai...
Mycobacterium tuberculosis is a highly studied pathogen due to its public health importance. Despite this, problems such as early drug resistance, diagnostics and treatment success prediction are still not fully resolved. Here, we analyze the incidence of point mutations widely used for drug resistance detection in laboratory practice and conduct comparat...
Background
Large-scale sequencing projects provide high-quality full-genome data that can be used for reconstruction of chromosomal exchanges and rearrangements that disrupt conserved syntenic blocks. The highest resolution of cross-species homology can be obtained on the basis of whole-genome, reference-free alignments. Very large multiple alignme...
Genome-wide assessment of genetic diversity has the potential to increase the ability to understand admixture, inbreeding, kinship and erosion of genetic diversity affecting both captive (ex situ) and wild (in situ) populations of threatened species. The sable antelope (Hippotragus niger), native to the savannah woodlands of sub-Saharan Africa, is...
The Russian Federation is the largest and one of the most ethnically diverse countries in the world, however no centralized reference database of genetic variation exists to date. Such data are crucial for medical genetics and essential for studying population history. The Genome Russia Project aims at filling this gap by performing whole genome se...
Mycobacterium tuberculosis is a highly studied pathogen due to its public health importance. Despite progress in M. tuberculosis genome diversity analysis, there remain insufficient data on genome analysis of M. tuberculosis strains associated with pulmonary vs. extrapulmonary TB (PTB or XPTB respectively) tissue localization. Here we conduct comparative...
A comparative analysis of whole genome sequencing (WGS) and genotype calling was initiated for ten human genome samples sequenced by St. Petersburg State University Peterhof Sequencing Center and by three commercial sequencing centers outside of Russia. The sequence quality, efficiency of DNA variant and genotype calling were compared with each oth...
Long indel counts.
The number of identified long indels is given for each sequencing center to illustrate the effect of filtering (described in the first column).
(DOCX)
Annotated list of all identified LoF SNPs.
(XLSX)
Overlap of long indels across three sequencing centers.
The Venn diagram shows the number of shared long indels in the three datasets.
(PDF)
Annotated list of all identified LoF short indels.
(XLSX)
List of candidate AIH-related genes obtained from separate studies.
(XLSX)
Distribution of alternative allele counts in called genotypes.
Three datasets of genotypes for 10 individuals (Illumina and Macrogen) and one dataset of genotypes for 6 individuals (Peterhof) were considered. For each variant, the number of alternative alleles was obtained; the variants were classified according to this number. Multiallelic variant...
Segmental duplications identified in the trio in three datasets.
The "Common" bar corresponds to segmental duplications present in all three datasets.
(PDF)
Statistics on called variants.
Statistics on variant calling and genotyping were calculated on the 6 samples shared in the three datasets. The variants were classified as known or novel according to their presence or absence in the NCBI dbSNP database build 147.
(XLSX)
Distribution of copy numbers in non-duplicated (control) regions.
The distributions are plotted for each sample from (A) Illumina, (B) Macrogen, (C) Peterhof.
(PDF)
Comparison of various QC parameters for raw reads.
Raw read quality control parameters assessed for all sequenced samples for each sequencing center.
(XLSX)
Alignment statistics.
Various parameters of alignment results are averaged over all samples in each dataset.
(DOCX)
Mendelian inheritance errors.
Variants violating Mendelian inheritance were counted in the trio genotype data.
(DOCX)
Per-sample genotype comparison between datasets.
(XLSX)
HLA genotyping and concordance of WGS-based and molecular typing.
(XLSX)
Filtered list of LoF SNPs.
(XLSX)
Filtered list of LoF indels.
(XLSX)
Time estimates for 30X coverage from sequencing centers per person.
(DOCX)
Solenodons are insectivores living in Hispaniola and Cuba that form an isolated branch in the tree of placental mammals, highly divergent from other eulipotyphlan insectivores. The history, unique biology and adaptations of these enigmatic venomous species could be illuminated by the availability of genome data, but a whole genome assembly for soleno...
Whole-genome analysis of Mycobacterium tuberculosis isolates collected in Russia (N = 71) from patients with tuberculous spondylitis supports a detailed characterization of pathogen strain distributions and drug resistance phenotype, plus distinguished occurrence and association of known resistance mutations. We identify known and novel genome dete...
Mycobacterium tuberculosis isolate data; insertions and deletions associated with M. tuberculosis genetic clades.
Solenodons are insectivores living on the Caribbean islands, with few surviving related taxa. The genus occupies one of the most ancient branches among the placental mammals. Understanding of the history, unique biology and adaptations of these enigmatic venomous species could be greatly advanced by the availability of genome data, but the whole genome assembly f...
Pangolins (order Pholidota) are the only mammals covered by scales. We have recently sequenced and analyzed the genomes of two critically endangered Asian pangolin species, namely the Malayan pangolin (Manis javanica) and the Chinese pangolin (Manis pentadactyla). These complete genome sequences will serve as reference sequences for future research...
Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Striking...
Background
As the number of sequenced genomes rapidly increases, chromosome assembly is becoming an even more crucial step of any genome study. Since de novo chromosome assemblies are confounded by repeat-mediated artifacts, reference-assisted assemblies that use comparative inference have become widely used, prompting the development of several re...
Background
Patterns of genetic and genomic variance are informative in inferring population history for humans, model species and endangered populations.
Results
Here the genome sequence of wild-born African cheetahs reveals extreme genomic depletion in SNV incidence, SNV density, SNVs of coding genes, MHC class I and II genes, and mitochondrial DN...