Publications (4) · 42.8 Total impact

  • Source
    ABSTRACT: The NIF Registry is available to download in a couple of ways. The version attached to this paper is a snapshot, and we recommend that you use an up-to-date version. The places to view the updated registry are: 1. The main NIF site: https://neuinfo.org/mynif/search.php?q=*&t=registry&b=0&r=20 Download the registry from here by choosing "source options button -> download csv"; the file should contain all data, including columns that are not displayed on the interface, such as grants that support the project, PubMed IDs, and mentions in the literature (the output of the Caltech pipeline described in this paper). Note that in some cases Excel has a problem displaying these results if a column contains too many entries, as happens for some resources with hundreds of PubMed identifiers. 2. The NeuroLex site, where any entry can be modified (an edit button is available on every page): http://neurolex.org/wiki/Category:Resource There are several CSV download options on this page, but the MediaWiki platform has some known issues with producing large downloads, so the result set may be limited to the first few thousand entries. To query for specific resources or types of resources, you can use the main NIF site search; for example, this is a search for all databases: https://neuinfo.org/mynif/search.php?q=database&t=registry&b=0&r=20 For developers, a SPARQL endpoint is available, with sample queries on this page: http://neurolex.org/wiki/NeuroLex_SPARQL_endpoint
  • Source
    ABSTRACT: Schistosomiasis is a neglected tropical disease caused by blood flukes (genus Schistosoma; schistosomes) and affecting 200 million people worldwide. No vaccines are available, and treatment relies on one drug, praziquantel. Schistosoma haematobium has come into the spotlight as a major cause of urogenital disease, as an agent linked to bladder cancer and as a predisposing factor for HIV/AIDS. The parasite is transmitted to humans from freshwater snails. Worms dwell in blood vessels and release eggs that become embedded in the bladder wall to elicit chronic immune-mediated disease and induce squamous cell carcinoma. Here we sequenced the 385-Mb genome of S. haematobium using Illumina-based technology at 74-fold coverage and compared it to sequences from related parasites. We included genome annotation based on function, gene ontology, networking and pathway mapping. This genome now provides an unprecedented resource for many fundamental research areas and shows great promise for the design of new disease interventions.
    Nat. Genet. 01/2012; 44(2):221-225.
  • Source
    ABSTRACT: The breadth of information resources available to researchers on the Internet continues to expand, particularly in light of recently implemented data-sharing policies required by funding agencies. However, the nature of dense, multifaceted neuroscience data and the design of contemporary search engine systems make efficient, reliable and relevant discovery of such information a significant challenge. This challenge is specifically pertinent for online databases, whose dynamic content is 'hidden' from search engines. The Neuroscience Information Framework (NIF; http://www.neuinfo.org) was funded by the NIH Blueprint for Neuroscience Research to address the problem of finding and utilizing neuroscience-relevant resources such as software tools, data sets, experimental animals and antibodies across the Internet. From the outset, NIF sought to provide an accounting of available resources while developing technical solutions for finding, accessing and utilizing them. The curators, therefore, are tasked with identifying and registering resources, examining data, writing configuration files to index and display data and keeping the contents current. In the initial phases of the project, all aspects of the registration and curation processes were manual. However, as the number of resources grew, manual curation became impractical. This report describes our experiences and successes with developing automated resource discovery and semiautomated type characterization with text-mining scripts that facilitate curation team efforts to discover, integrate and display new content. We also describe the DISCO framework, a suite of automated web services that significantly reduce the manual curation effort needed to periodically check for resource updates. Lastly, we discuss DOMEO, a semi-automated annotation tool that improves the discovery and curation of resources that are not necessarily website-based (i.e. reagents, software tools). Although the ultimate goal of automation was to reduce the workload of the curators, it has resulted in valuable analytic by-products that address accessibility, use and citation of resources that can now be shared with resource owners and the larger scientific community. DATABASE URL: http://neuinfo.org.
    Database: The Journal of Biological Databases and Curation 01/2012; 2012:bas005. · 4.20 Impact Factor
  • Source
    ABSTRACT: Parasitic diseases have a devastating, long-term impact on human health, welfare and food production worldwide. More than two billion people are infected with geohelminths, including the roundworms Ascaris (common roundworm), Necator and Ancylostoma (hookworms), and Trichuris (whipworm), mainly in developing or impoverished nations of Asia, Africa and Latin America. In humans, the diseases caused by these parasites result in about 135,000 deaths annually, with a global burden comparable with that of malaria or tuberculosis in disability-adjusted life years. Ascaris alone infects around 1.2 billion people and, in children, causes nutritional deficiency, impaired physical and cognitive development and, in severe cases, death. Ascaris also causes major production losses in pigs owing to reduced growth, failure to thrive and mortality. The Ascaris-swine model makes it possible to study the parasite, its relationship with the host, and ascariasis at the molecular level. To enable such molecular studies, we report the 273 megabase draft genome of Ascaris suum and compare it with other nematode genomes. This genome has low repeat content (4.4%) and encodes about 18,500 protein-coding genes. Notably, the A. suum secretome (about 750 molecules) is rich in peptidases linked to the penetration and degradation of host tissues, and an assemblage of molecules likely to modulate or evade host immune responses. This genome provides a comprehensive resource to the scientific community and underpins the development of new and urgently needed interventions (drugs, vaccines and diagnostic tests) against ascariasis and other nematodiases.
    Nature 01/2011; 479(7374):529-533. · 38.60 Impact Factor
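
The NIF Registry abstract above notes that Excel can struggle with the CSV export when a column holds many entries, such as hundreds of PubMed identifiers for one resource. A minimal Python sketch of reading such a file without a spreadsheet follows. It is only an illustration under stated assumptions: the column names ("Resource Name", "PubMed IDs"), the comma separator inside a cell, and the sample values are all hypothetical and are not taken from the actual export.

```python
import csv
import io
import sys


def read_registry_csv(handle, multi_value_columns=("PubMed IDs",), separator=","):
    """Read a registry-style CSV and split multi-valued cells into lists.

    The column names and the in-cell separator are assumptions; adjust them
    to match the header row of the file you actually downloaded.
    """
    # The csv module rejects very long fields by default; raise the limit
    # (capped so the call also works where C longs are 32-bit).
    csv.field_size_limit(min(sys.maxsize, 2**31 - 1))
    rows = []
    for row in csv.DictReader(handle):
        for column in multi_value_columns:
            if column in row and row[column] is not None:
                row[column] = [
                    value.strip()
                    for value in row[column].split(separator)
                    if value.strip()
                ]
        rows.append(row)
    return rows


# Placeholder data standing in for a downloaded export (not real records).
sample = io.StringIO(
    "Resource Name,PubMed IDs\n"
    'Example Resource,"111, 222, 333"\n'
)
records = read_registry_csv(sample)
print(records[0]["PubMed IDs"])
```

To use it on a real download, pass an open file instead of the in-memory sample, for example `open("registry.csv", newline="", encoding="utf-8")`, where the file name is again a placeholder.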