Questions related to Open Data
I have a question about the formal difficulties that may arise with the publication of a preprint when a journal agrees to have it published during the review process. So it is legal, but: How is it working? Does it violate the double-blind rule? Let us imagine this situation: I am a reviewer and I want to check the originality of the work, so I do research, and here, boom, the text is on the RG and I already know who published it. Moreover, on RG, the author knows that a potential reviewer has viewed his manuscript. In my opinion, this is a risk. Is it worth it? After all, open review and commenting on unpublished works is not a popular practice in the social sciences.
Same with the open data. I have no problem publishing them, I would like to do so, but it violates the anonymity of the manuscript I submitted. Of course, for beginner researchers like me, it may not matter much, but when someone gets some positive and negative contacts in academic community, it can be a real problem that arises with work experience. How do you see it?
Do there exist open data online sites where one can submit nematode micrographs, VCE videos, and image metadata? A content management system like Drupal could work. The level of effort and maintenance of such a site would be high.
Extending NemSyst (https://nemys.ugent.be/) with a citizen science add-on that allows users to submit observations?
I was wondering if there was an authority providing PM1 official measurement as open data. I checked in Europe and in the US but could not find anything apart from PM10 or PM2.5 official measurements. Many thanks
Many Global cities are opening Public transportation data feed under open license. General Transit Feed Specification (GTFS) is already offering such valuable information within the provision of freely license specification besides TRANSM.
Is there any overview - which cities are offering GTFS as open data (global, regional, national)?
Here I can see a summary in case of Germany (but not sure it is complete!!)- https://bit.ly/2Ut5Mf0
We live in a world of data, that much is obvious. Open data can be used to create new value and has many applications. But I'm curious how do you think open data can impact local development, how can it help local government? Maybe you have some experience, interesting examples?
Gray literature vs. scientific literature
-What are the advantages and disadvantages of gray literature compared to scientific literature.
-What are the repositories that you know about gray literature.
-What is the importance of gray literature for developing countries
Gray literature is "materials and research produced by organizations outside of the traditional commercial or academic publishing and distribution channels. Common gray literature publication types include reports (annual, research, technical, project, etc.), working papers, government documents, white papers and evaluations"
Scientific literature "comprises scholarly publications that report original empirical and theoretical work in the natural and social sciences. Within an academic field, scientific literature is often referred to as the literature. Academic publishing is the process of contributing the results of one's research into the literature, which often requires a peer-review process".
I wanted to upskill my analysis skill by using the open data. I tried several courses in edex and openlearning- but it was for those who looking to be become data scientist. I want use those skills in education- to provide analytics alongside the literature reviews.
Is there any available courses on this area? I am looking for microcred programs or online courses offered part-time mode.
SRA datasets has a lot of useful and reusable data for researchers. But some of them might be the ones that never get used at all. I would appreciate anyone advice on how can these data be more useful for the researchers? Would it be helpful to have it QCed and provide a quality metrics to the data, or maybe in some way ready to use data ie.e preprocessed? Is one dataset more used than the other (RNAseq vs DNA vs Metagenomics or a disease specific?
#SRA #OPENdataset #genomics #opensource
I am currently doing research on public transportation and the epidemic. Is there any open data related to the spread of the epidemic on public transportation?
I have a student that I am supervising who needs some type of open data set for analyzing using Structural Equation Modeling. It would need to be data from which latent variables/ constructs such as e.g. trait anxiety can be created. So, questionnaires and surveys are ideal for that.
Thank you in advance for any recommendations.
Greetings! I am interested in open-data locations on the web related to cybersecurity or cybercrime attacks recorded over the past 1-3 years, and the location of those attacked?
As I am currently doing regression related work, I hope to have some open data and help me to test my model, so I need you to provide me with some datasets that can be used. The requirement of datasets is N *feature*time series, where N represents the number of samples, feature represents the number of original feature channels.Time series refers to the number of channels sampled. At present, I have NASA's WEAR dataset, but I think it is not enough, so I need your help.
Is there a (preferably free) way to download all molecules (SMILES or SDF) from a published scientific article?
Like searching pubchem or chembl or ZINC via DOI or so?
I would like to extract molecules from some papers, but I would like
to avoid having to input each molecule by hand...
Thanks a lot,
I am in the process of writing a project proposal and finally wanted to include open-access dataset as a deliverable. Haven't published datasets before, it took some looking around and Zenodo and Figshare seem to be the most recognized ones, at least in the environmental engineering field. Zenodo seems to provide DOIs, but not sure about Figshare.
I am interested to know, have you already published your data there, or have you ever used someone else's data from Zenodo/Figshare?
we are testing a calibration-free algorithm to estimate the concentration of a unknown material using the LIBS spectra. It is implemented in Matlab and we plan to publish it in Github for everyone to use. I was wondering if anyone know of raw spectra of materials with known concentration (for example, standard reference materials -SRMs- from NIST) made available online. We would use these spectra to test and improve the algorithm.
I have noticed that in other research disciplines (artificial intelligence above all) it is nowadays common practice to share the code and the data, in many others, and in LIBS spectroscopy in particular, it is very rare.
Thank you all in advance for your answers.
Does anyone knows an Open data source or sharable data source for traumatic brain injury neuroimaging (Structural MRI,DTI,CT)?
How hard/easy is it to get access to clinical trials raw data, and how long does it usually take to get access clinical trials individual patient data through websites such as The YODA project or MSD?
Is any database of open-data websides ( fe www.worldometers.info...etc.) in the field of social, environmental and economical problematics, or do you know is any research about collecting data from inter-face of web pages..., longitudial project to store and provide to researchers..., if not, i will start :) thank you to all to help me
In my research related to monitoring the cow behavior with help of sensors I have a permanent and principal problem with the data for model validation. I tried to find some available open-source data sets and ask people on the last EAAP conference, but in vain. I need, for example, a dataset including records of tag accelerations for tens of cows for several days to validate my behavior discrimination model on data achieved in other conditions. The question is do datasets exist and where are they located?
Conference and journal papers building models for the behavior prediction have the same problem. But this problem is solved, for example, for the face recognition. There exist a site with the available datasets https://www.face-rec.org/databases/. Hence, I decided to concentrate links to all datasets that I find in one place on GitHub.
For my assignment, I am to use open data that focuses on ransomware attacks (however other forms of cyber attacks are also useful) so that I can identify trends and write my report. My report is based on attacks on local governments and councils however I have not been able to find any data that is refined enough.
Most of the publicly available databases give only the basic information like age, gender, mode of infection, etc. regarding the infected patients suffering from CoVID 19. So, can anyone recommend or suggest more specific databases related to image, speech or clinical data of the patients that are meant for open research?
I have the feeling that everything in chemistry related to information and data in particular is very sharded and obscured. Are there any projects ensuring more transparency and openess of the data? I am getting quite lost to be honest and would wish for more standards and unity in the field.
Does anyone know any open-source databases of Raman spectrum with anti-Stokes data?
I need the anti-Stokes part to prove my algorithm, but it seems that most of the databases only provide the spectrum starts from the positive wavenumber.
Please suggest the open-source Raman databases with data collecting that include the negative wavenumber region.
Thank you for your kind help.
I would like to test the statistical effect of a foreign-imposed regime-change (FIRC) on target country's trade openness data. My sample includes 29 cases of FIRCs in the past 70 years with trade-openness data (imports+exports/GDP) available for almost all of them.
I considered comparing the mean differences in trade-openness of affected countries with a control group of the region using MANOVA or t-test, but I am not quite sure if this can work, since I am not familiar with important events logic incorporated in time-series testing.
I downloaded MODIS NDVI data from 2001 to 2005 from Google Earth Engine using batch download for an area in Australia. I drew a polygon to demarcate my aoi for cropping. When I opened the exported dataset in ArcGIS, the images looked warped. The size and shape is different to the original polygon. For example, my aoi is a vertical rectangle, but when I opened data in ArcGIS, it showed images as horizontal rectangles. Of course, I changed the projection, but the problem remains same.
Can anyone help with solving this issue?
I'm looking for shared data of Coronavirus patients, in particular, I need the demographic, clinical, and survival characteristics to test some hypotheses in a secondary analysis.
Are there any databases/websites that share COVID-19 patients data?
We are looking for open data on the identification of stress (arousal or other related emotions) from physiological signals, specially Electrodermal activity (Galvanic Skin Response) but also Heart Rate Variability or EEG
Our ideal data set would contain the physiological signal(s) time series labelled with the stressors and at least 500 records (same or different individuals)
Thank you for reading!
Hello, this is Jean who is new to use ADNI data + DTI data.
I downloaded Axial DTI data of AD & CN, and I drop those data in dcm2niigui, but it does not work to covert to 4d nil image.
So I checked the data and it has 2,714 of dcm, which is different from what I got in PPMI DTI files.
Is it a matter of axial dti files? or is there any other issue regarding this??
I'm currently working on a project that desperately needs real-world data to test. What are some cities around the world where we can easily download OD matrices?
I'm looking for standardisations and regulations for open data provided by german/european cities.
To be more specific, I'm looking for standards, which portals like this (https://www.offenedaten-koeln.de/dataset/verkehrskalender-der-stadt-k%C3%B6ln/field_tags/transport-und-verkehr-52?sort_by=changed) are build upon.
I'm especially looking for standards, that regulate city traffic data, how they are stored, how they can be received (hopefully in near future), and how the stored information can be linked to common road transport network specifications.
If there are standards, i would appreciate if you could point me in the right direction.
Thank you very much.
We are doing some research on Bonn's urban structure and historical buildings/ neighborhoods/ destinations. We would like to find all sorts of GIS data for Bonn (e.g., topography, building footprint, streets, land use and land cover, sociodemographics). Web searching has been challenging for us, as we don't speak German. Can someone recommend some web pages where open data can be found? Thank you!
I am leader of the research project B-DATA – Big Data Assemblages: Techniques and Actors. B-DATA has the intent to study data assemblages inside research centres and data infrastructures which produce and use open data and Big Data. The three main case studies are: the Consortium of European Social Science Data Archives (CESSDA, Norway)); Italian Statistical Office (ISTAT, Italy); the Web Science Institute of the University of Southampton (UK).
During my research i came accross to some of the topics that have been adressed by your project. I was thinkink if you were interested in making a comparison with what is happening here in Italy at ISTAT. I can have full access to this field.
This is just a first contact. If you are interested, we can have some contacts to understand the feasibility of that.
I attach an open access article that show some of the finding of B-DATA, that may help you to understand what I am researching.
I am looking forward to hearing from you.
Yours sincerely, Biagio Aragona.
Assistant Prfessor of Sociology
University of Naples Federico II
The general public relies increasingly on mobile applications for day-to-day services. Beyond sharing data sets, it is more and more important for agencies such as the Asian Development Bank, USAID, the World Bank, etc. to satisfy the communications needs of their clients, audiences, and partners (not to mention those of their own employees) by the same means. Across the project cycle, meaning, the various stages from country programming to project completion and evaluation, where might there be opportunities for cheap, effective, and low-maintenance mobile applications à la "build-once, deploy-anywhere"?
I am looking for data set to develop a flood forecasting system specifically for an agriculture dominated watershed. The requirements include a) fine resolution DEM b) land use type and soil hydrologic group c) hourly/sub-hourly rainfall data for a minimum of 30 years d) hourly/sub-hourly stream flow data and e)demographic details of the area. It would be helpful if you suggest any data repository/experimental watershed for collecting the above mentioned data. I assure you that the services offered will be duly acknowledged.
The trend recently is towards sharing data in open data repositories. Some journals require that data be sent to such repositories as articles are submitted. Considering the pros and cons of data sharing, how willing will you as a researcher be willing to share the yet unexplored data of your research vis-à-vis the requirements for such submission by some journals?
Public organizations face important challenges regarding information access. There is an increased pressure for transparency, open data and information access. On the other hand, there is the right to privacy, the right to be forgotten, commercial and banking secrecies. How to find an acceptable balance?
I have tens of thousands of individual scans in proprietary file formats, and I want to make these public. I need a format that is free and open, or to make my own.
Our proprietary software offers a CSV option, but doesn't export all useful data to the file. In addition, the CSV file it creates is more like two spreadsheets, with the second half having the per-channel photon counts.
I've considered using XML because it is both machine and human readable. My only concern is that XML is bloated. XML has the added benefit of being readable over a web browser, and can be quickly converted to almost any language, including JSON.
Microsoft INI format is also machine and human readable, but INI is fairly phased out. Software writers still have full access to INI functions though, so I wonder if this is still a viable format. INI also converts well to object notation.
Both INI and XML could better represent two spreadsheets worth of different-typed content in a single file than a CSV.
What are your thoughts?
I am currently looking for an open data set including both brain imaging and physiological signals (prefer acquired brain injury).
Do you have relevant information that I am looking for?
As a number of funding agencies increasingly ask for data management and open data, I was wondering if any open database for to deposit and search for chromatographic data (HPLC, GC methods and chromatograms)?
Data have become increasingly more important for all types of professionals to scope, evaluate and maintain resilience projects around the globe.
But how do resilience practitioners make informed decisions and know where and
how data can be used to promote the sustainable development goals, transparency
#OpenData #Bolster #Resilience #Efforts #increasingly #professionals #sforzi #resilienza #professionisti #progetti #sostenibile #scope #globe #accountability #sustainable #Randieri #Intellisystem #IntellisystemTechnologies
Recently I downloaded Satellite Data of JERS-1 SAR. The first problem was the data didn’t opened in any software like (SNAP, MapReady, ArcGIS, ENVI, Erdas). The second problem was the data when extract it got without any extension, it read just file.
Please Help my find program open these data.
Meelad A. Hussein
Open science seems to become more and more important for Higher Education. I wonder if Open Science is an own Mission of HEI, or if it i one aspect of Third Mission of Higher Education Institutions.
What do you think?
In my opinion many parts, but not all aspects of open science belong to Third Mission, but Open access and Open data not.
Looking for a fruitful discussion on this topic.
Have a nice day and best regards
I'm looking for precipitation data in Brasil with the following requirements:
- covering 2017
- spatial resolution: less than 500 m
- based on measures in Brasil (not only modeling)
Thank you for your precious help on this,
When I want to open my project file in smartPLS 3.0, nothing appears on it, even on the indicator's pannel. In the indicator's pannel there is a message that'' Your data file setup is corrupt. Please open the data file and revise the setup''. I have also changed the file format to csv. I appreciate your kind help in advance!
I require some form of risk measurement (interval or ordinal scale) within business activity classifications.
I do have some NACE v2 (Statistical classification of economic activities in the European Community) classes (at least on level 2 - two digit format [88 items]), and I need a variable for these, which do express the comparative total risks associated with that specific business activity.
I have thought about simple volatility measures based on equity markets (for example average, 5 year annual volatility for stocks in a class), but surprisingly such data is not easy to get. The first problem is, that there are numerous classification systems (SIC, NAICS, ISIC, GICS, BICS, TRBC, ICB) and most viable sources use classifications which are lacking direct correspondence tables to NACE v2 (for example SIC from 1987). The latter four classification systems used by market data vendors are proprietary and there are no correspondence tables at all. Even so, I might have access to a Bloomberg Terminal, that data is problematic to be used in my publication. Some form of open data would be more helpful. I have seen this article: Kakushadze, Zura; Yu, Willie (2017): Open Source Fundamental Industry Classification. In Data 2 (2), p. 20. DOI: 10.3390/data2020020, but it is using SIC as well, and it gives no clue on how to produce the volatility measures (how to acquire and process ten thousands of time-series to get the aggregated volatilities). And that procedure might fall outside my capacities.
We are working on analysis of those medical records in traditional medicine. But we are lacking those medical records in traditional medicine. Is there any place in which I can find the open data of those records?
Those records may include narrative texts with tradition medicine terminology.
I need a open data set which contains reviews (comments) about people (eg: doctors / lecturers / politicians) for my project from where can I get that type of data set?
Hi guys, recently some of my data are presenting the following error message ""Invalid RR interval values (RR zero or negative!). File cannot be opened" error" . I'm collecting HR and HRV through Polar RS 800 CX and analyzing with Kubios HRV. Does anyone faced some similar problem or know how to fix it? (This problem appears while opening the data txt files with Kubios)
Thanks for your attention all! Greetings from Brazil.
I am looking for an open data set which includes students information related to courses, degree programs etc. The label of the data set should be grades of students.
In many implementations of Learning Analytics systems at universities the collection of learner-generated data is routine. But, students are typically not advised that this is happening. In formal research contexts such data collection would constitute a breach of research ethics. This issue is one among many others as it becomes easier & easier to produce & collect data
What is the relationship between "free speech" and "open data"?
Using the spectrometer, I obtained the raw graph (image attached) which has its x-axis as wavelength. I understand that it needs to be converted to wavenumber in order for me to obtain the raman spectra.
How do I go about this? Do I have to open the data in matlab and code the program to convert every point within the data?
I ‘m looking for P3 micro CT-scan of recent Pongo (5 - 10 specimens).
So, if you know:
-someone who can share some micro CT-scan data,
-or the name of an institution and contact susceptible to provide micro CT-scan data about Pongo teeth.
-or database like “MorphoSource” where I can access to micro CT-scan open data.
I will appreciate your help.
Does anyone know where I can find statistic data about car-sharing systems in the world?
In example: number of systems, number of cars, number of registered users, number of rentals, number of dedicated EV charging stations etc.
For each continent or country.
For last 10-15 years.
I'm an open source activist contributing on Promotion/Participation/Initiation of FOSS Related activities from Nepal. During my undergraduate study, I contributed from Kathmandu University Open Source Community an autonomous wing of Kathmandu University Computer Club.
Currently I am continuing to contribute via FOSS COMMUNITY NEPAL. So this year FOSS Nepal is planning to organize a Conference focusing on the Free and Open Source Software. As I'm taking the responsibility of setting the roadmap as well as managing the overall conference, I am not only collecting the research done by researchers but also investing time and effort to incubate research topics and also helping researchers in doing he research. So I would like to present you Researchers my ideas I've thought about and would want feedback on my points as well as new ideas on doing researches: 1) Doing Research on Penetration of FOSS Technologies mainly focusing on Government sector, School, Institutions and Research Centers 2) Adaptation of Free and Open Source Softwares in Health, Education, Research. 3) Open Data and it's implementation for solving community problems.
In order to do research and surveys on above mentioned topics, I am planning to work with Student from different college and Universities. But currently I don't have plans and action items which would make sure that these researches would get completed because I have no areas of generating resource.
Therefore I would like to request you researchers to help me identify opportunities, challenges and resource generating aspects of successfully organizing a Conference.
It would be great to hear from you all.
Volunteer, FOSS NEPAL
I am looking for databases where I can access open data. I already looked at https://www.ebi.ac.uk/ena/data/search?query=hunter+gatherer where I found some data. This is a good starting point but maybe there is more out there.
We are looking for SNP data, we are trying to create a sample of hunter-gatherer populations, where we will include our sample.
With the help of numerous satellites, we are able to research the earth and sun (space physics). However, the study on the other planets in the solar system is still lack. I wonder if there have some missions are available for the public to study the planets?
Open science is advocated by science funders, policy makers, and increasingly scientific communities. Yet, it seems there is little research on the effects of making data and code available, publishing open access or reaching out to the public (e.g. citizen science).
Do you know of studies that measure what effect open science practices have within academia (e.g. increase in citations) OR outside academia (e.g. commercial use of data)?
Please help me collecting hard evidence for the assumption that "open science" is in fact beneficial for scientists, science, and society at large.
Any source recommendations, studies, grey literature on the topic would help.
Thank you very much!
I'm trying to see if my data has normal distribution so I want to do the kolmogorov smirnov test. So if I follow the instructions below then do the kolmogorov smirnov test without doing anything else will that be okay?
Importing data from Excel:
• Data can be imported from Excel into SPSS for statistical analysis
• Click on File -> Open -> Data -> click on the drop down arrow next to Files of type to see different types of files. Choose Excel and locate the file that you want to open
• SPSS will ask for confirmation that the first row contains the variable names and of the length and breadth of the database. Click OK
• The SPSS Data View that opens will contain the data. However there will be no variables Labels or Values
Hi all! I have some MR spectroscopy data from epileptic patients and normal cases. I want to compare the metabolite rates in normal cases and 2 types of focal and generalized epilepsy. I need to be able to review the data and import the values into MATLAB. I'm not able to open data in MATLAB. Do you know of any toolbox or function to help with this?
Thanks in advance
Some of my colleagues are planning to analyse news headlines related to climate change. Does anyone know of a good source for historical news headlines as open data?