Science topic
Database Analysis - Science topic
Explore the latest questions and answers in Database Analysis, and find Database Analysis experts.
Questions related to Database Analysis
Bi0.8Sm0.2FeО3
Space group: P n m a or P b a m
Hello dear scientists
Ideas for good articles or databases on cystic fibrosis and common or potentially pathogenic variants are most welcome. And what strategies would you suggest for carrier screening for cystic fibrosis?
Can I use the SRA Run Selector on GEO Database to compare candidate genes, if yes, then how? I have tried analyzing by GEO2R but I am not getting the right information I need.
Dear All,
I would like to ask, is it possible to obtain data in some databases, websites about sexual behavior in different countries of Europe or the World? Thank You!
Best regards
Stefan
Tl;dr: I’m trying to convert gene IDs of an obscure MRSA strain from Ensembl Bacteria to KEGG.
Hello,
I’m trying to do a pathway enrichment analysis of MRSA strain 107 using GSEA. I have gene expression data that are associated with the gene IDs from Ensembl Bacteria. I plan to use KEGG as my pathway database.
GSEA requires a .gmt file of the gene IDs/enrichment data (of which the gene IDs are from Ensembl), then requires a pathway file (from KEGG). If I try to do the analysis with both of these files, the gene IDs don’t match up, so GSEA can’t do it.
My question is whether there’s a way to convert these gene IDs specifically with these strains of MRSA from Ensembl Bacteria to a site like KEGG. Here are the resources I’ve already tried:
DAVID
Dbtodb
Syngoportal
G:convert
MetaScape
BioMart from Ensembl
Annotationdbi
All these are tools that work, but they don’t include my strain. How should I convert these Ensembl Bacteria gene IDs? Is there another option I don’t know about?
PS. I don’t need to use KEGG; if a different pathway database works, that would also be acceptable.
Gene Ontology provides many genes annotated as taking part in various morphogenesis processes but I want to get a list of all morphogen coding genes specifically. Uniprot does not have "morphogen" keyword.
I need some gene annotation database that has genes labeled as "morphogen".
Below are some issues related to Big Data database technologies that can be developed scientifically:
- Application of data processing technology in Big Data database systems for modern education 4.0,
- Improvement of forecasting of natural, climatic, economic, economic, financial, social etc. phenomena based on analyzing large data sets,
- Analysis of sentiment, opinions of citizens, Internet users regarding brand recognition of companies, customer reviews of specific services and products, views on various topics, citizens' worldview based on the analysis of large collections of information downloaded from various websites, from comments downloaded from social media portals,
- Analysis of information and marketing services of commercially operating companies that carry out specific analyzes of sentiment, citizens' opinions, Internet users regarding brand recognition, customer reviews of specific services and products etc. on behalf of other companies that purchase specific analytical reports,
- Analysis of the possibilities of cooperation, synergy, correlation, conducting interdisciplinary research, connecting Big Data database systems with other information technologies typical for the development of the current fourth technological revolution called Industry 4.0, which include technologies such as: cloud computing, machine learning, Internet of Things, Artificial Intelligence, etc.
In what other areas are the technologies of processing and analysis of information in Big Data database systems used?
Please answer
Best wishes
Dear Colleagues and Friends from RG
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications:
I invite you to discussion and cooperation.
Best wishes

I guess there must be some data collected regarding Covid and related to the field of psychology/psychiatry, considering its psychological impact. It might be gathered from the patients, family members or the society at large, either a public or private collection. Does anybody have any idea on how to access such data for research purposes?
Does anyone of you use sentiment analysis in research conducted on data downloaded from the Internet and analyzed in the Big Data database system?
If so, please let me know in which issues, in which research topics do you use sentiment analysis?
Is sentiment analysis helpful in forecasting economic and financial processes?
Please reply
Best wishes

The improvement of specific risk management systems is particularly important in many areas of functioning of commercial business entities, financial institutions, public institutions as well as conducting investment, research and other projects.
How important is this is, for example, the global financial crisis that appeared in mid-September 2008, when specific financial, investment and credit risk management systems were not properly improved and the procedures of investment activity, including credit, were not carried out reliably, as well as customer service, and violation of business ethics in investment banks operating at the time and many other types of financial institutions and business entities.
please reply
Dear Colleagues and Friends from RG
The key aspects and determinants of applications of data processing technologies in Big Data database systems are described in the following publications:
I invite you to discussion and cooperation.
Thank you very much
Best wishes

Hello advisors. I am glad to be here in this Community. If you allow me, I would ask if anyone of the members here know about Databases of cutting tools. Nowadays, I am working on a project and the main goal is to research about lifetime of cutting tools using the Cox proportional hazards model. At the moment, I just have one Database that contains cutting speed, feed rate, tool failure time and depth of cut. My research question is: Which of these variables are the most representative that give us the time of failure?
Any kind of help, comments and questions are welcome.
In my opinion, analytics based on the processing of economic information accumulated in Big Data database systems will be used for researching and analyzing determinants of current economic processes, for risk management and for forecasting economic processes.
Please reply
Dear Colleagues and Friends from RG
Some of the currently developing aspects and determinants of the applications of data processing technologies in Big Data database systems are described in the following publications:
I invite you to discussion and cooperation.
Best wishes

What kind of scientific research dominate in the field of Big Data database systems?
Please, provide your suggestions for a question, problem or research thesis in the issues: Big Data database systems.
Please reply. I invite you to the discussion
Dear Colleagues and Friends from RG
Some of the currently developing aspects and determinants of the applications of data processing technologies in Big Data database systems are described in the following publications:
I invite you to discussion and cooperation.
Best wishes

Does anyone know if an endomyocardial biopsies database exist and can be downloaded?
Will the development of data processing technology accumulated in the Big Data banking database systems improve the credit risk management process or will it contribute to the development of Shadow Banking and the use of unethical practices for the surveillance of potential borrowers?
Large commercial banks generate high financial surpluses allowing for the implementation of modern integrated teleinformatic internet banking systems, Business Intelligence data analysis systems, data processing platforms in Big Data database systems, etc.
There were already situations of unethical use of modern ICT solutions, analysis of comments on social media portals, during which the bank verified the customer's data entered into the loan application by also scanning information that the potential borrower types in social media portals.
This informal verification took place without the knowledge of a potential borrower and could then be the basis for suing the bank.
However, the bank's client is not always aware of the fact that it can be invigilated in such a way by the public trust institutions that the bank should be.
Of course, these types of cases, which we know from the media is supposedly a margin of entire banking, which can be one of the categories of a new type of unethical practices typical of the so-called Shadow Banking.
However, only part of this type of information gets to the media.
Maybe this is just so-called "the tip of the iceberg" of this problem.
The situation is similar in the situation of cybercriminals' attack on bank IT systems or electronic banking platforms.
If it is possible to keep this type of events secret, then customers do not find out about it.
This is because media only receive information about some of these types of events.
Does any of you conduct research in this area?
If so, I invite you to cooperation.
I am asking for comments
Dear RG members
I frequently use the Web of Science. What‘s your choice? Thank you so much!

Is it possible now or in the future to create an artificial intelligence that will draw knowledge directly from the analysis of Internet resources and learn this knowledge?
Please reply
I am conducting research in this area. Based on the findings, I conclude that the rapid development of artificial intelligence (AI), as seen in the increasingly popular chatbots, raises questions about its capacity for self-improvement and autonomous learning. These chatbots, trained on huge data sets and improving through interactions, still operate within algorithms created by humans. Although they can learn and process information, their ability to self-improve is limited. However, advances in technology are moving us in a direction where autonomous learning AI is becoming more and more feasible, although it still requires overcoming technological and ethical challenges.
My research and observations show that artificial intelligence technology has been rapidly developing and finding new applications in recent years, there are new opportunities but also threats. The main determinants, including potential opportunities and threats to the development of artificial intelligence technology are described in my article below:
OPPORTUNITIES AND THREATS TO THE DEVELOPMENT OF ARTIFICIAL INTELLIGENCE APPLICATIONS AND THE NEED FOR NORMATIVE REGULATION OF THIS DEVELOPMENT
Please write what you think about this issue? Do you see rather threats or opportunities associated with the development of artificial intelligence technology?
What is your opinion on this issue?
What is your opinion on this subject?
Please respond,
I invite you all to discuss,
Thank you very much,
Best wishes,
I would like to invite you to scientific cooperation,
Dariusz Prokopowicz

Yes, in my country, the Scopus indexing base is considered one of the most important. The Scopus database is recognized as the main scientific database for the indexation of scientific publications characterized by high citation. However, on a global scale, the bases of indexing scientific publications recognized in various countries by various centers and scientific institutions are at least a dozen or so. However, these various indexing bases are not usually fully comparable, they are functionally differentiated and thanks to that they are only partially substitutable, but more often they are complementary. The question of complementarity should be developed. Then it will serve the development of scientific research and international cooperation of scientific communities.
In view of the above, I am asking you with the following question: Is the Scopus database recognized in your country as the main database for the indexation of scientific publications?
And if not the Scopus database, which other database of publications and scientific journals is considered the most important in your country?
Do you agree with me on the above matter?
What do you think about this topic?
Please reply.
I invite you to discussion and scientific cooperation.
Thank you very much.
Best wishes.
Dariusz Prokopowicz

Is the security of information collected in social media portals databases currently one of the key determinants of the development of new online media?
Security of social media portals is currently one of the most important topics of social media portals and other new internet media and information services. Therefore, scientists at various universities are involved in researching this issue. Therefore, security tools for information collected in social media portals databases and data security systems on the Internet are being developed. In companies and key public institutions, systems for risk management of information systems and information transfer on the Internet are also developed.
Do you agree with me on the above matter?
In the context of the above issues, the following question is valid:
Is the security of information collected in social media portals databases currently one of the key determinants of the development of new online media?
Please reply
I invite you to the discussion
Thank you very much
I also conduct research in this matter. I am researching the security of social media portals in connection with Big Data database technology. Below are links to my publications:
I invite you to discussion and cooperation.
Thank you very much
Best wishes

Greetings from my side.
I am facing problem in getting annual reports banks throughout Asian and Euro-Asian countries for my content analysis. Need suggestions regarding any source where I can get data these easily.
Considering the specifics of the increasingly common IT systems and computerized advanced data processing in Internet information systems, connected to the internet database systems, data processing in the cloud, the increasingly common use of the Internet of Things etc., the following question arises:
What do you think about the security of information processing in Big Data database systems?
Please reply
Best wishes

After some research, I found nothing satisfactory. The data do not allow for strong linkages.
Thanks for your help !
Best regards
Where I can find a global database free for Arab Maghreb Union (AMU) (classic and Islamic banks) with information about profitability, liquidity, capital adequacy, asset quality ... of Islamic banks worldwide?
Given your specific discipline. Have you ever irretrievably lost data of an ongoing research project? How did you handle it? Thanks in advance.
(Also, this is my story. A couple of years ago, in a study that included collection, preservation, identification and weighing of soil invertebrates, after an unfortunate event in the laboratory, the notebook that contained the weight notes of one of 10 sets of collected organisms, which belonged to the control group, was lost, so were the preserved organisms. I'm tagging this with an entomology lablel, so in case you're familiar with this topic: Would you consider trying some method of reconstructing the weight data or is there just nothing to do? There is no way to recover the notebooks, nor the preserved organisms).
Thank you.

Bi0.8Sm0.2FeО3
Crystal system: Orthorhombic
Space group: P n m a
Space group number: 62
The current Legionella pneumophila SBT databse ( http://bioinformatics.phe.org.uk/legionella/legionella_sbt/php/sbt_homepage.php / http://www.ewgli.org/ ) maintaint by Public Health England’s Bioinformatics Unit is down for the past 2 months, and no email replied from their listed email.
Is there any other database I can refer to, I stuck in the middle of my study.
Thanks
Hi!
My name is Catarina and I'm looking for some help in Innovation Marketing- Product Innovation.
I'm going to join a project which aims to analyse the consequences of implementation of an improved product in a certain store -emphasis on the fact that this is about incremental innovation, which means improving an already existing product and not developing a new one.
What I need to know is:
- How to perform a statistical analysis on changes in the market after the implementation of this case of innovation marketing, specifically product innovation, in order to assess the consequences of this implementation.
- What methods should I use to conclude what the clients consider to be an innovation?
- What statistical analyzes are usually performed in the evaluation of the implementation of Innovation marketing?
- How to measure the weight of positive / negative aspects?
- What are the stages of this study before implementation (forecast) and after (evaluation)?
All of these in a perspective of statistical analysis.
Also, if there is any documentation on this subject that could be helpful please let me know.
Thank you :
Do you know any database for searching about endemic species, their genomes, differences, DNA barcodes etc.?
We are looking to implement a web-based lab notebook as well as a tracking system to upload various assay results for several analogs of a parent compound. We will need to keep very close track of lot numbers, dates received, chemists who synthesized them, ect. Does anyone use a service which would be helpful?
We want to ascertain a reliable and effective database for the analysis of LC-QTOF-MS/MS data. The database should be a freely available one.
I'm looking for a database, which contains financial statements of companies but presented quarterly not yearly. Amadeus database presents data only yearly and only for last 10 years. Do you know some other sources?
I am looking for company-level data on R&D expenses, because I would map it to M&A data in order to assess the impact of (cross-border) M&A on R&D intensity. Thank you.
Hi there,
Could anyone suggest any database that could be used to find protein binding peptides ?
Thanks,
Chun
Currently, various data security tools are used in Big Data database systems. The basic principle is the parallel use of several types of IT security and compliance with specific procedures for analyzing and securing systems against potential materialization of operational risks, including technical risks associated with used computer hardware and specific database technologies and personnel risks associated with employees who support these systems.
The key issue is also whether built database systems are directly connected to the Internet online or are not permanently connected to the Internet and certain data from the Internet are added from time to time to Big Data databases after their analysis by anti-virus software, detecting malware worms, such as keyloggers and other malicious software created by cybercriminals and used to steal information from database systems of data warehouses and Big Data.
In a situation when Big Data database systems or other systems where important information is collected are connected to the Internet online, then the information sent should be encrypted, and system gateways connecting the Big Data database with the Internet should be equipped with a good firewall and other filtering security incoming information. If the employees operating the Big Data database system use certain e-mailboxes, they should be only company mailboxes and verified from the security side of data transfer on the Internet. The company should have strict security procedures for using e-mail boxes, because in recent years via e-mails cybercriminals have sent ransomware programs hidden in e-mail attachments, used to encrypt hard disks used in company and server databases.
Do you agree with me on the above matter?
In the context of the above issues, I am asking you the following question:
How should Big Data database systems be protected against the activities of cybercriminals? What types of programs and systems for securing Big Data databases against cybercrime are currently used? What other types of security instruments for Big Data database systems are currently used?
Please reply
I invite you to the discussion
Thank you very much
Best wishes

What are the important issues for you related to the collection and processing of large information sets in Big Data database systems?
The current technological revolution known as Industry 4.0 is motivated by the development of the following factors:
- Big Data database technologies,
- cloud computing,
- machine learning,
- Internet of Things,
- artificial intelligence.
On the basis of the development of the new technological solutions mentioned in recent years, the processes of innovatively organized analyzes of large collections of information collected in Big Data database systems dynamically develop.
In my opinion, the fastest-growing business projects are primarily those that are the subject of innovative startups developing dynamically for a minimum period of several years. Startups develop innovative business projects in such areas as: information technology, ICT, Internet, biotechnology, energy, ecology, environmental protection, medicine, agribusiness, etc. In addition, a number of innovative technologies in construction, material, process and marketing innovations have recently been created in the field of smart city, life science, cleantech, medical intelligence and others that are used by companies and corporations operating in various sectors of the national or international economy.
In addition, innovative business projects are also developed in the fields of various fields of information services, advanced data processing, business analytics and the development of teleiform technologies, which together are the pillars of a knowledge-based economy. The current technological revolution known as Industry 4.0 is motivated by the development of the following factors:
Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence. It is anticipated that in the next years in these fields of science and technology many large startups will be created, which will be developed effectively based on innovative business projects related to the topics mentioned above. On the basis of the development of the new technological solutions mentioned in recent years, the processes of innovatively organized analyzes of large information collections gathered in Big Data database systems dynamically develop.
In each of these areas, many specific design topics can be distinguished, in which business startups develop and reach the minimum level of a medium-sized company or large corporation in a situation of spectacular business success based on a well-designed business and an effectively implemented innovative business project.
What other technological improvements, innovative organizational, technical and IT solutions will be developed in the future based on the development of the above-mentioned factors?
Will the development of data mining technology, machine learning, artificial intelligence, Big Data data analysis, etc. develop new branches of the knowledge-based economy or only make use of these technologies in already existing branches, sectors of currently developing economies?
In view of the above, I am asking you: What are the important issues for you related to the collection and processing of large information sets in Big Data database systems?
Please reply
This issue is described in the following publication:
I invite you to discussion and cooperation.
Best wishes

I'm working on NrCAM gene mutation. I'm searching to know about any known mutations recorded in NrCAM gene. But I'm not getting any databases or articles regarding that. Can anyone suggest me any mutation database or links to know about the mutations of NrCAM gene in humans.
Big Data database systems can significantly facilitate the analytical processes of advanced processing and testing of large data sets for the needs of statistical surveys.
The current technological revolution, known as Industry 4.0, is determined by the development of the following technologies of advanced information processing: Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence, Business Intelligence and other advanced data mining technologies. All these advanced data processing and analysis technologies can significantly change and facilitate the analysis of large statistical datasets in the future.
Do you agree with my opinion on this matter?
In view of the above, I am asking you the following question:
Will analytics based on data processing in Big Data database systems facilitate the analysis of statistical data?
Please reply
I invite you to discussion and scientific cooperation
Best wishes

I am looking for a face (male and female) which is proved to be sympathic and is free for use. Does anyone have a suggestion?
While doing analysis on NRD database with SAS software, I defined IndexEvent using primary diagnosis variable (DX1) and calculated 30-day readmission rates for IndexEvent.
I want to find out the Primary Diagnosis for Readmission. I want to know the reason for readmission if its same as IndexEvent or any other reason for Readmisssion. Which variable I should use to look for Readmission Primary Diagnosis?? Any Specific SAS code I should use??
Anyone familiar with this please guide.
Thank you in advance!
Future of Big Data Analytics
In my opinion Big Data database technologies find more and more applications in business analytics, in Business Intelligence, in marketing analysis, in consumer preferences research, sentiment analysis based on comments on online portals, including social media. However, how will this Big Data analytics development look like in the future? This is determined by many factors regarding various issues, including the security of transfer and processing of data contained on the Internet, technological progress of data processing, information and marketing policy of online technology companies, including companies that manage leading social media portals, etc. Add analytical capabilities conducted research on the development of Big Data technology, the potential for using Big Data for industrial espionage, cybercrime and for maintaining information security and information systems by national and supranational security services to combat cybercrime, international money laundering transactions, transfer money to tax havens , terrorism, inducing destabilization in capital markets, etc. In this regard, an "arms race" will be the formats employed in the development of Big Data technology, employed on the one hand by legally operating companies, financial institutions, including banks and security services and hackers employed by criminal organizations who will continue to break into company information systems, banks and agencies by creating new cybercrime techniques government. This "arms race" is endless and is probably one of the key determinants of technological progress that is taking place on the Internet, including Big Data technologies.
I am currently conducting research in this area and I invite you to cooperation.

I read an interesting question this week about a researcher wanting to know how to sell a patient's database... Then, the related question emerged in my brain: is there a set formula that is used to value the database ?
Effectively, a database owner may be asking himself for a price per client record - that is used to value your database when you are looking to sell your clinic or a Research Databse.
An interesting concept don't you think?
The idea being that is you have 1000 clients on your clinic database - when you come to sell the business - you simply multiply the number of clients by a set dollar amount - say $10 per contact - meaning that your database would be worth $10,000 of your sale price.
However - as I a sure you can appreciate - this simple calculation method is not that easy - or accurate for that matter.
There are many things that can impact on the value of your database - so a simple "dollar amount per record" would not work in many cases.
One of the variables that impact on the value of your database is how responsive this list of people are to your marketing messages.
If you have 1000 contacts in your database - who - when you email them an offer - you get a 50% email opening rate - and 20% of them take you up on your offer - then this is a valuable list.
If your 1000 contacts - have an email opening rate of 10% and 1% respond to your offer - then this list is not as valuable.
Another variable in this valuation process is the lifetime value of the contacts on your list.
If your 1000 contacts are all high income clients who regularly attend your clinic and buy lots of extra products and services- then they are much more valuable than a list of people who attended for a single "lead generation" low price offer - and never return.
Another value determining factor in health business databases is the concept of "Recency" - that being - when was the last time these clients actually attended your clinic for a paid service.
If your business has been established for many years - there may be a large number of past clients who no longer live in your area , have swapped to a new health provider - or sadly - may have even died.
So of your original 1000 contacts - there may only be 500 who have actually visited your clinic in the past 5 years.
Again - this lack of recent buying activity will also impact on the value of your database when it comes time to sell your clinic.
So what is the take home message for you as a current or future health business owner?
If you are looking to buy a clinic - ask lots of questions about email opening rates, age of the database, responsiveness to marketing messages (assuming any marketing messages are even being sent by the current owner) , buying habits, purchase frequency and recency of last visit.
If you are selling a business and want to get maximum value for it - make sure you are in regular contact with your client list and can demonstrate a solid and recent buying history for these clients.
I have personally seen practice databases with over 10,000 clients details but due to poor marketing and lack of follow up - the list is largely worthless to a potential buyer.
Do you have any interesting research database for selling? Do you need help to estimate a good VALUE and CLIENTS for it?
In recent years, the field of research and business applications in the field of obtaining, archiving, analyzing and processing data in Big Data database systems has been developing strongly.
In many companies, especially in large corporations, integrated risk management systems are built and improved.
Integrated risk management systems combine risk management processes in various areas of a company, institution or other organization.
One of the areas of risk management, the importance of which in many companies is growing, is risk management in the area of obtaining, archiving, analyzing and processing data in Big Data database systems.
In view of the above, I am asking you: Are the risk management instruments and models for the acquisition, archiving, analysis and processing of data in Big Data systems already being developed?
Are you already familiar with examples of this type of systems with risk management risk, which concern the acquisition, archiving, analysis and processing of data in Big Data systems?
Please reply
Advanced technologies of digitalization and automation of data processing first find their application in business. Then also in public institutions can be introduced including in the field of e-governance. This also applies to Big Data database technologies, which is applicable in various sectors of the economy, but due to the high investment costs of implementing this technology in the business processes of business entities, so far only large corporations and larger enterprises can afford such technologies. However, in the future, investment costs of implementing tech technologies into business processes should decrease and processing technologies and data collection in Big Data database systems should be available also for smaller companies, including business entities of the SME sector.
I invite you to the discussion.
I´m searching for databases to install in my laptop to identify XRD patterns, and phases, if you could please post me the link to a free database it would be extremely helpfull
I am using fuzzy logic to help decision-makers to get "quickly" and reliable decisions. I have collected many projects and I am interested on 12 parameters. So Finally my system has 12 inputs, each one has 3 membership functions(MFs), and one output with 4 MFs.
But I have a problem regarding the Inference system, i.e. If-Then-Rules. My parameters have a solid Relationship between them, so I think that I Don't have to consider all possible combination ( 3^12) but I have to make sure that my system got a good training and then I can trust him.
Before talking about the testing the system with a sall dataset, let's talk about the way I can consider all possible If then rules. should I come bach to each project and according to each one I see on which membership it belongs, or I use Simply the parameters ranges that I already realised and build my If-then-rules according to that?
One more question relating to unsuccessful projects. how could I consider these information (example: when a was low and b was Med and ….. Then the project X was insuccessful, Can I use it like this:
If a is Low and b is Med and …. Then output is not X ?)
I will more than happy f you share with me your experience so I can adapt it to my problem and act accordingly.
Thank you, and I am wainting for your interactions.
King Regards,
It's the problem I have to solve, I don't have much knowledge about Distributed database, can any one help me?
i have developed a bioinformatic tool (Link: geltowgs.uofk.edu). that compares PFGE gel image analysis results to mathematical models of band sizes derived from WGS (FASTA files). I have suggested a new algorithm to count DNA fragments that co-migrates across each lane. We are about to publish ower work (Research gate DOI: 10.13140/RG.2.2.32752.76806) . The attached file illustrate our method.
thank you
Hello
Do any one know how to add an institution to SciVal to have its own performance and analysis? All researchers belong to institution had Scopus indexed Author ID.
Thanks
I am working on financial inclusion status both cross country analysis and within Bangladesh. It will be helpful for me to get some recent publications related with this and a fruitful database.
Hello,
I am conducting some preliminary concentration-fixing steps on a pesticide used for house fly larvae. My initial data from 4 initial concentrations came out like this: highest conc. - 12/20 dead, second highest - 10/20 dead, third highest - 7/20 dead, fourth highest - 3/20 dead, control 4/20 dead. When I do Abbott's correction on the fourth highest, I get a negative value. Do I simply not use Abbott's correction at all for any of these concentrations, or do I count the fourth highest response as 0/20? Thanks in advance!
Hi all,
for my current literature review I would like to try something new. I would like to search papers graphically, by highlighting the relations between papers in one "conversation", i.e. which are linked through a common factor, like a keyword.
Ideally the output should look something like the attached picture, with each dot representing one journal article and the links representing the citations between the articles.
Does anyone know a database or a software that is able to do so?
Thanks for your help!
Best,
Jens

I need some kind of database from a few years ago and now
Hi everyone. I have a database of operating conditions in a power system for different kind of faults (1000 operating condition, each one has been reevaluated for single line outages, so total number of samples are: # of branches * 1000). The problem I'm facing right now is the tremendous amount of samples in my database, because for each line outage I'll have 1000 operating conditions and that will give me so many samples to analyze.
To reduce the samples, I decided to make use of ReliefF algorithm. I reshaped my database matrix in a form that the rows are my operating conditions and columns represent outage of each line. The entries of the data matrix are stability index for each operating condition in single line outages. Using k-means, I classified similar operating conditions and then using ReliefF, I found the best features(here branch outages). Now using the branches that are more important attributes, I decided to delete other operating conditions that are related to less important branch outages.
So my question: Is the whole thing sounds logical?
If not, is there any idea that I can use. I want to reduce my samples.
Thank you for helping me.
I want to study which arginine or lysine residues in target proteins are preferentially modified by glycating agents like methylglyoxal. Is there a database or any other resource which lists the modified proteins with specific data on the amino acid position and the nature of the modification (e. g., MG-H1 at position Arg xy in protein z)?
I would appreciate your help.
Thanks and best regards, Christian
Dear Researcher.
I have only one meteorological station inside the study area, the others are outside but near the study area. The only information available for these stations is rainfall and temperature data. Kindly guide me with this information , Is it recommended to use SWAT for stream flow (Runoff/ Discharge) simulation ?
I have pavia dataset at the given web location. Here, class & samples are define. I want to know how to define these classes at what parameter in matlab.
Result of disease can be in a form of ......cure/ not cure/ better/ improved
or anyset of binary which reflect condition of patient in terms of result...
I want to analyze classification datasets based on their characteristics; such as features, attributes, data type, data complexity, dataset size, number of instances etc.
If anyone has already performed such or related analysis, kindly guide me or suggest techniques, ideas for effective analysis.
Thank you.
I collect fingerprint using sensor. It obtained two data: 1) raw data and 2) base data. The raw data obtained after touching sensor, while base data obtained without touching. I manually saw the difference but it is difficult for differentiate in large database, so is there any algorithm to find the difference between these two database?
What are the applications of ERD and normalization?
I need large datasets (>100,000) for some query questions using natural language and their corresponding answers
We have developed some novel algorithms in our group for early prediction of SCA using MIT-BIH database. We now need some other data sets to validate our algorithms.
I'm working on modeling DSM in smart grid. I need the electricity usage of different appliances in different homes.
where can i find them?
Exist a new governance and social hierarchy based on databases. We live inside a network of relationships that produce value and its measurement through algorithms. (Perhaps here there is the same risk too).
Does anyone know a critical organized?
Genome is a big database, how can we use the potential resources? Are there good methods or ways in addition to first mapping? I think the study of methods and tools about utilization of the sequenced genome is more important, such as studying statistics.
How do I determine that a database is cluster-friendly and therefore that it's possible to be confident in using an algorithm as k-means (for example)?
to discover the structure of the database
Note : the question is not related to the idea that that database can easily be distributed on lots of machines
I want to use PHP to pull data from the PostgreSQL Database into my application. I wanted to know whether this method is secure, if my various users will have to interact with data.
I need to run a federated SPARQL query (containing SPARQL Service Clause) via Sesame 2.7. Any help/examples would be highly appreciated. Thanks
Hello, I am new to genmod and I am trying to calculate RR with an modified Poisson approach using robust error variance (by using the repeated statement and the subject identifier).
I have two questions:
1.) my output does not show me the output of the exp option on the estimate statement completely and not in the way as it was in other examples I am following. The xp estimate is missing, however L'Beta estimate and alpha etc are given ind the specific row. And I have no idea why? Can anyone help?
2.) Secondly, in other examples I found to ways to state the estimate statement "estimate 'beta' smoking 1/exp" and "estimate 'beta' smoking 1-1/exp" where exactly is the difference?
I will add my model below unfortunately the output cannot be displayed in a readable manner.
Thank you in advance!!
What is the most popular database model to be used in business? I would be grateful if I could have an evidences for the answer.
Microsoft access is a software example for relational databases. I need more examples for relational databases. I need also some more examples for Object oriented databases and XML databases.
I would like to use this paper as my reference in evaluating databases. Please help me.
Conference Paper Benchmarking Simple Database Operations.
While working with date variables in STATA, I encountered an interesting problem. There is a date variable and there are hundreds of other common variables. I have to find out the initial and ending date for each variables based on that single date variable. Such as, if a variable has all missing observations up to 1st December but at least 1 respondent responds on 2 December, then the initial date for that variable would be 2 December and so on. But I would like to request for clues about how this can be done for all the variables through STATA command (or any other software) and not by looking in the actual data which is gruesome for a large dataset.
In a genome database (nucleotide sequence) located in my computer, I would like to identify a given conserved domain, e.g., a given PSSM or a given Pfam.
For this purpose, I looked at PSI-BLAST and DELTA-BLAST, but they are protein-protein search tools, while I need searching a nucleotide database. Similarly, http://pfam.xfam.org/ allows searching protein-protein.
Is there any tool suitable for me to be used locally on my computer?
Hello Everyone,
I want to know how to get the DBLP and SIGMOD query set. If you know the links, please can you share me? But if it is not gained query set from the links,these tested query is created by yourself when the query is processed. Please share me.. Thank you all.
I just have a basic question- after doing various machine learning and data science, how can we know that we have got the most from our data? I know data is always important it is not outdated anytime in the future or present. But i want to know that at present after all processing how to know that we have completed getting analysis types on that data.
CBR is considered to be a methodology not a technology to use. Different applications and techniques can be used to find the similarities and make use of objects/cases within the case-library you have.
Such as CBR using fuzzy logic, using Rough-sets, similarity measures and maybe K-nearest neighbor. What about CBR using DB technology?
If we have changed the source data, then do we have to follow the same step for finding/generating the rules? Or change the method?
I have read a couple of articles which are trying to sell the idea that the organization should basically choose between either implementing Hadoop (which is a powerful tool when it comes to unstructured and complex datasets) or implementing Data Warehouse (which is a powerful tool when it comes to structured datasets). But my question is, can´t they actually go along, since Big Data is about both structured and unstructured data?
For writing review paper data is not currently available.
Sometimes it is very hard to find a lectotypification of old taxa. This is very time consumig and frustrating. A database compiling all designated lectotypes could be helpful. On the other hand, the code requires that lectotypification should be made my scientists knowing the taxon and working methods of the author of the taxon for which the lectotypification is done. If it is known that a taxon lacks a specific type this could lead to the automatic lectotypifications described in the code.
So, in the end I would like to have an impression whether in your opinion the disadvatages outbalance the benefits of such a database.
I want to test a new approach for periodicity extraction from real and synthetic images and I need an universal database to do it.
I need different heuristics for query optimization.
I have been teaching RDMBS in an undergraduate database class. I would like to teach Big Data also. I want to make a smooth transition from RDBMS to Big Data. Can you suggest a textbook or good material which would give me the chance to compare between RDBMS and BIG data? Please let me know.
I want to analyze the relationship between lots of genes/molecules in human, so I need the information of pathway.
To get the first hint about tissue distribution of a particular protein I usually refer to GeneCards. There you can find at least three different datasets from BioGPS, RNASeq (Illumina Body Map) and SAGE (Serial Analysis of Gene Expression). The results sometimes do not overlap: i.e. BioGPS and RNASeq show me a signal let´s say in the brain, whereas SAGE shows no expression in the brain at all. How should I interprete it and why is there such a drastic difference there?
Which tool/database/algorithm might ease the quest of searching the best gene/s to characterize a bacterial species?
Currently all giant companies are using Nosql database for serving their customers like HBase, Cassandra, Dynamodb , MongoDb , Google’s Big table . Most are open source and capable to handle Big Data requests but all these are still evolving so is there a need of standardization to maintain ACID property and rules for data accessing?