Science topic

Big Data - Science topic

In information technology, big data is a loosely-defined term used to describe data sets so large and complex that they become awkward to work with using on-hand database management tools.
Questions related to Big Data
  • asked a question related to Big Data
Question
2 answers
If I have multiple scenarios with multiple variables changing, and I want to conduct a full factorial analysis, how do I graphically show the results?
Relevant answer
Answer
It gives visually the change in value as changing the variables in study, a way you cannot easily diagnose from data table. You can see the volume of change !!. Regards.
  • asked a question related to Big Data
Question
1 answer
Hello respected all Researchers,
I will start my masters in September 2022. I am really confused about my master's major topic. There are two options for me now to select for my future study BIG Data or Computer Vision for my research work. I don't know which one I should take as a beginner. Please help !!! That I can start my study and choose a research direction. I anticipate hearing from you.
Thank you.......
mahfil
Relevant answer
Answer
Computer vision is a field of artificial intelligence (AI) that enables computers and systems to derive meaningful information from digital images, videos and other visual inputs — and take actions or make recommendations based on that information. If AI enables computers to think, computer vision enables them to see, observe and understand.
Computer vision works much the same as human vision, except humans have a head start. Human sight has the advantage of lifetimes of context to train how to tell objects apart, how far away they are, whether they are moving and whether there is something wrong in an image.
Big data is a combination of structured, semistructured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling and other advanced analytics applications.
Depending on the issue, you can choose one of these methods.
Regards,
Shafagat
  • asked a question related to Big Data
Question
2 answers
then the time column is showing as numbers, not as dates or different time steps. Please guide me how i can convert those nummbers into time steps or dates. Furthermore, if i compare the time column then 01-04-1959 08:00 AM is displaying as 519344.
Relevant answer
Answer
It doesn't look like UNIX. It looks like something that excel does when you convert a date field to number etc. Could be a badd formatting conversion
  • asked a question related to Big Data
Question
2 answers
Edge computing is a research hotspots, but I can not find any open data set of edge computing. Does anybody know any big data set available in literature for edge computing?
Relevant answer
Answer
It depends on the application you are using Edge computing for.
  • asked a question related to Big Data
Question
5 answers
I'm working on an update to our previous global geochemical database. At the moment, it contains a little over one million geochemical analyses. It contains some basic geochronology data, crystallization dates for igneous rocks and depositional dates for sedimentary rocks. The database differs from GEOROC and EarthChem, in that it includes some interpretive metadata and estimates of geophysical properties derived from the bulk chemistry. I'd like to expand these capabilities going forward.
What would you like to see added or improved?
Here's a link to the previous version:
Relevant answer
Answer
It should be able to categorize significant path finder elements for corresponding commodities with anomalous values.
  • asked a question related to Big Data
Question
2 answers
Big Data , personalization of E- learning Big Data , learner behavior . machine learning .
Relevant answer
Answer
thank u
  • asked a question related to Big Data
Question
2 answers
Electrochemical impedance has attracted more and more attention in recent years.However, due to the limitation of experimental conditions, data in this respect are very scarce.I would appreciate it if who can share papers or available data about EIS.
Relevant answer
Answer
不知道
  • asked a question related to Big Data
Question
8 answers
What is role of Big Data in digital marketing?
Relevant answer
I have he following good article related to this discussion thread:
  • asked a question related to Big Data
Question
3 answers
Dear users.
I need to construct the plots in excel and R.I've 2000 data and these are large numbers (millions, because this is profit).
Could you tell me please,how I can to do it in excel and r with a goal to look years 200-2005 on the horizontal axis?
Excel:=series(;'63'!$A$1:$A$204;'63'!$D$1:$D$346;1)
R:
data <- read.table(file = "2.txt", header = TRUE)
head(data)
plot(data,type = "l", col = "red")
Doesn't work correctly
Thank you very much
Relevant answer
Answer
I agree with other colleagues about using "ggplot2" library for data visualization. These links maybe useful
  • asked a question related to Big Data
Question
4 answers
My aim is to use six classifiers to test various ML tools and generate a model for each of them from the raw data ( Big analytic tools on the data set)
Relevant answer
You can perform an initial analysis to grasp an idea of which ML algo works best for you by using PYCARET(https://pycaret.org/) or making a pipeline using grid-search in scikit-learn(https://scikit-learn.org/).
I hope this helps you.
with bests,
Mansurbek Abdullaev
  • asked a question related to Big Data
Question
1 answer
All the conclusions of personalized medicine go through AI applications on enormous masses of biomedical information. Molecular data play a crucial role in obtaining metabolic models to be used for patient analysis. Data relating to proteins and their functions in almost all cases have to do with protein forms that have undergone PTMs (Post-translational modifications). We are speaking about 100,000 PTMs or so, for about 20.000 – 25,000 protein-coding genes. These numbers point to an estimate in humans of around 6 million protein species, that is, the human proteome. Obviously they are not all present at the same time but perform their function in different spatiotemporal contexts.
PTMs of proteins change the protein structure, its chemical-physical characteristics and makes possible new functions with specific molecular partners. The response of the modified protein to the environment also changes, because we are dealing with a new molecular form, with new properties. In a nutshell with a new molecule. From the number and types of potential sites for PTMs on a protein, it is possible to calculate how many molecular forms a single protein can produce. For example, 4 phosphorylation sites on a protein are enough to have 15 distinct combinations for 15 different molecular forms. In cell, each molecular form is generated by the specific space-time context in which it occurs, because only, and only in that cell context, it can exist with its specific functional role. So, when we want to analyze a molecular form experimentally, we should simulate as much as possible the metabolic context in which we think that function should take place, or in vivo studies we should extract and purify the protein from the tissue. Without context, we have inappropriate results on the molecular form because it is not identifiable in space and time. Thus the context should be explicitly reported in papers. Unfortunately, this is a very rare information. What commonly happens is that these data without spatio-temporal context flow into the databases and are used for network analysis, where we find them all collapsed on the native protein. This generates static metabolic models and most of the analyzes are therefore flawed with the possibility that the models used for personalized medicine may be wrong, with possible damage to patients. Another problem then arises, how to eliminate these errors from biomedical Big-data systems? A fundamental rule of Big-data systems is that in order to have reliable results the data must be characterized by a high index of Veracity. Today, this is not true.
What do supporters of personalized medicine think about?
Relevant answer
Answer
Personalized medicine may be considered an extension of traditional approaches to understanding and treating disease. Equipped with tools that are more precise, physicians can select a therapy or treatment protocol based on a patient’s molecular profile that may not only minimize harmful side effects and ensure a more successful outcome, but can also help contain costs compared with a “trial-and-error” approach to disease treatment. Personalized medicine has the potential to change the way we think about, identify and manage health problems. It is already having an exciting impact on both clinical research and patient care, and this impact will grow as our understanding and technologies improve
Personalized Medicine Is Impacting Patient Care in Many Diseases: For Example... …in Breast Cancer: One of the earliest and most common examples of personalized medicine came in trastuzumab. About 30% of patients with breast cancer have a form that over-expresses a protein called HER2, which is not responsive to standard therapy. Trastuzumab was approved for patients with HER2 positive tumors in 1998 and further research in 2005 showed that it reduced recurrence by 52% in combination with chemotherapy.1 …in Melanoma: BRAF is the human gene responsible for the production of a protein called B-Raf, which is involved in sending signals inside cells to direct cell growth, and shown to be mutated in cancers. In 2011, a drug called vemurafenib, a B-Raf protein inhibitor, and the companion BRAF V600E Mutation Test were approved for the treatment of late stage melanoma. Vemurafenib only works in the treatment of patients whose cancer tests positive for the V600E BRAF mutation. Around 60% of patients with melanoma have a BRAF mutation, and approximately 90% of those are the BRAF V600E mutation.2 …in Cardiovascular Disease: Prior to the development of a gene expression profiling test to identify heart transplant recipients’ probability of rejecting a transplanted organ, the primary method for managing heart transplant rejection was the invasive technique of endomyocardial biopsy – a heart biopsy. Today, a genetic diagnostic test is performed on a blood sample, providing a non-invasive test to help manage the care of patients post-transplant. New research suggests that ongoing testing may be useful in longer-term patient management by predicting risk of rejection and guiding more tailored immunosuppressive drug regimes
source : Personalized Medicine Coalition
  • asked a question related to Big Data
Question
4 answers
We are working on a collaboration project between Babeș Bolyai University in Cluj-Napoca and KPMG Romania, aiming to observe how the accounting profession is being transformed by the technological advancements.
Technologies under assessment are: Cloud Computing, Robotic Process Automation (RPA), Big Data and Data Analytics, Machine Learning (ML) and Artificial Intelligence (AI).
The questionnaire takes around 5 minutes and is anonymous. Your input would benefit us greatly.
Thank you for your time!
Relevant answer
Answer
Done
  • asked a question related to Big Data
Question
8 answers
How to connect data collected using IoT (Big Data) to a neural network? Example: 24 hours the patient's pressure is measured using the so-called portable holter. Data is transmitted via a sensor to the server in the form of Big Data. How to transfer data to the neural network?
Relevant answer
Answer
Thx a lot Faraed
  • asked a question related to Big Data
Question
7 answers
Modern politics is characterized by many aspects which were not associated with traditional politics. Big data is one of them. Data mining is being done by political parties as they seek help from data scientists to arrive at various patterns to identify behavior of voters. Question is, what are the various ways in which big data is being used by modern political parties and leaders?
Relevant answer
Answer
Big Data platforms allow government agencies to access large volumes of information that are essential for their daily operations. With real-time access, governments can identify areas that require attention, make better and more timely judgments about how to proceed, and enact the necessary changes.
  • asked a question related to Big Data
Question
4 answers
Over the last few months, I have come across several posts on social media where scientists/researchers even Universities are flaunting their ranking as per AD Scientific Index https://www.adscientificindex.com/.
When I clicked on the website, I was surprised to discover that they are charging a fee (~24-30 USD) to add the information of an individual researcher.
So I started wondering if it's another scam of ‘predatory’ rankings.
What's your opinion in this regard?
Relevant answer
  • asked a question related to Big Data
Question
3 answers
Hello everyone,
Could you recommend an alternative to IDC please to get records from the global datasphere for free?
Thank you for your attention and valuable support.
Regards,
Cecilia-Irene Loeza-Mejía
  • asked a question related to Big Data
Question
4 answers
Hello,
I'm a masters degree student and I am struggling to find a good thesis topic for my masters degree. I would really appreciate if you can help me.
As you know, biosystem engineering is a major where I can work on both mechanical engineering side of things and electrical/computer engineering side of things. Personally, I am interested in precision agriculture(electrical/computer side) and have academical experience on implementing computer vision models(Generally Deep Learning), analyzing and modeling big data(Generally Machine Learning) and deploying IOT applications.
Thank you for your time.
Relevant answer
Answer
Xiaoshun Qin Thank you for your in depth response. I have talked to my supervisor and he suggested that I work on smart poultry. He also suggested a specific topic in smart poultry for my thesis. I just wanted to know what other options I have so I can choose the one that best suits my skills and expertise.
  • asked a question related to Big Data
Question
2 answers
Dear All, I would like to ask, is it possible to obtain data in some databases, websites about sexual behavior in different countries of Europe or the World? Thank You! Best regards Stefan
Relevant answer
Answer
Hi Štefan,
I recommend that you contact the ISSM and the European Federation of Sexology (EFS) for more accurate information and data.
In the rest of the world, you can contact sexology academies and similar organizations.
I hope you obtain the necessary information.
Kind Regards,
  • asked a question related to Big Data
Question
4 answers
I am currently conducting a study on the effects of adopting IoT and Big data technologies in a manufacturing facility. I'm trying to get hold of data that would concern the change in the capacity of the plant, the maintenance costs, and OEE. I am aware that there is previous case studies on the matter but I am trying to quantify the change using real data. Does anybody know where I can find production data of a manufacturing facility I can use?
Relevant answer
Answer
You would have to get in touch with some of the companies and see if you can request the data. I would imagine not many will be fore coming. However, if you sign a NDA and you arent going to publicise were the data has come from then they may be willing to provide you with the data. Does your University have any collaborations or partnerships with any manufacturing companies. If they have it may be easier for you to go down that route.
Best Regards
Martin
  • asked a question related to Big Data
Question
5 answers
I have 2 seperate data sets that I need to combine, time point 1 and time point 2.
Not all participants in time point 1 are in time point 2 (i.e., attrition, etc.). So I will need to know how to match participants, and keep the duplicates. Too many instructional videos tell you how to remove duplicates.
Next, I need to structure of the time points to be on top of each other, by columns. Meaning, I need time 1 to be above time 2, not next to each other. Again too many videos tell you how to add rows, (i.e., dplyr left join, right join), its very hard to find those that teach you how to add by columns. I want the data structure to be suitable for longitudinal analysis, or at least some form of repeated measures - where adding data sets to left or right may not work.
Please help! Either in R or excel!
Relevant answer
Answer
You are looking for the functions merge and reshape. Lots of packages, like data.table, also have functions, but merge and reshape are the ones in R. Just look at their help files.
  • asked a question related to Big Data
Question
5 answers
Hello everyone. I have question about obtaining data from Internet.
In my research I will analyze comments from websites and social media platforms. And I am searching for applications/apps/technologies other tools to download comments from Internet to my computer.
Do you know any tools/apps to download comments for free?
There is around 10.000 comments and if I would copy/paste one by one it would take me a lot of time. I want to obtain data quickly.
Do you have any suggestions for me?
Thank you so much for help.
Regards, Nejc
Relevant answer
Answer
You could use a scraper as the faster method, but be mindful that there are many privacy laws on the internet about how content can be copied, downloaded, and analyzed. In general, just because it's online doesn't mean it's acceptable for download, analysis, or use. Check the country of each commenter and site, as well as the privacy policy of the site hosting the comments.
  • asked a question related to Big Data
Question
3 answers
Hello everyone. I have question about obtaining data from Internet.
In my research I will analyze comments from websites and social media platforms. And I am searching for applications/apps/technologies other tools to download comments from Internet to my computer.
Do you know any tools/apps to download comments for free?
There is around 10.000 comments and if I would copy/paste one by one it would take me a lot of time. I want to obtain data quickly.
Do you have any suggestions for me?
Thank you so much for help.
Regards, Nejc
Relevant answer
Answer
You have to be specific about which sociomedia you are referring to. There are specific R packages for twitter, FB, reddit etc. Do a google search and I am sure you will find one
  • asked a question related to Big Data
Question
3 answers
Good day,
I am looking into the application and risks associated with the application of big data in South Africa - specifically in terms of groundwater resource management?
I am looking into transboundary resource management but also curious on impact thereof on for example detecting illegal abstraction or resource depletion because of it as well as contamination detection.
Please share your thought or relevant articles?
Regards,
Cindy
Relevant answer
Answer
Dear Cindy,
In my opinion, an analyst based on Big Data Analytics is helpful in research works carried out in various topics and scientific disciplines. Big Data analytics is more and more often used to improve risk management systems and improve the processes of managing the organization of companies and enterprises. Big Data analytics is helpful in improving crisis management systems and solving key problems of the development of civilization, studying changes in the state of the biosphere and climate of the planet. I conduct research, among others in the issue of the possibility of using Big Data Analytics in research works carried out in various topics and fields of science. I have published my conclusions from the research in scientific publications that are available on the Research Gate portal. I invite all those who study this subject to research cooperation. In my opinion, the possibilities of using Big Data Analytics in various fields of research will increase in the coming years.
Best regards,
Dariusz
  • asked a question related to Big Data
Question
35 answers
The current technological revolution, known as Industry 4.0, is determined by the development of the following technologies of advanced information processing:
Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence, Business Intelligence and other advanced data mining technologies.
Which of these technologies are applicable or will be used in the future in the education process?
Please reply
Best wishes
Relevant answer
Answer
In this increasingly flexible and complex environment, collaboration takes on a more valuable role in HE, creating hybrid communities, doing hybrid work, using hybrid practices. We propose that HE institutions should not focus on academic disciplines alone. In fact, working collaboratively between academics and professional members of staff enhances and positively impacts organisational culture. By professional members of staff also engaging in research and education within their own job roles, while collaborating with academics to enhance the student experience, a more consistent approach can be achieved. What is more, this way of working and being helps cohesiveness and collaboration while empowering individuals as important members of the team....
  • asked a question related to Big Data
Question
10 answers
Can anyone suggest any ensembling methods for the output of pre-trained models? Suppose, there is a dataset containing cats and dogs. Three pre-trained models are applied i.e., VGG16, VGG19, and ResNet50. How will you apply ensembling techniques? Bagging, boosting, voting etc.
Relevant answer
  • asked a question related to Big Data
Question
4 answers
Good day house. Please I just updated my r to R 4.1.2 and then I started having issues with read_tsv() and read_csv().
It seems readr 2.1.2 is not compatible with R 4.1.2; because when I used read.csv (which is not readr function), I was able to import .csv file. But using read_tsv() and read_csv() which are both readr functions kept giving me error:
Error in app$vspace (new_style$`'margin-top' %||% 0) : attempt to apply non-function
Can anyone help with this or help me with another function to import .tsv file apart from read_tsv() in readr package?
Relevant answer
Answer
Good morning Olayinka,
I have installed both R 4.1.2 and readr 2.1.2 and it seems it works fine to me.
Some quick alternatives to try:
1. Uninstall and install the readr package back;
2. Try to use readr::read_tsv() or readr::read_csv();
3. Try the read.csv2( ..., sep = '\t' ) function from the utils package;
4. Try the fread() function from the data.table package
Hope it helps.
Best,
Luca
  • asked a question related to Big Data
Question
3 answers
In my upcoming research on Big Data architecture, I'd like to make use of data from some of the best conferences I've attended ( practice-led conferences not academic ones )
What's the most rigorous methodology to capturing data from a video in academia ?
Relevant answer
Answer
Dear Pouya Ataei,
I think you have got your answer. Ajit Singh has explained the most preferred methodology for capturing data from video and also the procedure to analyze video data elaborately and very clearly.
Best wishes,
Razina Sultana
  • asked a question related to Big Data
Question
5 answers
Hi, I'm currently on my master's at the University of Bradford studying "Applied Artificial Intelligence and Data Analytics" and I was looking for capstone dissertation topics related to AI and data analytics. So if anyone has a few suggestions for research topics I would love to see some.
Relevant answer
Answer
Chiemela Tobechukwu Frank .You may want to consider these areas:
2D analysis for big data, big information, big knowledge and big wisdom as a dimension, and descriptive analytics, diagnostic analytics, predictive analytics, prescriptive analytics as another dimension based on AI and ML to form a unified framework of intelligent analytics.
  • asked a question related to Big Data
Question
10 answers
Hi All,
What do you find is the best solution for big data such as NGS sequences storage options that would allow easy transfer to and forth to university's HPC server?
Do you have your own local server in the group? Loads of hardrives? Google cloud? Commercial clouds?
Thanks in advance!
Relevant answer
Answer
In modern era of data processing following storage system can be used:
1. Network Attached Storage (NAS)
2. SSD Flash Drive Arrays
3. Hybrid Flash Arrays
4. Hybrid Cloud Storage
5. Backup Software
6. Cloud Storage
7. Software-Defined Storage
8. Storage Virtualization
9. Hyperconverged Storage (HCS)
10 Artificial Intelligence (AI) based storage system.
Best Wishes
Dr. Ashwani Kumar
  • asked a question related to Big Data
Question
8 answers
hey guys, I'm working on a new project where I should transfer Facebook ads campaigns data to visualize in tableau or Microsoft power BI, and this job should be done automatically daily, weekly or monthly, I'm planning to use python to build a data pipeline for this, do you have any suggestions or any Resources I can read or any projects similar I can get inspired from ? thank you .
Relevant answer
Answer
To create an ETL pipeline using batch processing, you must first:
1. Construct reference data: create a dataset that outlines the range of possible values for your data.
2. Extract data from various sources: Correct data extraction is the foundation for the success of future ETL processes.
  • asked a question related to Big Data
Question
5 answers
In CSE, What are the recent research in Energy Management System Using Big data??
Big data Characteristic defines 5 Vs. i am planning to take velocity ( IoT sensor data) , so anyone suggests recent research in Energy Management systems with big data and IoT.
  • asked a question related to Big Data
Question
3 answers
research topic > masters degree> big data> medicine/ health
Relevant answer
Answer
  1. Scalability — Scalable Architectures for parallel data processing
  2. Cloud Computing Platforms for Big Data Adoption and Analytics — Reducing the cost of complex analytics in the cloud
  3. Real-time big data analytics — Stream data processing of text, image, and video
  4. Security and Privacy issues
  5. Quantum computing for Big Data Analytics
  6. Efficient storage and transfer
  7. How to efficiently model uncertainty
  8. Graph databases
  • asked a question related to Big Data
Question
15 answers
Commercial banks are increasingly worried about competition from fintechs, including online technology companies that expand the range of financial and pre-financial services. Commercial banks are more and more actively using IT technologies of online banking, building Business Intelligence data processing platforms, extending Big Data database systems, developing integrated risk management systems and conducting advertising campaigns on social media websites. In view of the above, large commercial banks have the opportunity to conduct a sentiment analysis on data collected in Big Data database systems for the purpose of analyzing the expectations and opinions of Internet users regarding, for example, financial services. Information obtained from the Internet and processed in the aforementioned manner can be used for more precise risk analysis, credit risk management, planning subsequent advertising campaigns, modifying the financial services offer in line with changing expectations of Internet users, searching for clients on social media portals. In this way, interdisciplinary analytical processes are also developed at commercial banks, for which the information from the websites of social media portals is the source of data.
Do commercial banks have a chance to win in this matter in competition with the fintech technology companies operating on the Internet?
Besides, What is the effectiveness of online advertising campaigns run by commercial banks?
Please, answer, comments.
I invite you to the discussion.
Relevant answer
Answer
Dear Denis Muchunku, Haseeb Javed,
Yes. Internet advertisements are used more and more often in advertising campaigns also by financial institutions, including commercial banks presenting their offers of banking products and financial services as well as internet mobile banking offer. During the SARS-CoV-2 (Covid-19) coronavirus pandemic, the development of electronic internet banking, including mobile banking, accelerated. Therefore, commercial banks have recently been developing mainly online mobile banking for citizens, individual clients and business entities. Recently, many banks have been conducting advertising campaigns using new online media, including social media portals, to promote their online banking offers, also offered to companies and enterprises. Banks offer the opening of an online banking account primarily for business entities from the SME sector that do not yet have a mobile banking account, do not have their own website, are startups, etc. In promotional online banking offers for companies and enterprises from the SME sector, commercial banks offer additional incentives and incentives. auxiliary services creating a website for the company, creating an online platform for selling products and / or services of the client's enterprise, creating an online store, they also offer tax advisory services, financial advisory services, etc. Banks more and more often offer their financial services through social media portals, because of the research conducted market know that their customers are increasingly actively using these new online media and that these online marketing communication channels can be the most effective.
Thank you very much,
Best regards, Greetings,
Have a nice day,
Be safe and healthy,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
4 answers
Does anybody know a solver for a large scale sparse QP that works on the GPU?
Or, more in general, can a GPU speed up solvers for sparse QPs?
  • asked a question related to Big Data
Question
3 answers
Phylogeny analysis seems beyond my capacity right now, however, there are so much information in my dataset of seed traits along with climate, phenology, any suggestion will be appricitated.
Relevant answer
  • asked a question related to Big Data
Question
7 answers
1. what are the high-quality research monographs (or books) on artificial intelligence or data science or big data analytics? I hope to have 3 recommendations for each.
2. What are the important technologies or techniques developed in the past 10 years since 2021? I hope to know 5 of them. Please do not mention deep learning.
Thank you
2022-1-12
Relevant answer
Answer
Interesting query
  • asked a question related to Big Data
Question
5 answers
How can a scientist learn about data science and modelling that is applicable to solve wide varieties of problems. If possible contact of where you can do that even as a visiting researcher ?
Relevant answer
Answer
  • asked a question related to Big Data
Question
4 answers
dear community, I need some sources for some data science project or machine learning project related to analyzing the google analytics and Facebook business data , your help is appreciated.
  • asked a question related to Big Data
Question
11 answers
As it is known, Artificial Intelligence, Machine Learning and Deep Learning methods are used today to produce meaningful information from big data. So how can the MCDM approach be integrated into such a structure? Is it healthy to use MCDM in big data?
Relevant answer
Answer
Dear Anas
I refer to your question: ' We are looking for new research based on the novel MCDM method for solving actual problems'.
Do you know SIMUS? It is rather new (2011); it does not need any kind of weights or assumptions, and can solve complex problems because it is based on a completely different approach, grounded on Linear Programming, and it is free. Some of these solved problems are published in RG, journals, and Scopus
I suggest going to its webpage and learning about it and what it can offer.
Enter with any browser in: www.simus.online
  • asked a question related to Big Data
Question
5 answers
Software Experts: Ever wanted to write a book? Here's an opportunity close to it that you may not want to miss. Please see
for more details.
Relevant answer
Answer
Thank you.
  • asked a question related to Big Data
Question
5 answers
Hello everyone,
I am looking for links of scientific journals with dataset repositories.
Thank you for your attention and valuable support.
Regards,
Cecilia-Irene Loeza-Mejía
Relevant answer
Answer
Dear Cecilia-Irene Loeza-Mejía
I think you should have a look at the site «re3data: Registry of Research Data Repositories» (https://www.re3data.org).
There you will find the following search/browsing options: Browse by content type Browse by subject Browse by country
When you choose "Browse by content type", you will get "Raw data" or "Scientific and statistical data formats" (among others): https://www.re3data.org/browse/by-content-type/.
With best regards Anne-Katharina
  • asked a question related to Big Data
Question
6 answers
Big data is a new trend in the Technology field, it has many applications in Education especially in analysis students performance if the teacher using LMS.
my question about how we can make Big dat benefit for us in Mathematics Education ?
Relevant answer
Answer
Thank you for sharing this research
  • asked a question related to Big Data
Question
28 answers
AI and Big Data have recently seen widespread application in virtually every field. With the economy's increasing digitization, it is expected that massive amounts of data will be generated at every node. I wonder if primary data based research in consumer behavior, economics, agricultural economics, and related fields will become obsolete in the future as more sophisticated models aided by AI and Big Data provide a more accurate picture of various phenomena. Please share your thoughts on what will be the role of researchers in applied economics, business, and marketing etc (not including those in the fields of computer science).
Relevant answer
Answer
Whereas, Big data tends to answer the What, Who, Where, When and How Much questions. In contrast, to amplify the point made by James, there is no substitute for understanding how and why things are happening. Qualitative data methods have already become more important as we seek to explain the patterns that emerge from all the big data analytics ... and to understand their implications
  • asked a question related to Big Data
Question
11 answers
I have devoted time to the pricing of "dataset" asset access, a form of licensing? you can read more here [1].
I have also addressed an approach to structuring data in conjunction with a structural graph of relationships (like the maritime and other routes between two harbours), where the notion of set is leveraged upon for its flexibility (you aggregate what you want in a set, homogeneity not required unlike with vectors), and the structure of matrix is used for showing relations between nodes i and j. [2]
I am now exploring specific application fields, such as recommendations based on collaborative filtering from observed user behaviours towards items, and how to generalise Fuzzy Cognitive Maps (FCM) which overlay thin information on graphs, as matrices of sets do with thicker possibilities.
Another area I am exploring with the tool framework [2] is how to handle the cases where information is heterogenously available over time t and space x (for instance lots of information in data set D(t1,x1) relative to (t1,x1) but maybe less at (t2,x2) or (t1,x2) or (t2,x1)...
This can be expressed by the data gathered/observed at time t and position x as D(t,x), whereas the complete data which would ideally describe the details of what is happening at (t,x) might be C(t,x) which contains set D(t,x) but may be larger than D(t,x).
Have you encountered cases similar to the ones mentioned above? Can you give details, maybe references?
Ref:
Relevant answer
Answer
  • asked a question related to Big Data
Question
1 answer
I have 4 varieties 30 treatments with 3 levels which type of graphs is suitable to express results ??
Relevant answer
Answer
Have you considered structuring your data under a matrix of set: you have a single set of nodes/vertices, and you generalise the edges which are sets M(i,j) containing all information you can gather between i and j?
Feel free to read and feed back :
  • asked a question related to Big Data
Question
7 answers
what are the recent reach in sustainable development using big data?
Relevant answer
Answer
Dear Subha,
Answering the above question: What are the recent reach in sustainable development using big data? - I state that research and analytical techniques improved through the use of Industry 4.0 technology, including the use of analytical platforms Big Data Analytics, Data Science, etc. are already used in improving forecasting models for long-term climate changes, analyzing changes in the state of the biosphere on individual continents and oceans, forecasting future climatic, geological and natural disasters, changes in the state of environmental pollution, changes in state the sustainability of the relationship between the impact of the development of civilization and the biodiversity of natural ecosystems, etc.
Best regards,
Dariusz
  • asked a question related to Big Data
Question
9 answers
What will be the future applications of analytics of large data sets conducted in the computing cloud on computerized Business Intelligence analytical platforms in Big Data database systems in enterprise logistics management?
The analytics conducted on computerized Business Intelligence platforms is one of the key advanced information technology technologies of the fourth technological revolution, known as Industry 4.0. The current technological revolution described as Industry 4.0 is determined by the development of the following technologies of advanced information processing: Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence, Business Intelligence and other advanced data mining technologies.
The analytics conducted on computerized Business Intelligence platforms currently supports business management processes, including logistics management.
In my opinion, the use of analytics of large data sets conducted in the computing cloud on computerized Business Intelligence analytical platforms in Big Data database systems in enterprise logistics management, including supply logistics, production logistics, provision of services and distribution of manufactured products and services, is currently growing.
The analytics conducted on large data sets conducted in the cloud computing on Business Intelligence computerized platforms in Big Data database systems makes it particularly easy to identify opportunities and threats to business development, allows for quick generation of analytical reports on selected issues in the economic and financial situation of the business entity. In this way, the generated reports can be helpful in the processes of enterprise logistics management, including supply logistics, production logistics, provision of services and distribution of manufactured products and services.
Do you agree with my opinion on this matter?
In view of the above, I am asking you the following question:
What will be the future applications of analytics of large data sets conducted in the computing cloud on computerized Business Intelligence analytical platforms in Big Data database systems in enterprise logistics management?
Please reply
I invite you to the discussion
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications:
I invite you to discussion and cooperation.
Best wishes
Relevant answer
Answer
It is rising field since intelligence and in general artificial intelligence becomes the dominant technology of current era
  • asked a question related to Big Data
Question
6 answers
My essay is an attempt to answer the following : « Is the data economy, then, destined to benefit only a few elite firms? » Apparently that would be the issue till now. What are available tools to avoid this false target ? Reference to my essay on Stochastic Models in particular the section « Handling human social technical dimension; in particular man-system interface including positioning technology at man services » you may find guidelines to produce these tools and make BIG DATA exploitable by large majority of users : 1. Engine should trace “player” behaviour, evaluate its capabilities and quickly meet its needs. 2. Immersion generated by simulation enables training and experimentation of behaviour strategies, in particular learning “by doing”. 3. Engine should use following resources : 3.1. Tools to be customized by trainers. 3.2. Applied standards. 3.3. New learning approaches discovery through obtained results, whether these approaches are positive or negative, in the sense of improving technology performance of assembled prototypes. 4. How SPDF (Standard Process Description Format) may produce a universal engine to run the stochastic model ? 4.1. SPDF consists of two parts : 4.1.1. Message structured-data part (including semantics) and, 4.1.2. Process description part (with higher level of semantics). 4.2. Two key outputs of the SPDF research will be a process description specification and framework for the extraction of semantics from legacy systems. 4.2.3. Note that : a)The more we may have semantic rules the more unpredictable events are controlled. b) Acquired knowledge to elaborate semantic rules for unpredictable events requires many occurrences of the stochastic model. c) Convergence shall not be reached until getting more qualitative semantic rules. d) Performing dynamically a given scenario is the goal of the proposed messaging system.
Relevant answer
Answer
To start our collaborative work, I'll let you propose a case study and we shall try together to apply the knowledge acquired through modelling: what are the challenge facing humanity today: 1) covid 19, 2) Climate change 3) wars in the middle east? but I won't accept to make you select one of these three proposals, since collaborative work requires to share our knowledge equally, should we succeed that should be a great achievement, thank you
  • asked a question related to Big Data
Question
9 answers
The current technological revolution, known as Industry 4.0, is determined by the development of the following technologies of advanced information processing: Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence, Business Intelligence and other advanced data mining technologies.
In connection with the above, I would like to ask you:
Which information technologies of the current technological revolution Industry 4.0 to the greatest extent support the enterprise management process?
Please reply
Best wishes
Relevant answer
Answer
In my opinion, in recent years the implementation of Big Data Analytics technologies, the Internet of Things, learning machines and artificial intelligence to the business activities of companies and enterprises has been increasing. improving business management systems. During the SARS-CoV-2 (Covid-19) coronavirus pandemic, the scale of digitization and internetization of economic processes increased. As part of this increase in digitization and internationalization, many manufacturing, commercial, technological, etc. companies implemented investments in the implementation of new information technologies, ICT and Industry 4.0, in order to improve specific spheres of their business activity. As part of these investments, i.a. in the IT systems of enterprises, computerized systems of digital twins are built, in which the entire production processes, logistics processes, and the functioning of machines and devices are digitally built. The digital twin systems built in this way support the systems of production process management, production logistics, supply and delivery logistics, distribution logistics, offering services, etc.
Greetings,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
5 answers
Big Data Analytics (Undergraduate or Postgraduate Course)
It usually have
Hadoop and Spark as its main outline with Hive and HBase as Distributed Data warehouse and Distributed Database examples.
Anyone who is teaching this course with LABS for VM, Docker or Kubernetes based Labs for Hadoop and Spark from Single Node Cluster to Multiple Nodes Cluster configurations and some example Labs starting from Word Count distributed / parallel processing run on multiple nodes.
Please share any resources / labs / tutorials. Thanks
#BigData #dataanalytics #BigDataAnalytics #Hadoop #Spark #kubernetes #Docker #virtualmachines #Hbase #Hive
Relevant answer
Answer
  • asked a question related to Big Data
Question
16 answers
The goal of predictive analysis is to develop predictions for the development of complex, multifaceted processes in various fields of science, industry, economy or other spheres of human activity. In addition, predictive analysis may refer to objectively performing processes such as natural phenomena, climate change, geological, cosmic etc.
Predictive analysis should be based on taking into account in the analytical methodology possible the most modern prognostic models and a large amount of data necessary to perform the most accurate predictive analysis. In this way, the result of the prediction analysis performed will be the least subject to the risk of analytical error, ie an incorrectly designed forecast.
Predictive analysis can be improved by using computerized modern information technologies, which include computing in the cloud of large data sets stored in Big Data database systems. In the predictive analysis, Business Intelligence analytics and other innovative information technologies typical of the current fourth technological revolution, known as Industry 4.0, can also be used.
The current technological revolution known as Industry 4.0 is motivated by the development of the following factors:
Big Data database technologies, cloud computing, machine learning, Internet of Things, artificial intelligence, Business Intelligence and other advanced data mining technologies. On the basis of the development of the new technological solutions in recent years, dynamically developing processes of innovatively organized analyzes of large information sets stored in Big Data database systems and computing cloud computing for the needs of applications in such areas as: machine learning, Internet of Things, artificial intelligence, Business Intelligence are dynamically developing.
For the abovementioned application examples, one can add predictive analyzes of subsequent, other fields of application of advanced technologies for the analysis of large data sets such as Medical Intelligence, Life Science, Green Energy, etc. Processing and multi-criteria analysis of large data sets in Big Data database systems is carried out according to V4 concepts, ie Volume (meaning a large number of data), Value (large values of certain parameters of the analyzed information), Velocity (high speed of new information) and Variety (high variety of information).
The advanced information processing and analysis technologies mentioned above are used more and more often for the needs of conducting predictive analyzes concerning, for example, marketing activities of various business entities that advertise their offer on the Internet or analyze the needs in this area reported by other entities, including companies, corporations, institutions financial and public. More and more commercial business entities and financial institutions conduct marketing activities on the Internet, including on social media portals.
More and more public institutions and business entities, including companies, banks and other entities, need to conduct multi-criteria analyzes on large data sets downloaded from the Internet describing the markets on which they operate, as well as contractors and clients with whom they cooperate.
On the other hand, there are already specialized technology companies that offer this type of analytical services, including offering predictive analysis services, develop custom reports, which are the result of multicriteria analyzes of large data sets obtained from various websites and from entries and comments. contained on social media portals based on sentiment analyzes of the content of entries in the comments of Internet users.
Do you agree with my opinion on this matter?
In view of the above, I am asking you the following question:
How can you improve the process of predictive analysis?
Please reply
I invite you to discussion and scientific cooperation
Dear Colleagues and Friends from RG
The key aspects and determinants of the applications of modern computerized information technologies for data processing in Big Data and Business Intelligence database systems for the purpose of conducting predictive analyzes are described in the following publications:
I invite you to discussion and cooperation.
Best wishes
Relevant answer
Answer
Dear Alexander Kolker,
Thank you very much for your answer and pointing to the important aspects of predictive analytics in business and the use of Big Data Analytics in these analyzes.
Thank you very much,
Best regards,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
42 answers
How to obtain currently necessary information from Big Data database systems for the needs of specific scientific research and necessary to carry out economic, business and other analyzes?
Of course, the right data is important for scientific research. However, in the present era of digitalization of various categories of information and creating various libraries, databases, constantly expanding large data sets stored in database systems, data warehouses and Big Data database systems, it is important to develop techniques and tools for filtering large data sets in those databases data to filter out of terabytes of data only information that is currently needed for the purpose of conducted scientific research in a given field of knowledge, for the purposes of obtaining answers to a given research question and for business needs, eg after connecting these databases to Business Intelligence analytical platforms. I described these issues in my scientific publications presented below.
Do you agree with my opinion on this matter?
In view of the above, I am asking you the following question:
How to obtain currently necessary information from Big Data database systems for the needs of specific scientific research and necessary to carry out economic, business and other analyzes?
Please reply
I invite you to the discussion
Thank you very much
Dear Colleagues and Friends from RG
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications:
I invite you to discussion and cooperation.
Best wishes
Relevant answer
Answer
Respected Doctor
Big data has three characteristics as follows:
1-Volume
It is the volume of data extracted from a source, which determines the value and capabilities of the data to be classified as big data, and by the year 2020, cyberspace will contain approximately 40,000 megabytes of data ready for analysis and information extraction.
2-Variety
It means the diversity of extracted data, which helps users, whether they are researchers or analysts, to choose the appropriate data for their field of research and includes structured data in databases and unstructured data (such as: images, clips, audio recordings, videos, SMS, call logs, and data). Maps (GPS), and require time and effort to prepare them in a suitable form for processing and analysis.
3-Velocity
It means the speed of producing and extracting data and sending it to cover the demand for it. Speed is a crucial element in making a decision based on this data, and it is the time we take from the moment this data arrives to the moment the decision is made based on it.
There are many tools and techniques that are used to analyze big data, such as: Hadoop, Map Reduce, HPCC, but Hadoop is one of the most famous of these tools. Big data is on several devices and then distributes the processing process to these devices to speed up the processing result and is returned or called as a single package. Tools that deal with big data consist of three main parts:
1- Data mining tools
2- Data Analysis Tools
3- Tools for displaying results (Dashboard).
Its use also varies statistically according to the research objectives (improving education, effectiveness of decision-making, military benefit, economic development, health management ... etc.).
greetings
Senior lecturer
Nuha hamid taher
  • asked a question related to Big Data
Question
7 answers
Sometimes the advance technologies like CT Scan and MRI are available. But the competency to read the results is lack. According to big data technology, is it possible to construct "translation machine" or "scanner" for CT Scan or MRI "graphs/ pictures" into diagnostics statement that can be understood for the common ones ?
Relevant answer
Answer
Thank you for your information Rafal Z Slapa
  • asked a question related to Big Data
Question
16 answers
Hi, I am currently starting to work my degree and I am having trouble coming up with a good research topic. If possible I would like to include IoT, AI and Big Data. Any suggestions?
Thank you very much.
Relevant answer
Answer
APM 4.0: Transforming Decision Making about Plants and Assets with Role-Based Applications
Artificial intelligence (AI) now overtakes and advances automation to improve productivity. But it will not continue to happen in the same way we see it experimentally executed on the frontiers of data science today...
  • asked a question related to Big Data
Question
3 answers
Dear Sir/Madam,
I trust you are keeping safe.
My name is Martin a masters student at Nottingham Business School.
I need your help on a questionnaire survey related to usage of big data analytics in the investment management (or asset management) industry.
Please find more details on the Participant Information Sheet (PIS) in the link below (1) and if you agree to participate please find the survey in below link (2)
(1) PIS link :
Thank you in advance !
Regards,
Martin
Relevant answer
Answer
You can use LinkedIn also.
  • asked a question related to Big Data
Question
2 answers
Hey!
I tried out Knime to save some time because some of my evaluation processes include a lot of copy-pasting of data into the right format for me.
I created a workflow that helped me now. What would be great is if the results file would be integrated in the original Excel file as a new work sheet. Can anyone explain to me how to do this? Is it possible at all? Right now I have an Excel writer node create a new file with the results.
Also I usually have several files and already found that If they are in the same folder i can execute the same routine on all files in the folder. But then it can not be processed because 1) i only have 1 output file and 2) some data sets are different (too good---without error message rows) and give the error that the respective rows are missing.
So if you have an idea how i can make the respective changes please help.
Attachted a file how my original data looks like ("Results") and the results i got with Knime copied into the next work sheet (sorting, getting rid of duplicates and rows i don't need). Also tried to export the workflow so you can have a look. I execute the writer at the bottom most of the time when i saw at the machine that the duplicates are ok.
Relevant answer
Answer
Thank you for this collection. I'll have a look and see if one of them offers a solution.
Best,
Juliane
  • asked a question related to Big Data
Question
36 answers
What are the important topics in the field: Data analysis in Big Data database systems?
What kind of scientific research dominate in the field of Data analysis in Big Data database systems?
Please reply. I invite you to the discussion
Dear Colleagues and Friends from RG
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications:
I invite you to discussion and cooperation.
Best wishes
Relevant answer
Answer
Dear B. Dr. Ravishankar,
Today for the answer. Yes, you have indicated a key aspect that determines many of the currently developed analytical applications of Big Data Analytics technology.
Thank you very much,
Best wishes,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
8 answers
What are the best research topics for doining PhD in Big data?
Relevant answer
Answer
Dear Subha Mani,
I propose the following research topics in the field of Big Data Analytics technology applications:
1. Analysis of Internet users' sentiment to study the changing opinion of citizens on specific topics, company brands, product and service offers of enterprises, changing customer preferences, etc. based on the analysis of entries, comments, posts, etc. posted on social media portals.
2. Analysis of the company's or enterprise's competitive environment based on the verification of multiple data records relating to various categories of information obtained from the Internet.
3. Analysis of changes in the level of sales, distribution logistics, logistics of deliveries and supplies in an enterprise operating on various markets, cooperating with many points of sale and many recipients of products or services and suppliers of components or prefabricates in order to improve logistics processes.
Best regards,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
14 answers
Below are some issues related to Big Data database technologies that can be developed scientifically:
- Application of data processing technology in Big Data database systems for modern education 4.0,
- Improvement of forecasting of natural, climatic, economic, economic, financial, social etc. phenomena based on analyzing large data sets,
- Analysis of sentiment, opinions of citizens, Internet users regarding brand recognition of companies, customer reviews of specific services and products, views on various topics, citizens' worldview based on the analysis of large collections of information downloaded from various websites, from comments downloaded from social media portals,
- Analysis of information and marketing services of commercially operating companies that carry out specific analyzes of sentiment, citizens' opinions, Internet users regarding brand recognition, customer reviews of specific services and products etc. on behalf of other companies that purchase specific analytical reports,
- Analysis of the possibilities of cooperation, synergy, correlation, conducting interdisciplinary research, connecting Big Data database systems with other information technologies typical for the development of the current fourth technological revolution called Industry 4.0, which include technologies such as: cloud computing, machine learning, Internet of Things, Artificial Intelligence, etc.
In what other areas are the technologies of processing and analysis of information in Big Data database systems used?
Please answer
Best wishes
Dear Colleagues and Friends from RG
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications:
I invite you to discussion and cooperation.
Best wishes
Relevant answer
Answer
Dear Srdjan Atanasijevic,
Yes. You pointed to the important conditions for the development of big data analysis technology with the use of Big Data Analytics.
Thank you, Regards,
Dariusz Prokopowicz
  • asked a question related to Big Data
Question
4 answers
Thank you so much
Relevant answer
Answer
How is Six Sigma used in service industry?
Using the DMAIC (Define – Measure – Analyze – Improve – Control) methodology, Six Sigma helps in implementing quality in any industry by reducing defects. The defects are first identified, data is collected as to how the defects occur, and then a new method of working is implemented to reduce errors in the future
  • asked a question related to Big Data
Question
3 answers
Hello fellow researchers,
I am currently dealing with very large data sets of SNPs (more than 2 million) to investigate whether GWAS significant SNPs are more frequently located within certain genomic regions than non-significant SNPs. I have a 2x2 table stating the absolute number of SNPs in the significant vs. non-significant group that are either located within this specific region or not. Now, I obviously need to check my results for statistical significance, which initially I have done with the Chi-square test. But because I have so high numbers, every investigation is (putatively) statistically significant. I know that some publications just state the Cramer's V as an additional indicator, but I would rather have something alternative to use (if it exists). So do any of you know good alternative tests or methods to deal with these high numbers without this large sample size bias? How do you normally deal with these huge samples sizes?
I would be grateful for any tip or advice.
Thank you!
Relevant answer
Answer
Luisa -
You noted that "But because I have so high numbers, every investigation is (putatively) statistically significant."  That is the problem with the one-size-fits-all p-value "significance" levels (say 0.05 or 0.01) that were originally considered for industrial experiments as an indicator (by Fisher) of whether or not an investigation should proceed.  It was flawed from the beginning because any measure that changes with sample size (not just estimated better with a larger sample size, but changes) has to be interpretable in some context.  You need an idea here of "effect size."  I do not know your subject matter, but often a p-value is just a step too far for a very practical usage.  It is often considered good for decision making, but I argue that.  An automatic yes/no decision made for you may seem attractive, but you really need as much information as possible, often enhanced greatly by graphics, to make a more informed decision.  A standard error, although it also changes with sample size, is more easily interpretable, especially if you can generate confidence intervals.  And graphics can convey a variety of information and stimulate further analysis. 
The real question is How much of a difference makes a real, practical difference to your application?
I have worked with very small samples, and urged people not to just look at an isolated p-value, where they tend to be large, just because of small sample sizes.  I've seen industry try to claim a standard was met, just because there was too little information to show that it wasn't.  I know that large data sets have the opposite problem.  A lone p-value is rather meaningless, ​in either case.  
Also, in favor of 'measurement' rather than hypothesis testing at all, we often do not really want to know "Is something present, yes or no?" But rather "How much is present?" which could include little or none.  The example I have in mind for that is the hypothesis tests for heteroscedasticity in regression.  Why even do one?  You really want to know the impact, especially on variance.  If you do an hypothesis test, then what?  If you decide "yes" it is present, then you need to do something about it.  If "no," then it can still impact results and you won't know it.  But if you just estimate the coefficient of heteroscedasticity and use that in the model to model the heteroscedasticity, then you don't have to guess how much difference it would have made (and different size predicted values should be associated with different size sigma for residuals, which impact prediction intervals). 
OK, that was a long aside, but whatever your application, I think making good decisions is actually aided by studying sigmas not p-values.   A sigma can be compared to the parameter.  Some hypotheses, such as multiple sample comparisons may not be so easily restructured, but you still just do not get good information from a single p-value in isolation.  Perhaps you could research "size effect."
Cheers - Jim
  • asked a question related to Big Data
Question
6 answers
Some studies say that the Random forest method could be the best. But I'd like to get more opinions since many people seem to be using many different methods. It would be nice if someone could provide any resources for carrying out the methods too (Tutorials, R code, etc)
Relevant answer
Answer
Hi Joshua
You don't need opinions as you can solve the problem without relying on other people's preferred methods of imputation.
The process is: take your full data and remove (say) 2% of it at random. Then try several imputation methods to replace those values. Compare predictions to known observations. The method with the smallest error is the one to use when imputing your genuinely unknown values.
  • asked a question related to Big Data
Question
9 answers
Hello Everyone, I want to find out if there is a way to do (remote) scientific collaboration in the field of Big Data Analytics, Machine Learning/Deep Learning. The goal is only to learn and to enhance my publication list and my portfolio of projects.
I have a PhD in Big Data Analytics. I have worked a lot on Big Data/Deep Learning but I am available to work on any filed.
Thanks in advance,
Relevant answer
Answer
Please have look on our(Eminent Biosciences (EMBS)) collaborations.. and let me know if interested to associate with us
Our recent publications In collaborations with industries and academia in India and world wide.
EMBS publication In association with Universidad Tecnológica Metropolitana, Santiago, Chile. Publication Link: https://pubmed.ncbi.nlm.nih.gov/33397265/
EMBS publication In association with Moscow State University , Russia. Publication Link: https://pubmed.ncbi.nlm.nih.gov/32967475/
EMBS publication In association with Icahn Institute of Genomics and Multiscale Biology,, Mount Sinai Health System, Manhattan, NY, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29199918
EMBS publication In association with University of Missouri, St. Louis, MO, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30457050
EMBS publication In association with Virginia Commonwealth University, Richmond, Virginia, USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852211
EMBS publication In association with ICMR- NIN(National Institute of Nutrition), Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/23030611
EMBS publication In association with University of Minnesota Duluth, Duluth MN 55811 USA. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852211
EMBS publication In association with University of Yaounde I, PO Box 812, Yaoundé, Cameroon. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30950335
EMBS publication In association with Federal University of Paraíba, João Pessoa, PB, Brazil. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30693065
Eminent Biosciences(EMBS) and University of Yaoundé I, Yaoundé, Cameroon. Publication Link: https://pubmed.ncbi.nlm.nih.gov/31210847/
Eminent Biosciences(EMBS) and University of the Basque Country UPV/EHU, 48080, Leioa, Spain. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27852204
Eminent Biosciences(EMBS) and King Saud University, Riyadh, Saudi Arabia. Publication Link: http://www.eurekaselect.com/135585
Eminent Biosciences(EMBS) and NIPER , Hyderabad, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29053759
Eminent Biosciences(EMBS) and Alagappa University, Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30950335
Eminent Biosciences(EMBS) and Jawaharlal Nehru Technological University, Hyderabad , India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/28472910
Eminent Biosciences(EMBS) and C.S.I.R – CRISAT, Karaikudi, Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237676
Eminent Biosciences(EMBS) and Karpagam academy of higher education, Eachinary, Coimbatore , Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237672
Eminent Biosciences(EMBS) and Ballets Olaeta Kalea, 4, 48014 Bilbao, Bizkaia, Spain. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29199918
Eminent Biosciences(EMBS) and Hospital for Genetic Diseases, Osmania University, Hyderabad - 500 016, Telangana, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/28472910
Eminent Biosciences(EMBS) and School of Ocean Science and Technology, Kerala University of Fisheries and Ocean Studies, Panangad-682 506, Cochin, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27964704
Eminent Biosciences(EMBS) and CODEWEL Nireekshana-ACET, Hyderabad, Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/26770024
Eminent Biosciences(EMBS) and Bharathiyar University, Coimbatore-641046, Tamilnadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27919211
Eminent Biosciences(EMBS) and LPU University, Phagwara, Punjab, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/31030499
Eminent Biosciences(EMBS) and Department of Bioinformatics, Kerala University, Kerala. Publication Link: http://www.eurekaselect.com/135585
Eminent Biosciences(EMBS) and Gandhi Medical College and Osmania Medical College, Hyderabad 500 038, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27450915
Eminent Biosciences(EMBS) and National College (Affiliated to Bharathidasan University), Tiruchirapalli, 620 001 Tamil Nadu, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/27266485
Eminent Biosciences(EMBS) and University of Calicut - 673635, Kerala, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/23030611
Eminent Biosciences(EMBS) and NIPER, Hyderabad, India. ) Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/29053759
Eminent Biosciences(EMBS) and King George's Medical University, (Erstwhile C.S.M. Medical University), Lucknow-226 003, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25579575
Eminent Biosciences(EMBS) and School of Chemical & Biotechnology, SASTRA University, Thanjavur, India Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25579569
Eminent Biosciences(EMBS) and Safi center for scientific research, Malappuram, Kerala, India. Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/30237672
Eminent Biosciences(EMBS) and Dept of Genetics, Osmania University, Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/25248957
EMBS publication In association with Institute of Genetics and Hospital for Genetic Diseases, Osmania University, Hyderabad Publication Link: https://www.ncbi.nlm.nih.gov/pubmed/26229292
Sincerely,
Dr. Anuraj Nayarisseri
Principal Scientist & Director,
Eminent Biosciences.
Mob :+91 97522 95342
  • asked a question related to Big Data
Question
3 answers
Hi. Since missing value imputation is determined by nearby data, should we separate control and treatment groups and perform MVI separately for each? Context: This is for mass spectrometry data.
Relevant answer
Answer
Yes. It is better to calculate MVI for separate control and treatment groups.
  • asked a question related to Big Data
Question
5 answers
Does anyone of you use sentiment analysis in research conducted on data downloaded from the Internet and analyzed in the Big Data database system?
If so, please let me know in which issues, in which research topics do you use sentiment analysis?
Is sentiment analysis helpful in forecasting economic and financial processes?
Please reply
Best wishes
Relevant answer
Answer
Dear Venkatesh Gauri Shankar,
Thanks for the given example and description of building a forecasting model of economic processes based on the use of Python libraries.