Science topics: Information SystemsMicrosoft Excel Data Analysis
Science topic
Microsoft Excel Data Analysis - Science topic
Explore the latest questions and answers in Microsoft Excel Data Analysis, and find Microsoft Excel Data Analysis experts.
Questions related to Microsoft Excel Data Analysis
I am studying leadership style's impact on job satisfaction. in the data collection instrument, there are 13 questions on leadership style divided into a couple of leadership styles. on the other hand, there are only four questions for job satisfaction. how do i run correlational tests on these variables? What values do i select to analyze in Excel?
I have a dataset of patients with ESRD and want to estimate GFR using the 2021 CKD-EPI formula.
how to transform my results in the EuroQOL EQ-5D + EQ VAS to get an overall score that can be presented as “good, moderate, poor“ quality of life?
we will use it pre and post bariatric surgery research and it is not the main topic so we would like a short and easy to use questionnaire that is easy to calculate and easy to present
Hey,
I want to calculate the standard deviation for each substituent for two molecules using Excel and then calculate the average of all the values (no S.D).
For S.D I used STDEV.P and for average I used AVERAGE. Is it the right way? Or should I use STDEV.S? or should I calculate range (large-small) instead of average?
I tried using Gigasheet but it does not have many features that are available in excel. Suggest me some freely available sources where I can load my ~1.7 million rows and do some calculations like sort multiple columns, remove duplicates
TIA
I need to run artanova and tukey-hsd for the interactions among the treatments, but my dataset has few NAs due to experimental errors.
When I run :
anova(model<- art(X ~ Y, data = d.f))
I get the warning :
Error in (function (object) :
Aligned Rank Transform cannot be performed when fixed effects have missing data (NAs).
Manually lifting is not an option because each row is a sample and it would keep NAs, simply in wrong samples.
Dear experts,
Could you please give me a hint on how to depict ranges in a scatter plot, e.g. how to highlight a reference range for data in clinical chemistry, when plotting a time course (see attached image)?
Currently, I'm using LibreOffice Calc and add the reference range manually by drawing a rectangle, but that's tedious to copy/paste into a report when analyzing lots of data, as for each plot, I need to make a screenshot first and then cut and paste the diagram (or select both the rectangle and the diagram before copying). It also would be more convenient and accurate, if I could specify the reference range in the data table or in a form, instead of belly guessing it on the chart.
Any suggestion how I could automate this procedure (e.g. by using a dedicated function and specifying the reference range and possibly a color for it in a data column) so that I just can copy/paste the diagram or even better link to it?
Is there by chance a plug-in or a dedicated program for this purpose?
Thanks for your help!

There are few steps to make heatmap of your qRT-PCR data (fold change or relative quantification) using R.
Data file preparation:
Make excel file of your data in which your will place your gene of interest in column and your treatment or conditions in row.
Save the file in *csv extension.
Import data file in R:
By using following codes, import your data file into R,
data2 <- read.csv("data1.csv")
~ data1.csv will be file name your data file your created in excel and data2 is the name of your data in R. You can use your own names instead of data1 or data2 and you can even give your data a single name at both places.
When you will import the data, you will see first column composed of serial numbers. We need to replace the numbers with the names of actual column of your data that contain your gene of interest. To do this use this code:
rownames(data2) <- data2$Name
~ Name is first column
This will replace the serial numbers with your first column. But now you have two columns with your genes of interest. To remove duplicate, use this code:
data2$Name <- NULL
Now your data is ready to create heatmap.
Developing heatmap:
First create matrix of your data by using following code:
data2 <- as.matrix(data2)
Now install a package to create heatmap "pheatmap" by following code:
install.packages("pheatmap")
after installing you will call that package every time when you want to use it by following code:
library("pheatmap")
Then give a command to make heatmap of your data by following codes:
pheatmap(data2)
Usually we show fold change/relative quantification value inside our heatmap to add them modify your code in the following way:
pheatmap(data2, display_numbers = TRUE)
- You can customize your heatmap in many ways. Contact me any time if your any help.
Deal All,
I have two series of time series data that I would to correlate. One data set is the deposits, by month, for a list of different account. The other is the balances, by month, for the same list of accounts. In essence, I have two matrices that I want to understand correlation for without having to strip out each account separately. Furthermore, I want to cross-section that data into different segments.
This is being done with the goal of being able to forecast account balances in the futures, by looking at their usage behavior (assuming there is a lag relationship).
How do I build an intermediate matrix of the correlations? Is there a way to do it in Python or R-Studio? Is there a way to do it in excel?
Thanks
Ryan
Hi everyone,
recently, I have been working on a study where I examine the impact of American tariffs, customs and other import duties on European exports to the US. I have three variables (y = EU exports to the US, x1 = US tariffs, x2 = US customs and other duties). I use quarterly data from 1995Q1 until 2017Q1 (89 observations). My tutor has emphasized that I need to controll for year and country fixed effects and maybe introduce dummies per year and country. I am quite clueless how to do that. Why is it necessary? What is the equation? How do I do that in Excel or eViews? I would appreciate step by step instructions so much!
Thank you in advance for any help or comments.
I can't find a straight answer on google and it's frustrating me. I am trying to indicate that some cells in my data sheet do not have a value of 0, but rather that no data was obtained and the cell should therefore be left out of calculations. How would I achieve this?
Just typing in "No Data" is problematic, since I receive an error when using pivot tables to calculate averages.
I performed a growth performance experiment of microalgae with four treatment. Where I measure cell dry weight (unit: mg/L), cell density (unit: ×10^5 cells/ml), chlorophyll a (unit: µg/ml) and Beta carotene (unit: µg/ml) content of the microalgae. Reviewer suggests me to analyze correlation of cell growth/size with the pigment content (Chlorophyll a and Beta carotene) of microalgae. So, how can I measure correlation of cell growth/size with the pigment content (Chlorophyll a and Beta carotene) of microalgae using Microsoft Excel or other suitable data analysis software? Thank you.
Since it is preferred to check any autocorrelation among the variables; one has to remove highly correlated variables to run an SDM (I am using MaxEnt). For my study, I have calculated the Pearson correlation coefficient (r) among the variables (correlation matrix is provided). But as I am new to this, I am finding it hard to interpret the correlation matrix table. Meaning how and on what basis, I am going to remove the variables? (I am taking the threshold ≥0.8 for the purpose), I need some expert suggestions.
Q1. How the variables are chosen? And please suggest me accordingly to the provided table. Which are the variables I have to select for my study?
Q2. How one variable is selected, when there is a high correlation between two variables?
Q3. Is negative correlation not a problem? I am asking this because; I have seen few papers where highly negatively correlated variables are also selected.
Please help me.

I want to analyze data of different workshop (approx 40 workshop) participants data and following two are issues:
A. I want to check out of 4000 participants, How many participants attended multiple workshop
B. Separate the participants data based on their designation, Department, Affiliation
I used binary logistic regression to model the behavior of drivers' stop and go decision in dilemma zone. Now I have to estimate the elasticity of variables (corresponding change in outcome probability on changing one unit of X). My variables are of continuous and categorical nature. Any help would be highly appreciated.
P.S: I have uploaded the screenshot of one of the papers where author has calculated the elasticities.


Imports System.Data.OleDb
Imports Microsoft.Office.Interop
Imports System.Windows.Forms
Public Class UploadData
Private connString As String = "Provider=Microsoft.Jet.OLEDB.4.0;Data Source=C:\Users\admin\Desktop\MatrixVisualizationSystem\MatrixVisualizationSystem\database.mdb;Persist Security Info=True"
Private Sub btnBrowse_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles btnBrowse.Click
Dim result As DialogResult = ofd.ShowDialog()
If result = Windows.Forms.DialogResult.OK Then
txtfilename.Text = ofd.FileName ' خزنت اكسل بالtxtfilename
End If
End Sub
Private Sub Button1_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button1.Click
If txtfilename.Text = "" Then
lblstatus.Text = "ارجو تحديد الملف المراد تحميل البيانات منه"
lblstatus.ForeColor = Color.Red
Return
Else
lblstatus.Text = ""
End If
Dim Excel As String = txtfilename.Text 'تعريف الباث
Dim excelConnString As String = "Provider=Microsoft.Jet.OLEDB.4.0;Data Source=" & Excel & ";Extended Properties='Excel 8.0;HDR=Yes;IMEX=1;';"
Dim excelConnection As New OleDbConnection(excelConnString)
Dim excelCommand As New OleDbCommand("select * from [ورقة1$]", excelConnection)
Dim excelReader As OleDbDataReader = excelCommand.ExecuteReader()
Dim dt As New DataTable() 'dt data table لل داتا قرد
dt.Load(excelReader) ' rows and colبيتعامل معها كتيبل
Dim accessConnection As New OleDbConnection(connString)
Dim accessCommand As New OleDbCommand() ' بياخد كل انواع الي بدي ياها ونا بختار
For Each row As DataRow In dt.Rows ' كل رو"as data row" امشيلي عليه
If row(0).ToString = "" Then ' اذا كان الرو فاضي
Continue For ' لا تكمل وارجع لف مرة تانية حتى تصير ترو
End If
accessCommand.CommandText = String.Format("INSERT INTO Schedule (courseCode, courseNo, SectionNo, CourseTitle, Hours, Days, fHour, tHour, room, roomNo, maxLimit, teacherName) " & _
" VALUES ('{0}','{1}','{2}','{3}','{4}','{5}','{6}','{7}','{8}','{9}','{10}','{11}')",
row(0),
row(1).ToString(),
row(2),
row(3),
row(4),
row(5),
row(6),
row(7),
row(8),
row(9),
row(10),
row(11)) ' تخزين كل البيانات بالاكسس
accessCommand.Connection = accessConnection
If accessConnection.State = ConnectionState.Closed Then ' اذا مغلق افتحه واطلع واذا مفتوح خزن واطلع
End If
accessCommand.ExecuteNonQuery()
Next
accessCommand.CommandText = "select * from Schedule" ' حذف الخلايا الفاضية
dt = New DataTable
dt.Load(accessCommand.ExecuteReader)
gridSchedule.DataSource = dt
If excelConnection.State = ConnectionState.Open Then
excelConnection.Close()
End If
If accessConnection.State = ConnectionState.Open Then
accessConnection.Close()
End If
End Sub
'Private Sub Button2_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button2.Click
' ' شغل الدكتور
' ' المشكلة الساعات واسم المساق ما بيعطي الي فيه سترينغ
' Dim xlsApplicaton As Excel.Application
' Dim xlsWorkBook As Excel.Workbook
' Dim xlsWorkSheet As Excel.Worksheet
' xlsApplicaton = New Excel.Application
' xlsWorkBook = xlsApplicaton.Workbooks.Open("G:\123.xls")
' xlsWorkSheet = xlsWorkBook.Worksheets("Sheet1")
' Dim type As Object = xlsWorkSheet.Cells(1, 7).Value.GetType ' 1 row , 7 col
' For i As Integer = 2 To xlsWorkSheet.Rows.Count
' Dim CourseCode As Object = xlsWorkSheet.Cells(i, 1).Value
' Dim CourseNo As Object = xlsWorkSheet.Cells(i, 2).Value
' Dim SectionNo As Object = xlsWorkSheet.Cells(i, 3).Value
' Dim CourseTitle As Object = xlsWorkSheet.Cells(i, 4).Value
' Dim hours As Object = xlsWorkSheet.Cells(i, 5).Value
' Dim Days As Object = xlsWorkSheet.Cells(i, 6).Value
' Dim fHour As Object = xlsWorkSheet.Cells(i, 7).Value
' Dim tHour As Object = xlsWorkSheet.Cells(i, 8).Value
' Dim room As Object = xlsWorkSheet.Cells(i, 9).Value
' Dim roomNo As Object = xlsWorkSheet.Cells(i, 10).Value
' Dim teacherName As Object = xlsWorkSheet.Cells(i, 12).Value
' Next
'End Sub
Private Sub ofd_FileOk(ByVal sender As System.Object, ByVal e As System.ComponentModel.CancelEventArgs) Handles ofd.FileOk
End Sub
End Class
I would like to calculate the MSD and Deff for each particle (every particle have an unique TRACK_ID in the table).
You could find one of my files attached.
Thanks a lot
So I have 2 sets of data, one is around 8700 values and the other is around 5100 values. Both of them are from different setups and I need to find the error between them. I have made the graphs and everything and I need to find the exact value of maximum error in the trend. The data is currently in MS Excel. I know MATLAB and Python, but I am unable to get the value. Is there anyway it can be done?
To explain it further in terms of a graph, the x-axis contains time and the y-axis contains the observed data. I need the time at which the maximum error occurs and the value of that error.
I am straggling to use SPSS amous software. Can you give some suggestions to me?
Apart from using Microsoft Excel to draw graphs, which other software is user-friendly that I can use?

I have 50 survey responses coded as 0 and 1 in a square matrix format pasted on different excel worksheets. I want measure the frequency on 0 and 1 in each cell to identify if 0 is having frequency or 1. What is probable method for it??
In order to analyze daily yield of Solar PV , the master file needs to be split into smaller files. As Excel has limitation of number of rows, it is tough to load or extract the same. Hope there is a good solution or work around is available.
Thank you in advance.
I am working on portfolio optimization using lower partial moment of order 1, can someone help me how to implement LPM-1 in excel sheet using "tau" as my threshold value as 0.00% and order (n) as 1.
Thank you all in advance for your contributions to my question.
To dear Researchers,
I was analyzing a series of concentration for estimation of Real-Time PCR efficiency. The concentration was 1:10. I used MS-excel to evaluate Slope. The result of slope was -8 and it was consistence with that in Scatter plot equation. I also drew a line plot. In the line plot the slope was -3.2 and the efficiency was 100%. Also, there was no linearity in Scatter plot but there was a well fitted linearity (0.99) in line plot. I'm not sure what is the problem. Please help me figure it out.
Kind regards
Alireza
I'm looking for an excel sheet to calculate the STS Score. From the website http://riskcalc.sts.org/stswebriskcalc/calculate. Does anyone have a worksheet to calculate the STS Score?
I submitted three sample materials for DMA analysis. Resulting data (frequency, temperature, E', E'', dL, tanD, Ft, time) came out in excel file along with the graphs of all materials from the analyzing software which were also attached inside the excel file in a separate sheet . Upon inspection of the graphs, I noticed that unlike most DMA graphs which presents three smooth curves displaying the storage modulus, loss modulus, and loss tangent, the graphs I received has multiple curves that represent each elements to which I cannot properly interpret.
Furthermore, because I need to make a comparative study of my materials, I am required to plot the same sets of elements into one graph. As I initially plot for the storage modulus, what appeared was three zigzag lines coming to a curve instead of three smooth curves. This same curving zigzag lines were evident also as I plotted for the loss modulus and loss tangent of the sample materials.
Is the excel data and graph I received from the DMA analysis reliable? How do I properly plot the DMA data in excel or, possibly, in another software to achieve desired results? Thank you.
Attached here are the DMA graph of Sample 1, the graph for the storage modulus of Samples 1, 2, & 3, and DMA graph from Lozano-Sánchez, L., Bagudanch, I., Sustaita, A., Iturbe-Ek, J., Elizalde, L., Garcia-Romeu, M., & Elías-Zúñiga, A. (2018). Single-Point Incremental Forming of Two Biocompatible Polymers: An Insight into Their Thermal and Structural Properties. Polymers, 10(4), 391. For your reference.
Scatter with Smooth Lines were used to plot E' (y-axis) and Temperature (x-axis) using Microsoft Excel in the image Storage Modulus of Three Samples.JPG
I have 3 data sets (eg Scores from three different Universities) all mixed up in a single excel sheet and with different number of scores. Eg Oxford sample with 4000 scores, Cambridge with 3200 scores, and Portsmouth with 1800 scores and I want to extract the samples,what would the macros look like? Please note the spreadsheet has 2 columns i.e. University and Score. If someone can also answer the same question for SPSS I will also be grateful.
I am required to do a statistical analysis comparing Country A and Country B in terms of their approaches to minimize or prevent occurrence of COVID-19 in their respective country.
So far these are the information I have gathered:
- Positive cases from COVID-19
- Death cases from COVID-19
- Mitigation measures taken (Descriptive)
- Condition of country: health, economy, social
For now I can only decide to do T-test for death cases. Do you think it is suitable?
Feedbacks and ideas are very much appreciated. Thank you in advance!
Dear colleagues,
do you have some EXCEL template for calculate GINI coefficient and create Lorenzo curve? I find some R packages but I am not sure how to use it on my date with population and income by region.
Hi everyone, as stated above, I would like to know whether it is possible to export data of SpO2 and Nasal Flow in .csv or .xls(x) format from ApneaLink for further analysis.
Thank you in advance, best regards.
Is it possible to do Central composite design and Response surface method when there are two independent variables? Say, the first independent variable represents 5 graded levels of a fatty acid and the second independent variable represents 2 temperature and I would like to see their interaction effect on fish.
Please provide me suggestion how it can be analysed in Excel or SPSS.
Hi all,
I'm looking at accelerometer data. I used two accelerometers at once for a backup, but now I need to correlate the two datasets provided. I'm only looking at one axis on each, but one set of data has ~5000 more data points than the other. This may sound like a lot, but the datasets on average hold about 215,000 points of data. During data collection, I paired the two so they began collecting data and ended at the same time, but still one observed more movements than the other. Is there anyway around this to compare the two to say that both pick up about the same type of data than the other? A correlation test in R isn't working since there are two different amounts of data in each column. I do, however, have the mean and SD of each set.
Thanks so much in advanced.
I am doing research for my thesis which is comparing the genetic variants (genes, SNPS, etc.) of 4 medical conditions. I have identified approximately 7,000-10,000 genetic variants that I would like to analyze, compare, and run a statistical analysis on. I would like to complete my thesis in this lifetime, so manual entry is probably not the best option nor is it the most accurate (due to human error). If anyone has any techniques that would work, it would be wonderful. Though I am comfortable with EXCEL, I have access to SPSS statistical software, maybe export into SPSS is easier/possible? Thank you.
Hi,
I want to calculate the correlation of the Bitcoin price with some other asset classes such as gold or oil. Therefore, I use the daily returns in percent of each asset and simply apply the correlation formula in excel. To be sure I got me two different datasources for oil and gold (Krugerrand and H&H gold, Texas and London Oil). Obviously the historic prices of Krugerrand and H&H are almost the same, the same applies for the two different kinds of oil.
My question/problem is: The correlation of the absolute values from Gold 1 - Gold 2 and Oil 1 - Oil 2 is close to one which makes sense. The correlation of the returns, however, is close to zero, i.e. non existent. How can that be? shouldn't the returns be at least very positively correlated as well?
Furthermore, the correlation of each time series with Bitcoin differs, even though I am thinking that Oil 1 and Oil 2 / Gold 1 / Gold 2 should have the same relation to Bitcoin.
I am trying to figure this out since days now and I am running out of time for this project. I would really appreciate if anyone has an idea what I am not seeing here.
I attached the excel, the most relevant sheets are highlighted in red.
Is there an Excel implementation of the Douglas Peucker algorithm?
In order to analyze the sentiment on downloaded data from social media portals (such as Facebook, Tweeter, LinkedIn, but also YouTube, Instagram ...) and aggregated in Big Data database systems, it is necessary to use specialized software for extraction and analysis of these data.
The quality of the data transferred to, for example, Excel sheets depends on the quality of the extraction process carried out with the help of specialized software.
Then, the quality of data analysis software in Excel sheets or in systems of computerized analytical platforms depends on the result obtained, the answer to the question given to the collected, initially unstructured data in the Big Data database system.
In the future, artificial intelligence may be used for this purpose, and the whole process of purposeful analysis of collected data will proceed in a much more effective, automated manner, less probable errors, will be a cheaper research process and will be carried out much faster even on much larger information collection than current.
In view of the above, the current question is: What will the directions of development of analytical processes carried out sentiment analyzes on data collected in Big Data database systems in the future?
Please, answer, comments. I invite you to the discussion.

I have a data set comparing the accumulated biomass on two types of substrate. The number of samples from each substrate is different, however low (n<10).
What would be the most appropriate test to show significant differences between the two substrates?
I've tried PERMANOVA v7 on euclidean distance resemblance matrix, but it seems a bit too much for such a small sample size.
Suggestions anyone?
How to Derive Coefficients of the plynomial Eq. via interpolation calculations of hourly measured values of Net Power over Measured solar Data in Excel ?
Hi,
Similar to Matlab xlswrite(filename,A), what is the function name in Scilab to write on a new excel file ? I tried searching, but couldn't find out. There was only write_csv available.
Thanks,
Praveen
I need to transfer four adjacent cells from Table 1 to Table 2 based on a single cell value with only some of the words in the cell matching. Can anyone please help with a formula/function/script? Many thanks!
Hello,
I'm trying to graph some data and can't figure out how to make the graph I need in Excel. I have a single DMSO control and then a dose response series for several compounds. How can I graph this data without getting a huge gap between DMSO and Compound A. Also, how do I get DMSO off of the legend entry? The excel file is attached. Tips for Excel or Prism 6 would be helpful. Thanks!
How can I specifically directly import any column or row from Excel file saved at some location into sciNotes?
How can I interface scinotes and Excel?
Hey all,
I do calcium imaging on dissociated neurons and am currently experiencing an issue with my data analysis.
Essentially the computer I use to image these neurons will take data points every 2 seconds. This computer has a limited amount of processing power and ram so when I am imaging a large amount of neurons at a time, it is unable to image these neurons precisely at every 2 seconds. My x values will look like x=2.12, 4.14, 6.19, 8.41 etc. instead of x=2.00,4.00,6.00,8.00 etc.
The problem is that I need to overlay the data sets to create a mean data set of all the neurons I've imaged. However, I am unable to create a mean data set on GraphPad Prism because the x values of the individual neurons do not 'match up'.
I'm looking for a solution on Excel to make the x values of the recorded neurons 'match up'. I can see either 2 solutions: (A) binning all the x values of a certain range to a certain number. Ie. Changing the values of all x values that fall between 3.00-4.00 and 4.00-5.00 to 4.00 The other solution (B) would be to predict the y values at a certain x values based on the y values at other x values. Ie. If I have values at x= 2.12, y= 5.00 and x= 4.14, y= 7.00, using these known values to predict what y would be at x=2.00.
I've attached a couple of screenshots of the excel files I work with to hopefully clarify what I am trying to do. Each sheet corresponds to a different plate of neurons that was imaged. The values in the A column are time (the x values) and every other column corresponds to an individual neuron on that plate and their calcium level (y value).
Thanks for your help.


I have large number of precipitation daily data in a column (for 53 years).I need to convert it to monthly sum. How can I do that? The data available format and required format are attached in this sheet. Kindly guide me
Data is in the form of likert scale 1-4 with 1=highly unimprtant and 4=highly important
6 parameters
How can I illustrate a forest plot in excel for 2 groups? I would like to do meta analysis for each publication, each with a different technique.
Hi,
I am Writing a thesis about predicting abnormal stock returns based on sentiment analysis of tweets.
More specifically we have a huge datasets of tweets, corresponding to a randomized sample of about 1% of all tweets during a year.
Now, we want to sort out the tweets mentioning the companies in the index we are looking at, which is EURO Stoxx 50.
We now want to sort or dataset for tweets containing any cash-tag ($) for our companies. For example AztraZeneca will be $AZN for their ticker symbol. So for this index we will filter for a list of any of 50 cashtags. How can we do this? Preferably in excel.
I enclosed a Picture of how our spreadsheet looks like, as well as a sample of the dataset.
KR
Benjamin

A simple example for 4 P facilities and 20 demand points will straight thing out for me
Attached is an example of my problem. The different names listed in Column A need to be the headings of columns in a new table, with ALL the corresponding values listed under the appropriate headings in the table. Can anyone help? I can't figure it out although it seems like such a simple problem.
My dataset has 32 rows and 13 columns containing monthly rainfall data of 31 years. (The sample file is provided herewith).
I have two datasets which are outputs from a model. Each dataset contains an array of independent variable X and 10 dependent variables Y. I have tried to create 4 different visualization options in Excel, MATLAB, Mathematica & ENVI IDL (Check attached image). However, I would like to know if there is a better graphical/data visualization alternative. The first plot in the image was created in Excel with the simple 'scatter plot with smooth lines' option. The rest 3 options were plotted in MATLAB, Mathematica and IDL programmatically. The Excel and IDL plots look similar. MATLAB and Mathematica plots share the same characteristic look. I am trying to find a way to present the differences clearly between the two Scenarios A & B graphically. I would appreciate if someone can suggest a better statistical/graphical/visualization/plot option.
Following are my datasets:
SCENARIO A
x= [0, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1]
y1= [0, 0, 0.02, 0.01, 0, 0, 0, 0, 0, 0, 0, 0]
y2= [0.01, 0, 0.05, 0.1, 0.19, 0.6, 0.87, 1, 1, 1, 1, 1]
y3= [0.02, 0.05, 0.2, 0.69, 0.99, 1, 1, 1, 1, 1, 1, 1]
y4= [0.02, 0.12, 0.25, 0.97, 1, 1, 1, 1, 1, 1, 1, 1]
y5= [0, 0.12, 0.68, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y6= [0, 0.2, 0.84, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y7= [0.01, 0.49, 0.97, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y8= [0.01, 0.51, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y9= [0.01, 0.82, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y10= [0, 0.84, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
SCENARIO B
x= [0, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1]
y1= [0.01, 0.03, 0.01, 0, 0.01, 0, 0, 0, 0, 0, 0, 0]
y2= [0.01, 0.07, 0.04, 0.13, 0.23, 0.5, 0.92, 1, 1, 1, 1, 1]
y3= [0.01, 0.03, 0.2, 0.61, 0.99, 1, 1, 1, 1, 1, 1, 1]
y4= [0.02, 0.06, 0.4, 0.99, 1, 1, 1, 1, 1, 1, 1, 1]
y5= [0, 0.24, 0.61, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y6= [0, 0.26, 0.88, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y7= [0, 0.51, 0.99, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y8= [0.02, 0.64, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y9= [0.02, 0.87, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
y10= [0.01, 0.94, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]

I use Microsoft Excel for numerous spreadsheets for clinical trials. Anything from enrollment logs, reimbursement to the subjects, reimbursement from the sponsors, lists of potential subjects, etc. I'd hate to think how much manual data I would have to capture without it.
Dear friends and colleagues!
Help me, who know how to write in Microsoft Excel formula for calculating the binary logistic regression.
Calculation is needed is a set of formulas in cells of Microsoft Excel, in extreme cases - the script VBA.
Do not offer to use R, SPSS, MathCad and others.!
Using this method, I want to improve the model for the prediction of the activity of natural focal infections, and the model is constructed in Excel and rewrite it to other platforms is no reason.
If someone gives me a suitable option - will co-author in all publications, made using this method.
I am doing a thesis on M&As, and I have to check for significant differences in the operating performance of cross-border and domestic M&As and compare the same. Ideally, I would compute the median pre- and post- acquisition performance of 2-3 years before and after the takeover, and arrive at the median pre- and post-acquisition performance. However, I would then need to compute the Median Difference from the median pre- and post-acquisition performance and derive it's statistical significance.
I am confused on how to go about it using Excel's T-Test. When I select the variables, I am made to select the output range. I do not understand how the author of the paper I am referring to has done it (I have attached a screenshot of the author's method).
Your inputs are appreciated.

I have some data that I need to find 50% value for. The chart consists of the period (days) as X axis values and percentages as Y axis values. I was trying to find a way to determine the 50% value on a trend line for each treatment. Is there any easy way to do that without going through modifying the formula and applying it for each treatment?
I attached a pic for one of the treatments I want to find 50% value for.
Thank you in advance for your assistance.

I want to quantify the change in protein expression from cell lysates by western blotting. But first, I need to find the linear range of the signal. In order to do that, I loaded increasing amounts of protein, blotted them with appropriate antibodies, measured the signal, and plotted the results in Excel as a scatter plot.
As expected, more protein gives stronger signal. With low protein loading (0-10 micrograms), there is a proportional response, but with higher protein loads (> 20 micrograms), the signal flattens out. There is a plateau because signal detection is saturated.
I want to fit the data to show the plateau more clearly. How would I do this? I also want to find the linear range of this curve. Do I need to fit a second, straight line?
I am facing some difficulties to organize the exported Excel Table to analyze further into T Align software. Does anybody help me to guide how to do it? I have given a sample table, which column I will take for analysis into T Align software?
I only have field measured values of solar radiation, air temp, relative humidity, dew point, air pressure, wind speed and direction. I do not have any information on ratio of clear/cloudy time and I'm having a difficult time arriving at the necessary emissivity values needed to calculate incoming longwave rad.
Thank you for any help.
I am doing RT-PCR for my experiment. I have CT value and after that I want to calculate fold difference between treatment and non-treatment.
If somebody has an Excel template, could you send to me?
Thanks.
many are written formulae for testing reliability
but i have not seen any formulae for test of validity
Finding validity is essential for any research
Exporting the list of papers to Excel allow you to sort papers and delete duplicates
I am currently plotting some graphs on excel. I would like compare the expression of a certain gene in different conditions. I made my calculations on excel and ended up with a normal column graph of the normalized mean + SEM of the gene expression under the different conditions. If you see on the picture I have attached, they easily show if the difference between any of the column bars is statistically significant with a nice line connecting the two bars and the annotation of whether P is smaller than 0.05. How can I annotate my graph in a similar way (in a way in which I can clearly see which bars are compared with t test)?
Thank you in advance

How accurate is the relationship given by Excel's trendline? If there are several parameters or variables involved in the relationships and it is preferable to solve the relationships one by one rather than putting a whole bunch of data into SPSS a certain form of end equation is required. How can it be done using Excel?
I have an excel table which includes more than 10 columns and 600.000 rows. I don't want same rows which includes same things in columns.
I tried (In LibreOffice and Open Office)
CtrlA>data>filter>standard filter
Column A = not empty AND
Column B = not empty AND
....
More Options>No duplicates
I think it was supposed to be done, but nothing happened.
When I tried it with 2 or 3 columns, it worked. But is there any way to do this stuff with more columns?
For now I solved the problem by a temporary solution but I need to learn the real solution.
I am working on my thesis for my undergraduate degree and I am meant to carry out a load forecast on my school's data. While collating the data for my forecast, I noticed that some values were missing. What method can I use to fill in these values to increase the accuracy of my forecast? I am using the regression analysis method and also working with Artificial Neural Networks. I am working with MicroSoft Excel.
I have been asked to apply this method in a set of results I have for a due paper, but I do not really know much about statistics, so I am at an impasse. Could someone help me?
I cultured a bacterium in 8 different culture conditions to find the VOCs cohort of this bacterium in different conditions. I am wondering if you can tell me the best way of data analysis by using SPSS. 35 VOCs (peaks: retention time) have been identified per each condition and I want to compare them to find the similarities by using SPSS.
How can we avoid initial number of exponent in Y-axis of column graph such that the axis only have base 10 without decimal number as attached example?

And population standard deviation?
I am looking for a comprehensive question to make a reasoned evaluation of a post graduate student's ability to working with MS EXCEL.
What do you think, is the drawing error-bar in excel only based on standard deviation or should we divide it by the square root of iterations?
I thought bar graphs with different colors for each group, but Excel nor STATA include this option.
I have recently done a workshop on Tableau Software of Tufts University for Microsoft Excel Data Analysis. It gives very good graphs etc.
If I have values in column A in a spreadsheet starting from A1 to A5 as for example 1, 2, 3.5, 4, 6 and I need to extract all values less than 5 in a new column, what is the best method? And how do I set the auto extraction if the value 5 is in for example in cell C1. That is mean when I changed the value in C1 the values extracted in the new column will changed according to the new value in C1.
Excel is so very basic; SigmaPlot and SPSS ok, but again no support for simple non-parametric plots, for eg. Box and Whisker. Wondering what others are using?
I'm looking for a software to create a transparent forest plot graph of Odds Ratio. Do you know any recommendable programs?