ArticlePDF Available

Abstract and Figures

The maximum likelihood parameter estimation method with Newton Raphson iteration is used in general to estimate the parameters of the logistic regression model. Parameter estimation using the maximum likelihood method cannot be used if the sample size and proportion of successful events are small, since the iteration process will not yield a convergent result. Therefore, the maximum likelihood method cannot be used to estimate the parameters. One way to resolve this un-convergence problem is using the score function modification. This modification is used to obtain the parameters estimate of logistic regression model. An example of parameter estimation, using maximum likelihood method with small sample size and proportion of successful events equals 0.1, showed that the iteration process is not convergent. This non-convergence can be solved with modifications on a score function. Modification on score function is to change a score function, a matrix of the first derivative of the log likelihood function, to the first derivative matrix itself minus multiplication of information matrix and biased vector. The modification of the score function can quickly yield values of parameter estimates, especially when the sample sizes are larger, and convergence was reached before the 10 th iteration.
Content may be subject to copyright.
Journal of Physics: Conference Series
PAPER • OPEN ACCESS
The parameter estimation of logistic regression with maximum likelihood
method and score function modification
To cite this article: R Febrianti et al 2021 J. Phys.: Conf. Ser. 1725 012014
View the article online for updates and enhancements.
This content was downloaded from IP address 193.160.78.22 on 14/01/2021 at 14:01
Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution
of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
Published under licence by IOP Publishing Ltd
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
1
The parameter estimation of logistic regression with
maximum likelihood method and score function modification
R Febrianti, Y Widyaningsih and S Soemartojo
Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA),
Universitas Indonesia, Depok 16424, Indonesia
Corresponding author’s email: yekti@sci.ui.ac.id
Abstract. The maximum likelihood parameter estimation method with Newton Raphson
iteration is used in general to estimate the parameters of the logistic regression model. Parameter
estimation using the maximum likelihood method cannot be used if the sample size and
proportion of successful events are small, since the iteration process will not yield a convergent
result. Therefore, the maximum likelihood method cannot be used to estimate the parameters.
One way to resolve this un-convergence problem is using the score function modification.
This modification is used to obtain the parameters estimate of logistic regression model.
An example of parameter estimation, using maximum likelihood method with small sample size
and proportion of successful events equals 0.1, showed that the iteration process is not
convergent. This non-convergence can be solved with modifications on a score function.
Modification on score function is to change a score function, a matrix of the first derivative of
the log likelihood function, to the first derivative matrix itself minus multiplication of
information matrix and biased vector. The modification of the score function can quickly yield
values of parameter estimates, especially when the sample sizes are larger, and convergence was
reached before the 10
th
iteration.
Keywords: Maximum likelihood, score function modification
1. Introduction
Regression analysis is a statistical method to analyze the relationship between one response variable and
one or more explanatory variables. Regression analysis is used to analyze data with quantitative response
variable. If the response variable is a qualitative variable, the linear regression model cannot be used.
So, logistic regression model is used to analyze the data with qualitative response variable. Logistic
regression model is defined as [1].




or







where .
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
2
The maximum likelihood method can be used to estimate the parameters of the logistic regression
model.
is a random variable and is independent with  and

.
Defined likelihood function of

is [2, 3].
!
"
!
#
!
$
%
&
'(
)
"&
$
*"
(1)
Because
+
"+



and (
"
",
'-.-/)
then equation 1 can be expressed as:
!
"
!
$
0
1
0
"
20
1
0
"
%



$
*"
&



(2)
To simplify the derivative of likelihood function, the log-likelihood function is used. The log-
likelihood function is defined as follows:
0
1
0
"
3420
1
0
"
5!
0
1
0
"

$
*"
(534'


)
$
*"
(3)
Next, the derivative of log-likelihood function with respect to 0
1
, is [4].
60
1
0
"
60
1
5!
$
*"
(





(4)
Since 

,
-.-/
",
-.-/
then equation 4 can be expressed as:
60
1
0
"
60
1
5!
$
*"
(
 7 (5)
The derivative of log-likelihood function with respect to 0
"
is
60
1
0
"
60
"
5!
$
*"
(





(6)
Again, since 

,
-.-/
",
-.-/
, equation 6 can be expressed as:
60
1
0
"
60
"
5!
$
*"
(

 5!
$
*"
(

7 (7)
Equation 5 and equation 7 are not in 0
1
and 0
"
, and are difficult to find the solutions analytically,
then to obtain the values of 0
8
1
and 0
8
"
, Newton Raphson numerical iteration method should be used [5].
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
3
The steps to estimate parameter 9 using Newton-Raphson iteration are
1. Input the initial estimated value of 9
:
2. To obtain estimation values on the (k+1)-th iteration, calculate
9
;"
9
;
(<9
";
=9
;
.
3. The iteration is continued until 9
;"
>9
;
.
=9 is defined as a matrix of the first derivative of log likelihood function with respect to the
parameters
=9?
@
@
@
A
69
60
1
69
60
"
B
C
C
C
D
?
@
@
@
@
A
5!
$
*"
(
5!
$
*"
(

B
C
C
C
C
D
and <9
"
is
<9?
@
@
@
A
6
#
0
1
0
"
60
1
#
6
#
0
1
0
"
60
1
60
"
6
#
0
1
0
"
60
"
#
6
#
0
1
0
"
60
"
60
1
B
C
C
C
D
"
<9(
?
@
@
@
@
A
5
'(
)
$
*"
5
'(
)
$
*"
5
'(
)
$
*"
5
#
'(
)
$
*"
B
C
C
C
C
D
"
2. Estimation of parameter using modification of score function
The logistic regression model uses the maximum likelihood method to estimate the parameters of the
model, using Newton Raphson method to obtain the final solution. According to Badi N H S [6],
the Newton Raphson iteration is not convergent if the sample size is small and the proportion of success
events is small. According to Czepiel S A [7] if the result of parameter estimation through the iteration
is not convergent, indicate that the model formed is no suitable for the data being analyzed.
The solution to solve the divergence problem in Newton Raphson iteration is to modify the score
function. Modification on score function discovered by Firth in 1993. Modification of score function is
using bias vector and information matrix to estimate parameter in logistic regression model.
Mathematically, the modification of score functions is to change =9 to =
E
9 as follows [8]:
=
E
9=9(F9G9 = 0 (8)
where F9 is an information matrix, defined as:
F9?
@
@
@
A
(HI6
#
0
1
0
"
60
1
60
1
J(HI6
#
0
1
0
"
60
1
60
"
J
(HI6
#
0
1
0
"
60
"
60
1
J(HI6
#
0
1
0
"
60
"
60
"
JB
C
C
C
D
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
4
K9
?
@
@
@
@
A
5
(

$
*"
5

$
*"
(


5

(

$
*"
5
#

(

$
*"
B
C
C
C
C
D
L
M
NL
with NOPQR
(
S and L is the design matrix.
G9 is a bias vector, defined as:
G9'L
M
NL)
"
L
M
NT
where
T
?
@
@
@
@
@
@
A
U
""


"
(
"
V
"
(
W
U
##


#
(
#
V
#
(
W
X
U
$$


$
(
$
V
$
(
WB
C
C
C
C
C
C
D
with U

being an element of the diagonal of the hat matrix. In Generalized Linear Models, the hat
matrix was calculated as:
YN
Z
[
L'L
M
N
L
)
Z
L
M
N
Z
[
So the formula of modification of score function of regression logistics model is:
=
E
9 =9(F9G9 = 0
0 =9('L
M
NL)'L
M
N
L
)
"
L
M
NT
0 =9(<L
M
NT
0 =9(L
M
NT
\7
7]
?
@
@
@
@
A
5^!
U

(U


_
$
*"
5^!
U

(U


_
$
*"
B
C
C
C
C
D
`=
:
E
=
Z
E
a
The result of 9 parameter estimation for modification of score functions requires numeric iteration.
The (m+1)-th iteration of modification of score function is:
9
b"
E
9
b
E
(G9
b
E
<
b
9
"
=
E
9
b
E
with 9
b
E
is the value of 9 at the m-th iteration,
G9
b
E
is bias vector at the m-th iteration,
<
b
9
"
is the invers of information matrix at the m-th iteration,
=
E
9
b
E
is the score function at the m-th iteration,
cd7.
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
5
3. Application
The maximum likelihood parameter estimation and modification of score function to logistic regression
models is applied on endometrial cancer data. In this data, HG (Histology Grade) is a high or low value
of endometrial cancer that is determined as variable response. If the HG value is 1, cancer endometrial
is on high stadium; if HG value is 0, cancer endometrial is on low stadium. EH (Endometial Hyperplasia)
is state of the endometrial growing to excess. EH is the explanatory variable in modeling. The data
consists of 79 observations, with response values of y = 1 are 30 observations and the response values
of y = 0 are 49 observations [7]. The samples application use samples of size n = 10, n = 20, and n = 30
with the proportion of y = 1 is 0.1 and the stopping criterion in the program is the maximum iteration of
10,000 or the error tolerance in the program is e7
f
.
Table 1 shows that the parameter estimation using maximum likelihood method with Newton
Raphson iteration is not convergent. This problem is solved using modification of the score function to
estimate the parameters of the model. Table 2 is the result of a modification of score function for n = 10
and proportion of y = 1 is 0.1.
Table 2 shows that the result of the parameter estimation using a modification of the score function
gives the values of 0
8
1
= -5.7266 and 0
8
"
= 1.9464. A modification on the score function is able to solve
un-convergence parameter estimation problem of maximum likelihood using Newton Raphson iteration.
Convergence begins at the 564
th
iteration.
Table 1. The results of maximum likelihood parameter estimation
using Newton Raphson iteration without modification of
score function, n = 10, proportion of y = 1 is 0.1.
Iteration 0
8
1
0
8
"
1 -4.9355 2.3556
2 -8.3121 4.0197
3 -11.5656 5.6104
4 -14.8322 7.1973
5 -21.579 8.8085
X X X
X X X
10,000 (g (g
Table 2. The results of maximum likelihood parameter estimation
using Newton Raphson iteration with modification of the
score function, n = 10, proportion of y = 1 is 0.1
Iteration 0
8
1
0
8
"
1 -4.9355 2.3556
2 -4.9355 2.2556
3 -4.9355 2.1556
4 -5.0355 2.1556
5 -5.1355 2.1556
X X X
563 -5.7267 1.9463
564 -5.7266 1.9464
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
6
Table 3. The results of maximum likelihood parameter estimation
using Newton Raphson iteration with modification of
score function, n = 20, proportion of y = 1 is 0.1.
Iteration 0
8
1
0
8
"
1 0.1741 -0.9404
2 2.5349 -2.6134
3 6.0205 -5.0421
4 9.1708 -7.4126
5 11.8665 -9.4988
6 12.6815 -10.1196
7 13.0317 -10.3947
8 13.7242
-10.9536
9 13.7242
-10.9536
10 13.7242
-10.9536
Table 4. The results of maximum likelihood parameter estimation
using Newton Raphson iteration with modification of
score function, n = 30, proportion of y = 1 is 0.1.
Iteration 0
8
1
0
8
"
1 -0.1838 -0.7757
2 1.2144 -1.9532
3 2.9857 -3.3758
4 4.1439 -4.3899
5 4.5526 -4.7656
6 4.5949 -4.8054
h 4.5949 -4.8054
8 4.5949 -4.8054
Table 3 and table 4 are the results of iteration for sample sizes of 20 and 30, respectively, with a
proportion of success of 0.1 using the maximum likelihood parameter estimation method using
Newton Raphson iteration with modification of the score function.
Based on the results of tables 3 and table 4, the maximum likelihood estimation method with
modification on the score function using Newton Raphson iteration with larger sample sizes can give
values of parameter estimation rapidly. For sample size of n = 20, the convergence parameter starts on
the 8
th
iteration. Furthermore, for sample size of n = 30, the convergence parameter starts on the
6
th
iteration.
4. Conclusion
To estimate the parameters of the logistic regression model using the maximum likelihood method is to
differentiate the likelihood function, then set this first derivative to 0, and continue to solve the equation
to obtain the estimate of parameters. The first derivative of the likelihood function on the parameters is
not linear and it is difficult to obtain the solution analytically. Therefore, it required Newton-Raphson
2nd BASIC 2018
Journal of Physics: Conference Series 1725 (2021) 012014
IOP Publishing
doi:10.1088/1742-6596/1725/1/012014
7
iteration. Here, the iterations never gave a convergent result. Furthermore, modification on score
functions is needed, that is, using bias vectors and information matrices to estimate parameters in the
logistic regression model. Mathematically, the purpose of modification of score functions is to change
a score function that is the first derivative matrix, to the first derivative matrix itself minus multiplication
of information matrix and biased vectors. The modification of score functions can quickly yield values
of parameter estimation. Based on the results of computations to estimate parameters, if the sample size
is small and the proportion of success events is also small, using the maximum likelihood method with
Newton-Raphson iteration will not working properly. This problem can be solved using modification of
score functions.
References
[1] Agresti A 2015 Foundations of Linear and Generalized Linear Models (Hoboken: Wiley)
[2] Hogg R V and Craig A 1995 Introduction To Mathematical Statistics 5th edition (New Jersey:
Prentice Hall)
[3] Hosmer D W and Lemeshow S 2000 Applied Logistic Regression 2nd edition (Hoboken: Wiley)
[4] Montgomery D C, Peck E A and Vinning G G 2001 Introduction to Linear Regression Analysis
3rd edition (Hoboken: John Wiley & Sons, Inc)
[5] Pawitan Y 2001 In All Likelihood: Statistical Modelling and Inference Using Likelihood
(Oxford: Oxford University Press)
[6] Badi N H S 2017 Open Access Lib. J. 4 e3625
[7] Czepiel S A 2003 Maximum likelihood Estimation of Logistic Regression Models: Theory and
Implementation available at https://czep.net/stat/mlelr.pdf
[8] Firth D 1993 Biometrika 80 27-38
... To fit the models, the functional limitation value "No" was adopted as the reference. The Wald test (Hosmer et al., 2013) was used to calculate the p values for the parameter estimation and the likelihood-ratio chi-square test (Febrianti et al., 2021) to assess model fit. Lastly, the odds ratios (ORs) obtained by exponentiation of the regression coefficients and the corresponding 95% confidence intervals (95% CIs) were reported for each model. ...
Article
Full-text available
Background Loss of the ability to perform activities of daily living (ADLs) leads to negative health outcomes such as reduced quality of life, institutionalization, and mortality. In Korea, the proportion of older adults with disabilities is increasing along with rapid population aging. Therefore, providing a comprehensive approach to the prevention and management of ADL limitations in people with disabilities is necessary. This can be accomplished by understanding the trends and factors affecting these limitations over time. Purpose This study was developed to examine the longitudinal trend and factors affecting ADL limitations over time among people with disabilities in Korea. Methods Data from 346 people with disabilities in the 2008–2020 Korean Welfare Panel Study were used. Bivariate analysis and a Kendall trend test were performed to determine the longitudinal trends for ADL limitations, and multiple logistic regression was used to evaluate whether relevant variables could predict these limitations. Results The prevalence of ADL limitations among people with disabilities increased by 16.5% over the 12 years of the study. The highest rate of increase in these limitations over time was found in people with mental disorders (27.8%), those over 65 years of age (27.3%), and those with depression (25.6%). In multiple logistic regression, the odds ratios of the variables were slightly different at each survey wave. However, severe disability and low educational level were consistently found to be associated with ADL limitations over time. Conclusions The findings provide evidence of a significant relationship between level of disability and/or educational status and ADL limitations in people with disabilities in Korea. To prevent the development of ADL limitations in people with disabilities, comprehensive identification of longitudinal trends and factors affecting ADL limitations is necessary. Early intervention, including integrated services such as home rehabilitation services to prevent ADL limitations, especially for disabled people with severe disabilities and low educational levels, has the potential to delay ADL limitations.
... 23) The Wald test was used to calculate the P-value for parameter estimation, and the likelihood-ratio chi-square (LRC) test was used to assess the model fit. 24) We report the odds ratios (ORs) obtained by exponentiating the regression coefficients and the corresponding 95% confidence intervals (95% CIs). OR represents the relative risk ratios of anxiety OR depression associated with a one-level change in the respective predictors. ...
Article
Full-text available
Background: The influx of immigrants into Korea has increased in recent years, affecting Korean society and the healthcare system. This study analyzed the frequency of anxiety and depression in immigrants, which negatively affects their quality of life. Methods: We analyzed data from a 2020 survey on the Health Rights of Migrants and the Improvement of the Medical Security System. Bivariate analyses and a multiple logistic regression model were used to identify the risk factors associated with the presence of anxiety or depression among immigrants. Results: We included 746 immigrants, 55.9% of whom were female. The overall rate of anxiety or depression was 31.77%, with 38.3% in females, which was significantly higher than the 26.62% in males. The frequency of anxiety and depression was also strongly associated with certain immigrant groups, including immigrants of African or Western Asian origin (over 64%); those with student visas (60.53%); those who self-reported poor health (52%), physical or mental disabilities (69.23%), or chronic diseases (58.43%); and those facing difficulties accessing medical services (59.47%). Conclusion: This study showed the frequency of feelings of anxiety or depression and associated risk factors among immigrants. These findings may have implications for policymakers in reducing the likelihood of developing anxiety or depression in the future and improving the quality of life of immigrants in Korea.
... The Wald test was used to calculate the P-value for parameter estimation, and the likelihood-ratio chi-square (LRC) test was used to assess model fit. 29) The "not depressed" category of depression status was designated as the reference for the model. The odds ratios (ORs) were obtained by exponentiating the regression coefficients and the corresponding 95% confidence intervals (95% CIs). ...
Article
Full-text available
Background: Depression is a mental disorder common worldwide. This study determined the relationships between demographics, health status, household parameters, and depression rates among working-age household heads. Methods: We analyzed data from the Korea Welfare Panel Study Survey conducted in 2020. The 11-item version of the Center for Epidemiologic Studies Depression Scale was used to assess depression. Bivariate analyses and a multiple logistic regression model were used to evaluate the influence of these factors on depression among household heads. Results: The overall prevalence of depression among working-age household heads was 11.69% (19.83% of females and 9.58% of males). The relative risk of depression was 1.71 times higher among the unemployed than among wage earners and 2.18 times higher among those with low income than among those with general income. The relative risk of depression was 3.23 times higher in those with poor health status than in those with good health, and 2.45 times more in those with severe disabilities than in those without disabilities. The rate of depression decreased with education level, number of family members, and presence of children but increased with the presence of the disabled or elderly. Conclusion: This study provides a comprehensive overview of depression among working-age household heads and identifies factors strongly associated with depression. These findings may have implications for policymakers to reduce the burden on and improve the quality of life of household heads.
... where P(A) designates the probability of the event A, and S w (·) is a mapping from R m to [0, 1] called score function [14]. Given a threshold θ ∈[0, 1], we decide that ...
Article
Full-text available
Logistic regression is a commonly used classification algorithm in machine learning. It allows categorizing data into discrete classes by learning the relationship from a given set of labeled data. It learns a linear relationship from the given dataset and then introduces nonlinearity through an activation function to determine a hyperplane that separates the learning points into two subclasses. In the case of logistic regression, the logistic function is the most used activation function to perform binary classification. The choice of logistic function for binary classifications is justified by its ability to transform any real number into a probability between 0 and 1. This study provides, through two different approaches, a rigorous statistical answer to the crucial question that torments us, namely where does this logistic function on which most neural network algorithms are based come from? Moreover, it determines the computational cost of logistic regression, using theoretical and experimental approaches.
... ADL and IADL disabilities at baseline were considered historical disabilities to predict future functional disability. The Wald test [34] was used to calculate the p-value for the parameter estimation and the likelihoodratio Chi-square (LRC) test [35] assessed model fit. The "No" category of ADL/IADL disabilities was designated as the reference for the models. ...
Article
Full-text available
Background The prevalence of functional disabilities, including difficulties in performing activities of daily living (ADLs) and instrumental activities of daily living (IADLs), increased significantly in recent years and burdened the healthcare system. Methods We analysed data from Korean Longitudinal Study of Aging (KLOSA) surveys, including participants aged 65 or older at baseline (2008), and participated in all 4-year follow-up periods in 2012, 2016, and 2020. A 4-year follow-up cohort study was applied to specify the change in functional disability and its trend over time among older adults. The generalized estimation equation (GEE) model was used to verify the uptrend of functional disability. Logistic regression analyses were applied to examine the influence of demographic and health parameters on the change in functional disability. Results The prevalence of ADL disability was 2.24% at baseline, increased to 3.10% after four years, 6.42% after eight years, and reached 11.12% after 12 years, five times higher than that at baseline. For IADL disability, they were 10.67%, 10.61%, 18.18%, and 25.57%, respectively. The uptrend of ADL disability in persons aged 65–74 (1.77% at baseline, increased to 7.65% in 2020, 12-year change of 5.88%) was slower than in those aged 75 or older (4.22% at baseline, increased to 25.90% in 2020, 12-year change of 21.68%). IADL disability were consistent with this. The high ADL/IADL disability rate was also present among persons with poor health status, physical inactivity, depression, dementia, and multiple chronic diseases. The relative risk of ADL/IADL disability in persons with a history of functional disability was significantly higher than in those without historical disabilities. Conclusion The study verified the change in functional disability and its upward trend over time by older adults’ demographic and health parameters. Functional disability was relatively flat tending to increase slowly during the early years but increased rapidly in the following years. Factors that strongly influenced the change in prevalence and the uptrend of functional disability were advanced age, living alone, being underweight or obese, poor health status, physical inactivity, depression, dementia, having multiple chronic diseases, and especially having a historical disability.
... Estimasi parameter koefisien regresi logistikβ k diperoleh menggunakan metode maksimum likelihood. Turunan parsial pertama dari fungsi likelihood koefisien regresi logistik ordinal tidak memiliki bentuk analitik, sehingga diperlukan optimasi secara numerik (Rifada et al., [20]; Febrianti et al., [5]). Pada penelitian ini digunakan package Regression Modelling Strategies (rms) pada perangkat lunak RStudio 2022.02.0 Build 443 untuk menghitung estimasi parameter tersebut. ...
Article
Full-text available
Pengemudi kendaraan bermotor dapat mengalami kecelakaan yang dapat mengakibatkan luka, cedera, patah tulang bahkan kematian. Tingkat kerugian yang mungkin diderita pengemudi kendaraan dapat dibagi menjadi beberapa tingkatan, seperti kerugian ringan, sedang, dan berat. Kecelakaan pada pengemudi kendaraan dipengaruhi oleh faktor manusia (internal) dan faktor kendaraan dan lingkungan (eksternal). Risiko akibat kecelakaan kendaraan bermotor untuk pengemudi dengan karakteristik yang berbeda pada umumnya tidak akan sama. Tingkat kerugian yang mungkin dialami dipengaruhi oleh karakteristik pengemudi dimana beberapa individu lebih cenderung menunjukkan perilaku mengemudi yang tidak aman dibandingkan yang lain. Dalam penelitian ini, digunakan variabel dependen tingkat kerugian material; kerugian material ringan, sedang dan berat. Selain itu digunakan variabel independen jenis kendaraan, jenis kelamin, tingkat pendidikan, pekerjaan, agama dan usia. Analisis dilakukan dengan metode regresi logistik ordinal menggunakan data kecelakaan lalu lintas di Kabupaten Klungkung Provinsi Bali. Berdasarkan hasil penelitian, variabel independen jenis kendaraan, jenis kelamin, tingkat pendidikan, agama dan usia berpengaruh terhadap tingkat kerugian material dengan ketepatan klasifikasi 76,27%.
Article
Full-text available
HIV is a virus that infects the immune system cells, thereby damaging the human immune system. AIDS is a collection of symptoms that arise due to the compromised immune system of the human body as a result of a positive infection by the HIV virus. HIV/AIDS remains a complex and significant global health issue. Despite advancements in treatment and prevention, the severity of HIV/AIDS remains a primary focus in healthcare management efforts. This study aims to determine the factors influencing the severity of HIV/AIDS patients at the Regional Hospital of Idaman Banjarbaru using ordinal logistic regression analysis. Ordinal logistic regression is employed to understand the relationship between the dependent variable (severity of the disease) and independent variables, where the dependent variable is ordinal in scale. The data used for this analysis is secondary data extracted from the inpatient medical records of the Idaman Banjarbaru Regional Hospital, comprising a total of 68 cases of HIV/AIDS. Assumed factors influencing the severity of patients include gender, age, duration of hospitalization, education, employment status, marital status, and place of residence. The analysis results indicate a significant relationship between the severity of HIV/AIDS patients and marital status. The highest likelihood of patients experiencing HIV/AIDS is in the divorced response category with a stage 3 category, where the probability value is 0.943. Individuals in the married and divorced categories are 1.53 times more likely to experience HIV/AIDS with a stage 4 status and complications ranging from 3 to 5. Keywords: Severity of Disease, HIV/AIDS, Ordinal Logistic Regression, Odds Ratio
Article
Game theory-based decision-making model provides an effective means to enable intelligent and human-like Mandatory Lane Change (MLC), which is closely linked to driving safety and efficiency. However, current relevant models have limitations, such as imperfect game structure and incomplete information considered in payoff definitions, with the root cause of ignoring differences in driving styles between interacting vehicles, which are directly related to the acceptable safety thresholds of drivers. To address this issue, this study presents a novel game theory-based decision-making strategy, considering diverse driving styles, achieved by constructing a game with a variable structure according to the Relative Driving Style (RDS) between vehicles. Validation of the Next Generation SIMulation (NGSIM) dataset shows that the proposed decision-making strategy achieves an average accuracy of 98%, which is superior to that of existing single-type game theory-based algorithms.
Article
Full-text available
An understanding of factors that affect the recovery time from a disease is important for the community, medical staff, and also the government. This research analyzed factors that affect the recovery time of Covid-19 sufferers in West Sumatra. In addition, the consumption of a herbal made from Sungkai leaves, which is believed by some people in West Sumatra to accelerate the healing from Covid-19, was also included in the analysis. The recovery time here was categorized into two classes (binary): 1 for within 2 weeks, and 0 for more than 2 weeks. The methods used were logistic regression and geographically weighted logistic regression (GWLR). GWLR provides estimates of parameters for each location. The data used in this study is Covid-19 data of 2021 taken from the Regional Research and Development Agency (Litbangda) of West Sumatra with a total of 764 observations collected from 19 regencies/cities in West Sumatra. The results showed that there was no difference between the logistic regression model and the GWLR models based on the values of AIC and the ratio of deviance and degrees of freedom (df). The addition of spatial factors through GWLR models did not provide additional information regarding the recovery of Covid-19 sufferers within 2 weeks or more than 2 weeks. The logistic regression model gives the result that, at significance level α = 10%, residence, vaccination status, and symptoms significantly affect the recovery time within 2 weeks or more for Covid-19 sufferers, while other variables, namely sex, age, Sungkai leaves consumption status, and ginger consumption status have no significant effects.
Article
It is shown how, in regular parametric problems, the first-order term is removed from the asymptotic bias of maximum likelihood estimates by a suitable modification of the score function. In exponential families with canonical parameterization the effect is to penalize the likelihood by the Jeffreys invariant prior. In binomial logistic models, Poisson log linear models and certain other generalized linear models, the Jeffreys prior penalty function can be imposed in standard regression software using a scheme of iterative adjustments to the data.
Article
This article presents an overview of the logistic regression model for dependent variables having two or more discrete categorical levels. The maximum likelihood equations are derived from the probability distribution of the dependent variables and solved using the Newton-Raphson method for nonlinear systems of equations. Finally, a generic implementation of the algorithm is discussed.
  • N H S Badi
Badi N H S 2017 Open Access Lib. J. 4 e3625