Article

Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods.

Department of Biomedical Engineering, Hanyang University, Seoul, Republic of Korea.
Artificial Intelligence in Medicine (impact factor: 1.35). 02/2008; 42(1):37-53. DOI:10.1016/j.artmed.2007.09.005
Source: DBLP

ABSTRACT Diabetic nephropathy is damage to the kidney caused by diabetes mellitus. It is a common complication and a leading cause of death in people with diabetes. However, the decline in kidney function varies considerably between patients, and the determinants of diabetic nephropathy have not been clearly identified. It is therefore very difficult to predict the onset of diabetic nephropathy accurately with simple statistical approaches such as the t-test or the chi-square test. To predict the onset accurately, we applied machine learning techniques, including support vector machine (SVM) classification and feature selection methods, to an irregular and unbalanced diabetes dataset. A second objective was to visualize the risk factors so that physicians receive intuitive information on each patient's clinical pattern.
We collected medical data from 292 patients with diabetes and preprocessed the irregular data to extract 184 features. To predict the onset of diabetic nephropathy, we compared several classification methods, including logistic regression, SVM, and SVM with cost-sensitive learning. We also applied several feature selection methods to remove redundant features and improve classification performance. For risk factor analysis with SVM classifiers, we developed a new visualization system that uses a nomogram approach.
Linear SVM classifiers combined with wrapper or embedded feature selection methods showed the best results. Among the 184 features, the classifiers selected the same 39 features and achieved an area under the receiver operating characteristic (ROC) curve of 0.969. The visualization tool presented the effect of each feature on the decision as graphical output.
Our proposed method can predict the onset of diabetic nephropathy about 2-3 months before the actual diagnosis, with high prediction performance on an irregular and unbalanced dataset, which statistical methods such as the t-test and logistic regression could not achieve. Additionally, the visualization system provides physicians with intuitive information for risk factor analysis. Physicians can therefore benefit from an automatic early warning for each patient and from visualized risk factors, which facilitates the planning of effective and appropriate treatment strategies.
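As a rough illustration of two ideas mentioned in the abstract, the sketch below computes per-class weights for an unbalanced label set and the area under the ROC curve for a set of classifier scores. The abstract does not state which cost scheme the authors used, so the inverse-frequency weighting here is only an assumed, commonly used heuristic. The function names and the toy labels and scores are hypothetical and are not taken from the study.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Assumed heuristic: weight = n_samples / (n_classes * class_count).

    Rarer classes receive larger weights, so misclassifying them costs more.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos
        for q in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 6 negatives (label 0) and 2 positives (label 1).
labels = [0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.1, 0.2, 0.3, 0.35, 0.4, 0.8, 0.7, 0.9]

weights = balanced_class_weights(labels)
auc = roc_auc(labels, scores)
print(weights)
print(auc)
```

The pairwise AUC is quadratic in the number of samples, which is acceptable for a cohort of a few hundred patients. A rank-based formulation would be preferable for much larger datasets.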



Keywords

2-3 months; classification performance; Diabetic nephropathy; facilitate planning; feature selection methods; graphical output; intuitive information; kidney function varies; Linear SVM classifiers; new visualization system; nomogram approach; patient's clinical pattern; physicians intuitive information; prediction performance; proper treatment strategies; risk factor analysis; simple statistical approaches; support vector machine; unbalanced diabetes dataset; visualize risk factors

Baek Hwan Cho