David Dagan Feng

Shanghai Jiao Tong University, Shanghai, Shanghai Shi, China

Publications (252) · 175.5 Total impact

  • ABSTRACT: Content-based medical image retrieval (CBMIR) is an active research area for disease diagnosis and treatment, but it can be problematic given the small visual variations between anatomical structures. We propose a retrieval method based on a bag-of-visual-words (BoVW) model to identify discriminative characteristics between different medical images with a Pruned Dictionary based on Latent Semantic Topic description. We refer to this as the PD-LST retrieval. Our method has two main components. First, we calculate a topic-word significance value for each visual word given a certain latent topic to evaluate how the word is connected to this latent topic. The latent topics are learnt based on the relationship between the images and words, and are employed to bridge the gap between low-level visual features and high-level semantics. These latent topics describe the images and words semantically and can thus facilitate more meaningful comparisons between the words. Second, we compute an overall-word significance value to evaluate the significance of a visual word within the entire dictionary. We designed an iterative ranking method to measure overall-word significance by considering the relationship between all latent topics and words. The words with higher values are considered meaningful, with more significant discriminative power in differentiating medical images. We evaluated our method on two public medical imaging datasets and it showed improved retrieval accuracy and efficiency.
    No preview · Article · Nov 2015 · Neurocomputing
  • Yangyu Fan · David Dagan Feng · Renjie He · Zhiyong Wang
    ABSTRACT: Existing single image dehazing techniques have limitations on estimating accurate transmission maps for pixels which do not meet the assumption of the dark channel prior. A combined constraint is presented for single image dehazing by taking both geometric and natural characteristics of outdoor images into consideration. The combined constraint can be applied to various regularisation-based optimisation frameworks. Both qualitative and quantitative comparisons with the state-of-the-art dehazing approaches indicate the superiority of the proposed constraint.
    No preview · Article · Oct 2015 · Electronics Letters
  • Fredro Harjanto · Zhiyong Wang · Shiyang Lu · Ah Chung Tsoi · David Dagan Feng
    ABSTRACT: Human action recognition from videos is very important for visual analytics. Due to the increasing abundance of diverse video content in the era of big data, research on human action recognition has recently shifted towards more challenging and realistic settings. Frame rate is one of the key issues in diverse and realistic video settings. While there have been several evaluation studies investigating different aspects of action recognition, such as different visual descriptors, the frame rate issue has seldom been addressed in the literature. Therefore, in this paper, we investigate the impact of frame rate on human action recognition with several state-of-the-art approaches and three benchmark datasets. Our experimental results indicate that those state-of-the-art approaches are not robust to variations in frame rate. As a result, more robust visual features and advanced learning algorithms are required to further improve human action recognition performance towards more practical deployment. In addition, we investigate key-frame selection techniques for choosing a set of suitable frames from an action sequence for action recognition. Promising results indicate that well-designed key-frame selection methods can produce a set of representative frames and eventually reduce the impact of frame rate on the performance of human action recognition.
    No preview · Article · Aug 2015
  • Wei Zou · Jiajun Wang · David Dagan Feng · Erxi Fang
    ABSTRACT: The analysis of fluorescence molecular tomography is important for medical diagnosis and treatment. Although the quality of reconstructed results can be improved with the increasing number of measurement data, the scale of the matrices involved in the reconstruction of fluorescence molecular tomography will also become larger, which may slow down the reconstruction process. A new method is proposed where measurement data are reduced according to the rows of the Jacobian matrix and the projection residual error. To further accelerate the reconstruction process, the global inverse problem is solved with level-by-level Schur complement decomposition. Simulation results demonstrate that the speed of the reconstruction process can be improved with the proposed algorithm. © 2015 Society of Photo-Optical Instrumentation Engineers (SPIE).
    No preview · Article · Jul 2015 · Optical Engineering
  • Yang Song · Weidong Cai · Fan Zhang · Heng Huang · Yun Zhou · David Dagan Feng
    ABSTRACT: Bone texture characterization is important for differentiating osteoporotic and healthy subjects. Automated classification is however very challenging due to the high degree of visual similarity between the two types of images. In this paper, we propose to describe the bone textures by extracting dense sets of local descriptors and encoding them with the improved Fisher vector (IFV). Compared to the standard bag-of-visual-words (BoW) model, Fisher encoding is more discriminative by representing the distribution of local descriptors in addition to the occurrence frequencies. Our method is evaluated on the ISBI 2014 challenge dataset of bone texture characterization, and we demonstrate excellent classification performance compared to the challenge entries and large improvement over the BoW model.
    No preview · Article · Jul 2015
  • Yang Song · Weidong Cai · Heng Huang · Yun Zhou · Yue Wang · David Dagan Feng
    ABSTRACT: In this paper, we propose a new Locality-constrained Subcluster Representation Ensemble (LSRE) model, to classify high-resolution computed tomography (HRCT) images of interstitial lung diseases (ILDs). Medical images normally exhibit large intra-class variation and inter-class ambiguity in the feature space. Modelling of feature space separation between different classes is thus problematic and this affects the classification performance. Our LSRE model tackles this issue in an ensemble classification construct. The image set is first partitioned into subclusters based on spectral clustering with approximation-based affinity matrix. Basis representations of the test image are then generated with sparse approximation from the subclusters. These basis representations are finally fused with approximation- and distribution-based weights to classify the test image. Our experimental results on a large HRCT database show good performance improvement over existing popular classifiers. Copyright © 2015 Elsevier B.V. All rights reserved.
    No preview · Article · Mar 2015 · Medical Image Analysis
  • ABSTRACT: Content-based image retrieval (CBIR) has been applied to a variety of medical applications, e.g., pathology research and clinical decision support, and the bag-of-features (BOF) model is one of the most widely used techniques. In this study, we address the problem of vocabulary pruning to reduce the influence of redundant and noisy visual words. The conditional probability of each word given the hidden topics extracted using probabilistic Latent Semantic Analysis (pLSA) is first calculated. A ranking method is then proposed to compute the significance of the words based on the relationship between the words and topics. Experiments on the publicly available Early Lung Cancer Action Program (ELCAP) database show that the method can reduce the number of words required while improving the retrieval performance. The proposed method is applicable to general image retrieval since it is independent of the problem domain.
    No preview · Chapter · Feb 2015
  • Siqi Liu · Sidong Liu · Weidong Cai · Sonia Pujol · Ron Kikinis · David Dagan Feng
    ABSTRACT: Feature learning with high-dimensional neuroimaging features has been explored for applications in neurodegenerative diseases. Low-dimensional biomarkers, such as mental status test scores and cerebrospinal fluid level, are essential in the clinical diagnosis of neurological disorders, because they can be simple and effective for clinicians to assess a disorder’s progression and severity. Rather than only using the low-dimensional biomarkers as inputs for decision-making systems, we believe that such low-dimensional biomarkers can be used to enhance the feature learning pipeline. In this study, we proposed a novel feature representation learning framework, Multi-Phase Feature Representation (MPFR), with low-dimensional biomarkers embedded. MPFR learns high-level neuroimaging features by extracting the associations between the low-dimensional biomarkers and the high-dimensional neuroimaging features with a deep neural network. We validated the proposed framework using the Mini-Mental State Examination (MMSE) scores as a low-dimensional biomarker and multi-modal neuroimaging data as the high-dimensional neuroimaging features from the ADNI baseline cohort. The proposed approach outperformed the original neural network in both binary and ternary Alzheimer’s disease classification tasks.
    No preview · Chapter · Feb 2015
  • Shaohui Mei · Genliang Guan · Zhiyong Wang · Shuai Wan · Mingyi He · David Dagan Feng
    ABSTRACT: The rapid growth of video data demands both effective and efficient video summarization methods so that users are empowered to quickly browse and comprehend a large amount of video content. In this paper, we formulate the video summarization task with a novel minimum sparse reconstruction (MSR) problem. That is, the original video sequence can be best reconstructed with as few selected keyframes as possible. Different from the recently proposed convex relaxation based sparse dictionary selection method, our proposed method utilizes the true sparse constraint, the L0 norm, instead of the relaxed constraint, the L2,1 norm, such that keyframes are directly selected as a sparse dictionary that can well reconstruct all the video frames. An on-line version is further developed owing to the real-time efficiency of the proposed MSR principle. In addition, a percentage of reconstruction (POR) criterion is proposed to intuitively guide users in obtaining a summary with an appropriate length. Experimental results on two benchmark datasets with various types of videos demonstrate that the proposed methods outperform the state of the art.
    No preview · Article · Feb 2015 · Pattern Recognition
  • Peng Fu · Changyang Li · Quansen Sun · Weidong Cai · David Dagan Feng
    ABSTRACT: Accurate estimation of noise level in images plays an important role in different image processing applications. Current algorithms can precisely estimate noise in smooth images, but it remains a challenge to estimate the noise level in richly textured images. In this paper, we propose a new adaptive superpixel classification algorithm for noise estimation in complicated textured images. Firstly, our new superpixel algorithm adapts the finite Gaussian clustering approach, which can better approximate homogeneous patches in noisy images. Then noise information is obtained locally from each superpixel patch. Finally, the best estimation of noise level is calculated with a statistical approach. Experimental results with various kinds of images demonstrate that our method is more accurate and robust than five existing commonly used algorithms.
    No preview · Article · Jan 2015
  • ABSTRACT: Medical images usually exhibit large intra-class variation and inter-class ambiguity in the feature space, which could affect classification accuracy. To tackle this issue, we propose a new Large Margin Local Estimate (LMLE) classification model with sub-categorization based sparse representation. We first sub-categorize the reference sets of different classes into multiple clusters, to reduce feature variation within each subcategory compared to the entire reference set. Local estimates are generated for the test image using sparse representation with reference subcategories as the dictionaries. The similarity between the test image and each class is then computed by fusing the distances with the local estimates in a learning-based large margin aggregation construct to alleviate the problem of inter-class ambiguity. The derived similarities are finally used to determine the class label. We demonstrate that our LMLE model is generally applicable to different imaging modalities, and apply it to three tasks: interstitial lung disease (ILD) classification on high-resolution computed tomography (HRCT) images, and phenotype binary classification and continuous regression on brain magnetic resonance (MR) imaging. Our experimental results show statistically significant performance improvements over existing popular classifiers.
    No preview · Article · Jan 2015 · IEEE Transactions on Medical Imaging
  • Lelin Zhang · Zhiyong Wang · Tao Mei · David Dagan Feng

    No preview · Article · Jan 2015 · IEEE Transactions on Knowledge and Data Engineering
  • Lin Shu · Xiaoming Tao · David Dagan Feng
    ABSTRACT: Resistive sensor arrays have been increasingly adopted in wearable electronic applications, which require low-complexity and low-energy circuits. However, current readout strategies for resistive sensor arrays require additional electrical components, such as transistors, diodes, multiplexers, op-amps, switches, current sources, and A/D converters, leading to a considerable increase in circuit complexity, power consumption, system instability, and crosstalk error. To address the problem, this paper proposes a new approach, which determines sensor resistance values by establishing and solving resistance matrix equations of sensor arrays. Unlike conventional approaches, it allows crosstalk currents in arrays, avoiding the additional components that are otherwise used for eliminating crosstalk currents and minimizing crosstalk error. Meanwhile, it takes advantage of on-chip resources of wearable platforms, thereby reducing redundant chips. It was implemented on a prototype 10 × 10 textile resistive sensor array, which was used in a sensing cushion for sitting pressure monitoring of chair-bound people. Experimental results on this array platform showed that the new approach achieved a satisfactory accuracy (0.61% ± 0.41%), as well as a low crosstalk error (2.77% ± 0.61%). The fabricated sensing cushion also exhibited a relatively low pressure measurement error (6.30% ± 0.75%). Compared with other approaches, the proposed approach demonstrated the lowest circuit complexity on a microcontroller-based wearable platform, and a sufficient sensor capacity. It is ideal for a wide range of applications like wearable or implantable sensing, presenting a reference for the design of low-complexity and low-crosstalk-error wearable systems based on resistive sensor arrays.
    No preview · Article · Jan 2015 · IEEE Sensors Journal
  • Peng Fu · Changyang Li · Yong Xia · Zexuan Ji · Quansen Sun · Weidong Cai · David Dagan Feng
    ABSTRACT: Accurate approximation of noise in hyperspectral (HS) images plays an important role in better visualization and image processing. Conventional algorithms often hypothesize the noise type to be either purely additive or of a mixed noise type for the signal-dependent (SD) noise component and the signal-independent (SI) noise component in HS images. This can result in application-driven algorithm design and limited use in different noise types. Moreover, as the highly textured HS images have abundant edges and textures, existing algorithms may fail to produce accurate noise estimation. To address these challenges, we propose a noise estimation algorithm that can adaptively estimate both purely additive noise and mixed noise in HS images with various complexities. First, homogeneous areas are automatically detected using a new region-growing-based approach, in which the similarity of two pixels is calculated by a robust spectral metric. Then, the mixed noise variance of each homogeneous region is estimated based on multiple linear regression technology. Finally, intensities of the SD and SI noise are obtained with a modified scatter plot approach. We quantitatively evaluated our algorithm on the synthetic HS data. Compared with the benchmarking and state-of-the-art algorithms, the proposed algorithm is more accurate and robust when facing images with different complexities. Experimental results with real Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) images further demonstrated the superiority of our algorithm.
    No preview · Article · Oct 2014 · Applied Optics
  • Yang Song · Weidong Cai · Heng Huang · Yun Zhou · David Dagan Feng · Mei Chen
    ABSTRACT: Medical images typically exhibit complex feature space distributions due to high intra-class variation and inter-class ambiguity. Monolithic classification models are often problematic. In this study, we propose a novel Large Margin Local Estimate (LMLE) method for medical image classification. In the first step, the reference images are subcategorized, and local estimates of the test image are computed based on the reference subcategories. In the second step, the local estimates are fused in a large margin model to derive the similarity level between the test image and the reference images, and the test image is classified accordingly. For evaluation, the LMLE method is applied to classify image patches of different interstitial lung disease (ILD) patterns on high-resolution computed tomography (HRCT) images. We demonstrate promising performance improvement over the state-of-the-art.
    No preview · Article · Sep 2014
  • ABSTRACT: Neuroimaging has played an important role in non-invasive diagnosis and differentiation of neurodegenerative disorders, such as Alzheimer's disease and Mild Cognitive Impairment. Various types of features have been extracted from the neuroimaging data to characterize the disorders, and these features can be roughly divided into global and local features. Recent studies show a tendency to use local features in disease characterization, since they are capable of identifying the subtle disease-specific patterns associated with the effects of the disease on the human brain. However, problems arise if the neuroimaging database involves multiple disorders or progressive disorders, because disorders of different types or at different progressive stages might exhibit different degenerative patterns. It is difficult for researchers to reach consensus on which brain regions could best distinguish multiple disorders or multiple progression stages. In this study we proposed a Multi-Channel pattern analysis approach to identify the most discriminative local brain metabolism features for neurodegenerative disorder characterization. We compared our method to the global methods and other pattern analysis methods based on clinical expertise or test statistics. The preliminary results suggested that the proposed Multi-Channel pattern analysis method outperformed other approaches in Alzheimer's disease characterization, and also provided important insights into the underlying pathology of Alzheimer's disease and Mild Cognitive Impairment.
    No preview · Article · Sep 2014 · Computerized Medical Imaging and Graphics
  • Gillian Ng · Yang Song · Weidong Cai · Yun Zhou · Sidong Liu · David Dagan Feng
    ABSTRACT: With the increasing amount of image data available for cancer staging and diagnosis, it is clear that content-based image retrieval techniques are becoming more important to assist physicians in making diagnoses and tracking disease. Domain-specific feature descriptors have been previously shown to be effective in the retrieval of lung tumors. This work proposes a method to improve the rotation invariance of the hierarchical spatial descriptor, and presents a new binary descriptor for the retrieval of lung nodule images. The descriptors were evaluated on the ELCAP public access database, exhibiting good performance overall.
    Full-text · Conference Paper · Aug 2014
  • Genliang Guan · Zhiyong Wang · Shaohui Mei · Max Ott · Mingyi He · David Dagan Feng
    ABSTRACT: While most existing video summarization approaches aim to identify important frames of a video from either a global or local perspective, we propose a top-down approach consisting of scene identification and scene summarization. For scene identification, we represent each frame with global features and utilize a scalable clustering method. We then formulate scene summarization as choosing those frames that best cover a set of local descriptors with minimal redundancy. In addition, we develop a visual word-based approach to make our approach more computationally scalable. Experimental results on two benchmark datasets demonstrate that our proposed approach clearly outperforms the state-of-the-art.
    No preview · Article · Aug 2014 · ACM Transactions on Multimedia Computing Communications and Applications
  • Lei Bi · Jinman Kim · David Dagan Feng · Michael Fulham
    ABSTRACT: Fluorodeoxyglucose positron emission tomography - computed tomography (FDG PET-CT) is the preferred image modality for lymphoma diagnosis. Sites of disease generally appear as foci of increased FDG uptake. Thresholding methods are often applied to robustly separate these regions. However, their main limitation is that they also include sites of FDG excretion and physiological FDG uptake, which we define as FEPU; sites of FEPU include the bladder, renal papillae, ureters, brain, heart and brown fat. FEPU can make image interpretation problematic. The ability to identify and label FEPU sites and separate them from abnormal regions is an important process that could improve image interpretation. We propose a new method to automatically separate and label FEPU sites from the thresholded PET images. Our method is based on the selective use of features extracted from data types comprising PET, CT and PET-CT. Our FEPU classification of 43 clinical lymphoma patient studies revealed higher accuracy when compared to non-selective image features.
    No preview · Conference Paper · Aug 2014
  • Younhyun Jung · Jinman Kim · Michael Fulham · David Dagan Feng
    ABSTRACT: Multi-modality positron emission tomography and computed tomography (PET-CT) imaging depicts biological and physiological functions (from PET) within a higher resolution anatomical reference frame (from CT). The need to efficiently assimilate the information from these co-aligned volumes simultaneously has resulted in 3D visualisation methods that depict, e.g., a slice of interest (SOI) from PET combined with direct volume rendering (DVR) of CT. However, because DVR renders the whole volume, regions of interest (ROIs) such as tumours that are embedded within the volume may be occluded from view. Volume clipping is typically used to remove occluding structures by 'cutting away' parts of the volume; this involves tedious trial-and-error tweaking of the clipping until a satisfactory visualisation is obtained, thus restricting its application. Hence, we propose a new automated opacity-driven volume clipping method for PET-CT using DVR-SOI visualisation. Our method dynamically calculates the volume clipping depth by considering the opacity information of the CT voxels in front of the PET SOI, thereby ensuring that only the relevant anatomical information from the CT is visualised while not impairing the visibility of the PET SOI. We outline the improvements of our method when compared to conventional 2D and traditional DVR-SOI visualisations.
    No preview · Conference Paper · Aug 2014

Publication Stats

2k Citations
175.50 Total Impact Points


  • 2009-2015
    • Shanghai Jiao Tong University
      Shanghai, Shanghai Shi, China
  • 2000-2015
    • University of Sydney
      • School of Information Technologies
      • Biomedical and Multimedia Information Technology Research Group (BMIT)
      Sydney, New South Wales, Australia
    • Northwestern Polytechnical University
      Xi’an, Shaanxi, China
  • 2014
    • Yale University
      • Department of Diagnostic Radiology and Pediatric Diagnostic Radiology
      New Haven, Connecticut, United States
    • Budapest University of Technology and Economics
      Budapest, Budapest, Hungary
  • 1998-2012
    • The Hong Kong Polytechnic University
      • Department of Electronic and Information Engineering
      • Department of Electrical Engineering
      Hong Kong, Hong Kong
  • 2009-2010
    • The University of Hong Kong
      Hong Kong, Hong Kong
  • 2008
    • Royal Prince Alfred Hospital
      • Department of PET & Nuclear Medicine
      Camperdown, New South Wales, Australia
  • 2005
    • Heilongjiang University
      Harbin, Heilongjiang Sheng, China