Determination of similarity measures for pairs of mass lesions on mammograms by use of BI-RADS lesion descriptors and image features.

Kurt Rossmann Laboratories for Radiologic Image Research, Department of Radiology, The University of Chicago, Chicago, IL, USA.
Academic radiology (Impact Factor: 2.08). 05/2009; 16(4):443-9. DOI: 10.1016/j.acra.2008.10.012
Source: PubMed

ABSTRACT To determine similarity measures for selection of pathology-known similar images that would be useful for radiologists as a reference guide in the diagnosis of new breast lesions on mammograms.
The images were obtained from the Digital Database for Screening Mammography developed by the University of South Florida. For determination and evaluation of similarity measures, the "gold standard" of similarities for 300 pairs of masses was determined by 10 breast radiologists. For determining similarity measures that would agree with radiologists' similarity determination, an artificial neural network (ANN) was trained with the radiologists' subjective similarity ratings and the image features. The image features were determined subjectively using the Breast Imaging Reporting and Data System (BI-RADS) lesion descriptors and objectively by computerized image analysis. The similarity measures determined by the ANN were compared to the gold standard and evaluated in terms of the correlation coefficient.
The similarity measures determined using the BI-RADS descriptors only were not as useful as those determined by use of the image features only. When the BI-RADS margin ratings were combined with the image features, the correlation coefficient between the subjective ratings and the objective measures improved slightly (r = 0.76) compared to those based on the image features alone (r = 0.74).
The inclusion of the BI-RADS margin descriptors may be useful for determination of similarity measures, especially when it is difficult to obtain the manual outlines of the masses and if the BI-RADS descriptors were provided consistently by radiologists.

1 Follower
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We have been developing a computerized scheme for selecting visually similar images that would be useful to radiologists in the diagnosis of masses on mammograms. Based on the results of the observer performance study, the presentation of similar images was useful, especially for less experienced observers. The test cases included 50 benign and 50 malignant masses. Ten observers, including five breast radiologists and five residents, were asked to provide the confidence level of the lesions being malignant before and after the presentation of similar images. By use of multireader, multi-case receiver operating characteristic analysis, the average areas under the curves for the five residents were 0.880 and 0.896 without and with similar images, respectively (p=0.040). There were four malignant cases in which the initial ratings were relatively low, but the similar images alerted the residents to increase their confidence levels of malignancy close to those by the breast radiologists. The presentation of similar images may cause some observers falsely to increase their suspicion for some benign cases; however, if similar images can alert radiologists to recognize the signs of malignancy and also help them to decrease their suspicion correctly for some benign cases, they can be useful in the diagnosis on mammograms.
    Proceedings of SPIE - The International Society for Optical Engineering 02/2009; DOI:10.1117/12.811447 · 0.20 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: We conducted an observer study to investigate how the data collection method affects the efficacy of modeling individual radiologists' judgments regarding the perceptual similarity of breast masses on mammograms. Six observers of varying experience levels in breast imaging were recruited to assess the perceptual similarity of mammographic masses. The observers' subjective judgments were collected using (i) a rating method, (ii) a preference method, and (iii) a hybrid method combining rating and ranking. Personalized user models were developed with the collected data to predict observers' opinions. The relative efficacy of each data collection method was assessed based on the classification accuracy of the resulting user models. The average accuracy of the user models derived from data collected with the hybrid method was 55.5 ± 1.5%. The models were significantly more accurate (P < .0005) than those derived from the rating (45.3 ± 3.5%) and the preference (40.8 ± 5%) methods. On average, the rating data collection method was significantly faster than the other two methods (P < .0001). No time advantage was observed between the preference and the hybrid methods. A hybrid method combining rating and ranking is an intuitive and efficient way for collecting subjective similarity judgments to model human perceptual opinions with a higher accuracy than other, more commonly used data collection methods.
    Academic radiology 11/2013; 20(11):1371-1380. DOI:10.1016/j.acra.2013.08.002 · 2.08 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Presentation of similar reference images can be useful for diagnosis of new lesions. A similarity map which can visually present the overview of the relationship between the lesions with different types may provide the supplemental information to the reference images. A new method for constructing the similarity map by multidimensional scaling (MDS) for breast masses on mammograms was investigated. Nine pathologic types were included; three regions of interests each from the nine groups were employed in this study. Subjective similarity ratings by expert readers were obtained for all possible 351 pairs of masses. Using the average ratings, MDS similarity map was created. Each axis of the MDS configuration was fitted by the linear model with 13 image features to reconstruct the similarity map. Dissimilarity based on the distance in the reconstructed space was determined and compared with the subjective rating. The MDS map consistently represented the similarity between cysts and fibroadenomas, invasive lobular carcinomas and scirrhous carcinomas, and ductal carcinomas in situ, solid-tubular carcinomas, and papillotubular carcinomas with the experts' data. The correlation between the average subjective ratings and the dissimilarities based on the distance in the reconstructed feature space was much greater (-0.87) than that of the dissimilarities based on the distance in the conventional feature space (-0.65). The new similarity map by MDS can be useful for visualizing the relationship between breast masses with different pathologic types. It has potential usefulness in selecting the similarity measures and providing the supplemental information.
    Journal of Digital Imaging 01/2013; 26(4). DOI:10.1007/s10278-012-9569-0 · 1.20 Impact Factor