Pathologic grading for prognostic stratification of neuroendocrine tumors (NETs) is critical but presents a challenging interpretive dilemma. Tumor cell proliferative rate is an important factor in the determination of prognosis, and immunohistochemical analysis with Ki67 is becoming more widely used to quantify the proliferative rate. However, Ki67 assessment has limitations due to lack of uniformity and consistency in quantification. These limitations are accentuated in well-differentiated NETs, as differences in the range of 1% to 5% can alter tumor grade, with potential implications for treatment. We therefore performed a concordance study to assess different Ki67 quantification techniques including: (a) digital image analysis (DIA); (b) manual counting (MC) of >2000 cells; and (c) "eyeballed" estimate (EE) of labeling percentage by pathologists (n=18), including individuals experienced in evaluating Ki67 labeling as well as others who had little prior experience assessing Ki67 percentages. Forty-five Ki67 images were selected and analyzed using the 3 methods. On the basis of the recommendations of the World health Organization (WHO) for grading NETs, MC of 2000 cells was used as the "gold standard" reference against which the other techniques were compared. Three images were presented twice, the second being inverted, to assess intraobserver consistency. Statistical analyses were performed to evaluate: (a) the concordance between methods; (b) intraobserver and interobserver consistency; and (c) correlation of NET grades on the basis of Ki67 scores by EE versus the gold standard. Agreement between scores was assessed by intraclass correlation (ICC). DIA and MC were highly concordant (ICC=0.98). The ICC between DIA and the mean EE of all observers was 0.88. However, there was discordance among individual observers on all cases quantified by EE (ICC=0.13). The ICC for intraobserver consistency was 0.39±0.26. With Ki67 in the ranges of <1%, 2% to 3%, and >20%, the mean of Ki67 by EE was, respectively, 93%±2%, 55%±7%, and 55%±15% correct against the gold standard. The κ statistics for EE exhibited low agreement (κ=0.24; 95% confidence interval, 0.23-0.25) for all WHO NET grades. Incorrect assessment by EE resulted in upgrading of all WHO G1 group tumors (n=14); in the WHO G2 group, downgrading of 41% cases occurred (n=11) when Ki67 was <5% (by DIA or MC), and upgrading of 59% cases occurred (n=16) when Ki67 was >5%. We conclude that DIA and MC are the acceptable standards for Ki67 assessment. Given the inherent discordance in determining the grade, the use of an approximate EE of the Ki67-labeling index requires critical reevaluation, especially for NETs with a labeling index straddling the cut-points between grades. Consequently, determination of therapeutic strategies should be guided by an amalgamation of clinicopathologic characteristics, including but not limited to the Ki67 index.