Issues in optimising and standardising the accuracy and utility of the colposcopic examination in the HPV era
Mark Schiffman and Nicolas Wentzensen
Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, USA 9609 Medical Center Drive, Rockville, MD 20850
Correspondence to: Mark Schiffman. Email: schiffmm@mail.nih.gov
Abstract
For this tribute to Mario Sideri, we reviewed some of the current issues in colposcopy, many of which we were researching with him. The review concentrates on the impact of HPV testing on cervical screening, specifically on the practice of colposcopy as the major diagnostic procedure in cervical screening programmes. Topics include the changing population of women referred to colposcopy, evolving views of the colposcopic impression, differing approaches to directed and random biopsy, issues in teaching colposcopy using static images, and the development of colposcopy aids, and simplified visual assessment techniques.
Keywords: cervix, screening, colposcopy, HPV
Copyright: © the authors; licensee ecancermedicalscience. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Published: 29/04/2015; Received: 15/11/2014
Colposcopic examination and directed biopsies remain the major diagnostic procedure in cervical cancer screening. But since cervical screening is changing because of advances in the understanding of cervical carcinogenesis, a scrutiny of the practice of colposcopy in the human papillomavirus (HPV) era is therefore warranted.
Conventional cervical cancer screening programmes typically have the following steps. 1) screening: periodic screening tests in the general population to find women at risk; 2) initial management of non-normal screening results: colposcopic examination and biopsy of women with definite screening abnormalities (e.g. low- or high-grade lesion), or intensified repeated cytology/HPV testing to triage women who screen equivocally positive (e.g. ASCUS); 3) post-colposcopic management: treatment of precancer/cancer found on colposcopic biopsy, or intensified re-testing of women with no cancer or precancerous lesions (at least initially) found; and 4) post-treatment management: follow-up of treated women [1–4]. Whereas screening, triage, post-colposcopic management, and even post-treatment management protocols are evolving based on new knowledge about HPV and cervical carcinogenesis, the colposcopic examination itself has changed little. It remains a visually-guided estimation of the severity of cervical pathology, with biopsy of lesions suspected to represent cancer or its precursors. In some ways, colposcopy lags behind the HPV-related technical advances that promise to modernise cervical screening programmes. We will review some of these issues, which were of major interest to our colleague Mario Sideri to whom this manuscript is dedicated [5–10]. The comments are general, but refer especially to the US setting; thus we do not discuss topics such as screening colposcopy.
HPV testing is being increasingly introduced in primary cervical cancer screening and management of screening abnormalities [11, 12]. HPV is the required cause of almost all cervical cancers; HPV infection is the first step in the multi-step progression to cervical cancer. Therefore, HPV-based screening allows detecting lesions earlier than cytology. Because of this change in screening and triage protocols, the population of women referred to colposcopy is changing. Colposcopy was originally designed for diagnosis of cancer and severe precancerous lesions requiring treatment. These relatively uncommon, very severe lesions are almost universally HPV-positive and are evident on colposcopy, but do not constitute the majority of examinations today. More minor abnormalities form the bulk of colposcopic examinations. Many of the referrals based on the relatively common Bethesda system, i.e. results of atypical squamous cells of undetermined significance (ASCUS) or the analogous reading of borderline dyskaryosis would never progress to cervical cancer. Triage of these equivocal cytologic findings by HPV testing, to determine need for immediate colposcopy, has proved to be cost-effective. Now, however, we are entering an era when HPV testing is being used or proposed for primary screening because of its high sensitivity for the detection of cancer and precancer [13–15]. In contrast to cytology, the non-specificity of HPV testing results from positive high-risk HPV tests that are benign infections destined to clear without need for intervention. Such benign infections are even more common than equivocal cytologic findings. Thus, referring all HPV-positive women immediately to colposcopy would flood the colposcopist with unproductive examinations; hence, the even more acute need for triage in screen-positives [4]. The earlier, subtler spectrum of lesions detected by positive HPV testing as opposed to cytologic abnormalities, further challenges the visual technique of colposcopy. The best choice of triage method for HPV-positive women is not yet determined. Current guidelines in the US recommend referral to colposcopy of women who have had two consecutive positive HPV tests without cytologic abnormalities [11]. The consequence is that the practice of colposcopy has become even more challenging in the HPV era, since some early lesions that are not even detected by cytology need to be evaluated at colposcopy.
Despite a changing referred population presenting even more diagnostic challenges, the basic equipment and reagents of colposcopy are unchanged [16]. The classic equipment remains standard (we will discuss colposcopic adjuncts below). The use of acetic acid to highlight lesions is still fundamental, with some use of Lugol’s solution. Acetowhitening has been demonstrated to be sensitive for detection of cancer and precancer, i.e. serious lesions are almost always acetowhite, but the change is non-specific [17]. Less research has been conducted on the diagnostic properties of Lugol’s staining. More specific topical markers could help to highlight cervical precancers at colposcopy, but no revolutionary new stain is on the immediate horizon, although research on candidates proceeds actively.
Equipment and reagents aside, the practice of colposcopy is under revision but without a consensus on how to improve it. In discussing colposcopic practice, it is worth noting that colposcopy serves two discrete functions: estimation of the severity of underlying cervicovaginal disease (colposcopic impression) and placement of biopsies. Ultimately, the colposcopic examination results in treatment decisions, which can be based on colposcopic impression or biopsy results in the context of screening and triage test results and previous medical history. Importantly, the approach to confirming a precancer at colposcopy differs widely in different settings, suggesting that there is a lack of strong evidence for any of the approaches.
To discuss colposcopic impression first: as usually practiced, colposcopy has not proven particularly reproducible or accurate when formally evaluated. Colposcopic impression is typically derived from a criteria-based evaluation of visible lesions. The criteria used have varied. Reid’s criteria, including consideration of factors such as lesion color, margins, and aspects of vascularity have been widely taught in the U.S. [16]. The Swede score was developed to modify some of the Reid criteria and to expand the criteria to include lesion size [18, 19]. The evidence for cancer risk discrimination provided by colposcopic impression is very heterogeneous. A normal colposcopic impression, leading to no biopsies, is often taken to represent absence of cervical precancer, but the negative predictive value of this assessment is imperfect. Grading of the non-normal (e.g. low-grade versus high-grade) colposcopic impression is inaccurate and not especially reproducible.
Recently, the International Federation for Cervical Pathology and Colposcopy (IFCPC) published a new colposcopy terminology [5]. The new terminology includes several new colposcopic signs and emphasises the evaluation of lesion location that is closely linked to excision treatment types and dimensions. Formal evaluation of the reproducibility and accuracy of this new system for determining presence or risk of cancer/precancer is ongoing.
An important role of colposcopy is to guide the placement of biopsies to diagnose the underlying disease state. Histologic diagnosis of cervical disease (which itself is imperfect) invariably includes any error in biopsy placement. Biopsy protocols remain varied among colposcopists, and the choice of protocols is frankly controversial. At the simplest level, technical issues like the forceps type and biopsy size are not standardised. But the major issue by far is how many biopsies to take and how to choose their location. There are at least three approaches that are prominently advocated: biopsy of the worst-appearing lesion if a lesion is observed, biopsy of several acetowhite lesions, and strategies involving random biopsies of normal-appearing epithelium in four quadrants [20, 21]. These strategies are presented in decreasing order of confidence based on which the colposcopist can accurately determine the true worst disease visually. An important limitation of many studies evaluating the different approaches to detection of cervical precancers is the problem of verification bias: For example, if a study evaluates taking a single biopsy only, i.e. only evaluates disease endpoints based on the single biopsy, disease missed by this strategy cannot be detected, unless there is extended follow-up. As a result, the performance of a single biopsy to detect prevalent precancer is overestimated.
Many studies suggest that even experienced colposcopists are challenged at identifying the worst lesion on the cervix, using visual criteria [22, 23]. We have observed that totally normal, non-acetowhite epithelium only rarely harbours precancer, limiting the value of random biopsies [23]. In support of this, a longitudinal study of women with minor cytological abnormalities and normal colposcopy impression showed a low risk of CIN3 over multiple years [24]. But even subtle acetowhitening can represent precancer (including histologic CIN3) and taking multiple biopsies of acetowhite lesions rather than one biopsy of the ‘worst’ lesion, substantially increases the sensitivity of the colposcopy [17, 23]. We recently demonstrated that the increased sensitivity achieved by using multiple targeted biopsies is consistent across important risk strata like referral cytology and HPV status, but we also showed that the absolute yield of disease strongly differed by prior risk [23]. Together, these data allow tailoring colposcopy–biopsy protocols to women based on their age, cytology result, and HPV status. While increasing evidence supports multiple biopsies of acetowhite lesions as an effective way to improve detection of cervical precancers, confirmation of the best colposcopic biopsy strategy and its promulgation as an international standard remains a major issue.
Another controversial issue in colposcopy is the value of endocervical sampling, specifically endocervical curettage (ECC). There are good data showing low marginal gain in sensitivity for detection of precancer in younger women once ectocervical biopsies are taken, but for older women (or in the case of inadequate visualisation of the transformation zone), the use of ECC is more supported [25]. Some practitioners routinely add ECC to ectocervical biopsies, perhaps because of particular concern over hard-to-detect glandular lesions, while others rarely do. A large, randomised trial would be required to answer this question, given the relative rarity of precancer restricted to the endocervical canal; we know, however, of no plans to launch such an investigation.
Regardless of which colposcopy-biopsy protocol is chosen, it remains an issue how to research, teach, and evaluate colposcopy. Pathologists are able to exchange microscope slides and, similarly, it is common to rely on colpophotographs, i.e. static images used for teaching and evaluation of colposcopists [26]. With modern equipment, static images can be extremely clear and realistic. However, static images cannot show the dynamic changes in response to uptake of acetic acid, in addition to the obvious issue of incomplete visualisation of epithelium within the endocervical canal especially as women age. Images are widely used for teaching colposcopy and certainly represent a wide range of cervical abnormalities. Colposcopy videos and direct ‘apprenticeship’ in the clinic can overcome the limitations of static images, but are much harder to standardise, disseminate, and teach to a larger population of colposcopists in training.
Aids for colposcopists have been extensively researched and even submitted formally for clinical approval and commercial use [27]. So far, none has proven particularly useful, despite sophisticated image analysis and addition of non-visible spectra or electrical current. The accurate distinction of precancer from atypical metaplasia has not yet been resolved. One recent effort relies on the sensitivity and dynamics of acetowhitening, but is still to be proven as a viable clinical tool [28]. Another interesting technology for use during colposcopy is high-resolution microendoscopy, a technique to evaluate lesion severity by real-time analysis of cellular and nuclear features using a probe. This technique is under evaluation for use in both low-resource and high-resource settings [29].
The net result of inaccuracy in the colposcopic examination is that it yields imperfect risk stratification; in many instances, neither a positive nor a negative examination can be completely trusted. As a corollary, a major contemporary problem is the large fraction of women undergoing colposcopy who are not identified to have precancer requiring treatment, but whose risk profile based on preceding screening results precludes return to routine screening. Women frequently return to colposcopy clinics, with strings of repeated examinations at substantial societal expense and negative psychologic effects. This problem might worsen when the fraction of referred women sent because of persistent HPV test positivity (rather than abnormal cytologic findings) increases. We need somehow to increase the sensitivity of the colposcopic procedure, with resultant increased reassurance of negative examination/biopsies, permitting release from colposcopic surveillance. Taking more biopsies of even subtle acetowhite lesions apparently can increase colposcopic sensitivity. Improving the targeting of biopsies of the worst lesion on the cervix could achieve the same safety with reduced number of biopsies, but currently, there is no proven strategy that can accomplish this. On the other side, because colposcopy is imperfectly sensitive, when the positive predictive value of certain combinations of screening results is very high, treatment might be indicated regardless of colposcopic findings or biopsy results. An example would be that of a persistently HPV16-positive woman with high-grade squamous intraepithelial lesion (HSIL) cytology; a normal colposcopic examination in this context is likely to represent a missed incipient precancer diagnosis.
The colposcopic procedure is imperfect, but by no means useless, as suggested by some; in fact, it rightfully remains the reference standard of diagnosis of cervical cancer/precancer. Because colposcopes are expensive and not universally available, there is a long history of attempts to replace this procedure with something inexpensive and easy to use. Cervicography (off-site evaluation of a static image) and speculoscopy (using a special blue-white light source) are now of historic interest. Unmagnified visual inspection with acetic acid (VIA), however, represents a very current issue [30]. VIA has been recommended by some prominent groups as a screening or triage option; screen-and-treat protocols without histologic confirmation are sometimes employed. However, VIA shares many problems with colposcopy, with additional disadvantages of less magnification and generally less training. Its role is beyond this discussion, but VIA produces a highly imperfect estimate of cervical cancer risk and one could hope for eventual replacement by a more accurate method.
Conclusion
Colposcopy or a derivative procedure will remain an important intermediate step in most cervical screening programmes for the foreseeable future. Dr Mario Sideri was among a small group of physician-researchers most actively involved in evaluating and improving this important clinical procedure. He was instrumental in encouraging people with different, and often strong, opinions about colposcopy practice and performance to work together for the common cause of improving teaching and the practice of colposcopy. We need more attention to this area, and his loss is being felt acutely.
References
1. Carozzi F et al (2013) Risk of high-grade cervical intraepithelial neoplasia during follow-up in HPV-positive women according to baseline p16-INK4A results: a prospective analysis of a nested substudy of the NTCC randomised controlled trial Lancet Oncol 14(2) 168–76 DOI: 10.1016/S1470-2045(12)70529-6
2. Schiffman M et al (2007) Human papillomavirus and cervical cancer Lancet 370(9590) 890–907 DOI: 10.1016/S0140-6736(07)61416-0 PMID: 17826171
3. Schiffman M et al (2011) Human papillomavirus testing in the prevention of cervical cancer J Natl Cancer Inst 103(5) 368–83 DOI: 10.1093/jnci/djq562 PMID: 21282563 PMCID: 3046952
4. Wentzensen N (2013) Triage of HPV-positive women in cervical cancer screening Lancet Oncol 14(2) 107–9 DOI: 10.1016/S1470-2045(12)70568-5
5. Bornstein J et al (2012) 2011 terminology of the vulva of the International Federation for Cervical Pathology and Colposcopy J Low Genit Tract Dis 16(3) 290–5 DOI: 10.1097/LGT.0b013e31825934c7 PMID: 22659778
6. Bucchi L et al (2013) Rationale and development of an on-line quality assurance programme for colposcopy in a population-based cervical screening setting in Italy BMC Health Serv Res 13 237 DOI: 10.1186/1472-6963-13-237 PMID: 23809615 PMCID: 3701540
7. Cristiani P et al (2014) An online quality assurance program for colposcopy in a population-based cervical screening setting in Italy: results on colposcopic impression J Low Genit Tract Dis 18(4) 309–13 DOI: 10.1097/LGT.0000000000000017 PMID: 24886869
8. Maggi R et al (1989) Comparison of repeat smear, colposcopy, and colposcopically directed biopsy in the evaluation of the mildly abnormal smear Gynecol Oncol 35(3) 294–6 DOI: 10.1016/0090-8258(89)90066-8 PMID: 2599462
9. Sideri M et al (1995) Operator variability in disease detection and grading by colposcopy in patients with mild dysplastic smears Cancer 76(9) 1601–5 PMID: 8635064
10. Sideri M et al (2004) Interobserver variability of colposcopic interpretations and consistency with final histologic results J Low Genit Tract Dis 8(3) 212–6 DOI: 10.1097/00128360-200407000-00009
11. Saslow D et al (2012) American Cancer Society, American Society for Colposcopy and Cervical Pathology, and American Society for Clinical Pathology screening guidelines for the prevention and early detection of cervical cancer CA Cancer J Clin 62(3) 147–72 DOI: 10.3322/caac.21139 PMID: 22422631 PMCID: 3801360
12. Massad LS et al (2013) 2012 updated consensus guidelines for the management of abnormal cervical cancer screening tests and cancer precursors J Low Genit Tract Dis 17(5 Suppl 1) S1–S27 DOI: 10.1097/LGT.0b013e318287d329 PMID: 23519301
13. Castle PE et al (2011) Performance of carcinogenic human papillomavirus (HPV) testing and HPV16 or HPV18 genotyping for cervical cancer screening of women aged 25 years and older: a subanalysis of the ATHENA study Lancet Oncol 12(9) 880–90 DOI: 10.1016/S1470-2045(11)70188-7 PMID: 21865084
14. Gage JC et al (2014) Reassurance against future risk of precancer and cancer conferred by a negative human papillomavirus test J Natl Cancer Inst 106(8) DOI: 10.1093/jnci/dju153 PMID: 25038467 PMCID: 4111283
15. Sankaranarayanan R et al (2009) HPV screening for cervical cancer in rural India N Engl J Med 360(14) 1385–94 DOI: 10.1056/NEJMoa0808516 PMID: 19339719
16. Mayeaux EJ and Cox JT (2011) Modern Colposcopy Textbook & Atlas Lippincott Williams & Wilkins
17. Massad LS et al (2009) The accuracy of colposcopic grading for detection of high-grade cervical intraepithelial neoplasia J Low Genit Tract Dis 13(3) 137–44 DOI: 10.1097/LGT.0b013e31819308d4 PMID: 19550210 PMCID: 2921444
18. Strander B et al (2005) The performance of a new scoring system for colposcopy in detecting high-grade dysplasia in the uterine cervix Acta Obstet Gynecol Scand 84(10) 1013–7 DOI: 10.1080/j.0001-6349.2005.00895.x PMID: 16167921
19. Bowring J et al (2010) The Swede score: evaluation of a scoring system designed to improve the predictive value of colposcopy J Low Genit Tract Dis 14(4) 301–5 DOI: 10.1097/LGT.0b013e3181d77756 PMID: 20885156
20. Pretorius RG et al (2004) Colposcopically directed biopsy, random cervical biopsy, and endocervical curettage in the diagnosis of cervical intraepithelial neoplasia II or worse Am J Obstet Gynecol 191(2) 430–4 DOI: 10.1016/j.ajog.2004.02.065 PMID: 15343217
21. Pretorius RG et al (2011) Regardless of skill, performing more biopsies increases the sensitivity of colposcopy J Low Genit Tract Dis 15(3) 180–8 DOI: 10.1097/LGT.0b013e3181fb4547 PMID: 21436729
22. Gage JC et al (2006) Number of cervical biopsies and sensitivity of colposcopy Obstet Gynecol 108(2) 264–72 DOI: 10.1097/01.AOG.0000220505.18525.85 PMID: 16880294
23. Wentzensen N et al (2015) Multiple biopsies and detection of cervical cancer precursors at colposcopy J Clin Oncol 33(1) 83–9 DOI: 10.1200/JCO.2014.55.9948
24. Kelly RS et al (2012) Incidence of cervical intraepithelial neoplasia grade 2 or worse in colposcopy-negative/human papillomavirus-positive women with low-grade cytological abnormalities BJOG 119(1) 20–5 DOI: 10.1111/j.1471-0528.2011.02970.x
25. Gage JC et al (2010) Detection of cervical cancer and its precursors by endocervical curettage in 13,115 colposcopically guided biopsy examinations Am J Obstet Gynecol 203(5) 481–9 DOI: 10.1016/j.ajog.2010.06.048 PMID: 20800216 PMCID: 2975767
26. Jeronimo J et al (2007) Interobserver agreement in the evaluation of digitized cervical images Obstet Gynecol 110(4) 833–40 DOI: 10.1097/01.AOG.0000281665.63550.8f PMID: 17906017
27. Alvarez RD and Wright TC (2007) Effective cervical neoplasia detection with a novel optical detection system: a randomized trial Gynecol Oncol 104(2) 281–9 DOI: 10.1016/j.ygyno.2006.08.056
28. Wade R et al (2013) Adjunctive colposcopy technologies for examination of the uterine cervix—DySIS, LuViva Advanced Cervical Scan and Niris Imaging System: a systematic review and economic evaluation Health Technol Assess 17(8) 1–240 DOI: 10.3310/hta17080 PMID: 23449335
29. Pierce MC et al (2012) A pilot study of low-cost, high-resolution microendoscopy as a tool for identifying women with cervical precancer Cancer Prev Res (Phila) 5(11) 1273–9 DOI: 10.1158/1940-6207.CAPR-12-0221
30. World Health Organization (2013) WHO Guidelines for Screening and Treatment of Precancerous Lesions for Cervical Cancer Prevention Geneva World Health Organization