2010 Census

Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results Gao ID: GAO-08-414 April 15, 2008

An accurate decennial census relies on finding and counting people--only once--in the right place and collecting complete and correct information. GAO reviewed the Department of Commerce's U.S. Census Bureau's (Bureau) plans to (1) improve the census's accuracy--or coverage--with its coverage follow-up operation, (2) assess the accuracy of the census through its census coverage measurement program, and (3) evaluate these efforts after 2010. GAO reviewed documentation and interviewed Bureau officials about the Bureau's coverage follow-up and measurement plans. GAO compared plans for measuring coverage in 2000 and 2010 and interviewed 15 experts about changes to the plans.

The Bureau plans to expand its coverage follow-up operation to improve the accuracy of enumeration data collected in 2010. As in Census 2000, coverage follow-up is intended to resolve count discrepancies (when the number of household members reported on a completed questionnaire fails to match the number of persons for whom information is collected) and complete the enumeration of large households (more than six persons living in a housing unit). For 2010, the Bureau researched new techniques to increase the number of ways a person or household could be selected for coverage follow-up. These include (1) using coverage probes on the census form to identify cases with potential under- or overcount problems, (2) using administrative records to identify households that include persons who may not have been counted, and (3) using computer matching to determine potential duplicate persons. However, it is not clear on what basis the Bureau will select or prioritize cases from this potentially expanded universe. The Bureau plans to further assess the techniques during the 2008 Dress Rehearsal, leaving little time to consider and implement improvements for the 2010 coverage follow-up operation. Because of the potential for the coverage follow-up workload to increase, it will be important for the Bureau to be clear on how it will decide which techniques to use and how it will select coverage follow-up cases to ensure that the operation may best improve coverage. Although the Bureau's overall design for measurement is similar in many respects to the design for 2000, its measurement plan differs in important ways. The Bureau is planning to use a sample design similar to that used in 2000 and still intends to produce net coverage error estimates. However, the Bureau does not plan to use measurement results to adjust the 2010 count of the nation's population and is planning to make four significant changes: (1) conduct the measurement interview 4 months later than it did in 2000, (2) use a different method--logistic regression--to estimate net error, (3) use estimates of the components of coverage error in an effort to improve future decennials, and (4) expand computer matching and clerical matching to match persons in its postenumeration survey to the entire census. Still, the Bureau has not specified when it will produce estimates of net coverage error and the components of coverage error for the 2010 Census. The Bureau also has not clearly described how it will associate the components of coverage error to census operations in order to improve future decennials. Because of the Bureau's changes to its coverage measurement program, it is important for the Bureau to solidify its plans and share them with stakeholders to ensure that there is an agreement on what the program will produce and when the Bureau will produce its results. The Bureau is in the early stages of planning how it will evaluate coverage follow-up and measurement after the 2010 Census. As part of this effort, the Bureau is considering recommendations of the National Academy of Sciences for the Bureau's program of evaluations and experiments for Census 2010. The Bureau has not yet developed a timeline for completing its evaluation plans.

Recommendations

Our recommendations from this work are listed below with a Contact for more information. Status will change from "In process" to "Open," "Closed - implemented," or "Closed - not implemented" based on our follow up work.

Director: Team: Phone:


GAO-08-414, 2010 Census: Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results This is the accessible text file for GAO report number GAO-08-414 entitled '2010 Census: Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results' which was released on May 17, 2008. This text file was formatted by the U.S. Government Accountability Office (GAO) to be accessible to users with visual impairments, as part of a longer term project to improve GAO products' accessibility. Every attempt has been made to maintain the structural and data integrity of the original printed product. Accessibility features, such as text descriptions of tables, consecutively numbered footnotes placed at the end of the file, and the text of agency comment letters, are provided but may not exactly duplicate the presentation or format of the printed version. The portable document format (PDF) file is an exact electronic replica of the printed version. We welcome your feedback. Please E-mail your comments regarding the contents or accessibility features of this document to Webmaster@gao.gov. This is a work of the U.S. government and is not subject to copyright protection in the United States. It may be reproduced and distributed in its entirety without further permission from GAO. Because this work may contain copyrighted images or other material, permission from the copyright holder may be necessary if you wish to reproduce this material separately. Report to the Chairman, Committee on Oversight and Government Reform, House of Representatives: United States Government Accountability Office: GAO: April 2008: 2010 Census: Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results: Improving and Measuring Census Coverage: GAO-08-414: GAO Highlights: Highlights of GAO-08-414, a report to the Chairman, Committee on Oversight and Government Reform, House of Representatives. Why GAO Did This Study: An accurate decennial census relies on finding and counting people”only once”in the right place and collecting complete and correct information. GAO reviewed the Department of Commerce‘s U.S. Census Bureau‘s (Bureau) plans to (1) improve the census‘s accuracy”or coverage”with its coverage follow-up operation, (2) assess the accuracy of the census through its census coverage measurement program, and (3) evaluate these efforts after 2010. GAO reviewed documentation and interviewed Bureau officials about the Bureau‘s coverage follow-up and measurement plans. GAO compared plans for measuring coverage in 2000 and 2010 and interviewed 15 experts about changes to the plans. What GAO Found: The Bureau plans to expand its coverage follow-up operation to improve the accuracy of enumeration data collected in 2010. As in Census 2000, coverage follow-up is intended to resolve count discrepancies (when the number of household members reported on a completed questionnaire fails to match the number of persons for whom information is collected) and complete the enumeration of large households (more than six persons living in a housing unit). For 2010, the Bureau researched new techniques to increase the number of ways a person or household could be selected for coverage follow-up. These include (1) using coverage probes on the census form to identify cases with potential under- or overcount problems, (2) using administrative records to identify households that include persons who may not have been counted, and (3) using computer matching to determine potential duplicate persons. However, it is not clear on what basis the Bureau will select or prioritize cases from this potentially expanded universe. The Bureau plans to further assess the techniques during the 2008 Dress Rehearsal, leaving little time to consider and implement improvements for the 2010 coverage follow-up operation. Because of the potential for the coverage follow-up workload to increase, it will be important for the Bureau to be clear on how it will decide which techniques to use and how it will select coverage follow-up cases to ensure that the operation may best improve coverage. Although the Bureau‘s overall design for measurement is similar in many respects to the design for 2000, its measurement plan differs in important ways. The Bureau is planning to use a sample design similar to that used in 2000 and still intends to produce net coverage error estimates. However, the Bureau does not plan to use measurement results to adjust the 2010 count of the nation‘s population and is planning to make four significant changes: (1) conduct the measurement interview 4 months later than it did in 2000, (2) use a different method”logistic regression”to estimate net error, (3) use estimates of the components of coverage error in an effort to improve future decennials, and (4) expand computer matching and clerical matching to match persons in its postenumeration survey to the entire census. Still, the Bureau has not specified when it will produce estimates of net coverage error and the components of coverage error for the 2010 Census. The Bureau also has not clearly described how it will associate the components of coverage error to census operations in order to improve future decennials. Because of the Bureau‘s changes to its coverage measurement program, it is important for the Bureau to solidify its plans and share them with stakeholders to ensure that there is an agreement on what the program will produce and when the Bureau will produce its results. The Bureau is in the early stages of planning how it will evaluate coverage follow-up and measurement after the 2010 Census. As part of this effort, the Bureau is considering recommendations of the National Academy of Sciences for the Bureau‘s program of evaluations and experiments for Census 2010. The Bureau has not yet developed a timeline for completing its evaluation plans. What GAO Recommends: GAO recommends that the Secretary of Commerce direct the Bureau to specify (1) criteria it will use to assess techniques for identifying coverage follow-up cases; (2) when it will provide coverage measurement estimates and how it will relate components of coverage error to specific census operations to improve future decennials; and (3) key decision points and plans for evaluating aspects of the 2010 coverage follow-up and measurement efforts. In commenting on a draft of this report, Commerce had no substantive disagreements with our recommendations and will develop action plans for each one; and provided technical comments that we incorporated as appropriate. To view the full product, including the scope and methodology, click on [hyperlink, http://www.gao.gov/cgi-bin/getrpt?GAO-08-414]. For more information, contact Mathew J. Scirč at (202) 512-6806 or sciremj@gao.gov. [End of section] Contents: Letter: Results in Brief: Background: Bureau Plans to Expand Coverage Follow-up to Improve Census 2010 but Has Not Finalized Plans for Identifying Follow-up Cases: Bureau Has Made Changes to Coverage Measurement for 2010 Intended to Benefit the 2020 Census, but Key Aspects of Program Plan Remain Unclear: Bureau Plans for Evaluating the 2010 Coverage Follow-up Operation and Census Coverage Measurement Program Are in Early Stages: Conclusions: Recommendations for Executive Action: Agency Comments and Our Evaluation: Appendix I: Objectives, Scope, and Methodology: Appendix II: Comments from the Department of Commerce: Appendix III: GAO Contact and Staff Acknowledgments: Figures: Figure 1: Decennial Census Population Net Undercount Rates from the Bureau's Demographic Analysis in Percentages, 1940-2000: Figure 2: Operation Flowchart for the Bureau's 2008 Dress Rehearsal Coverage Follow-up Operation: Figure 3: Undercount and Overcount Probes on Census Questionnaire for the 2008 Dress Rehearsal: Figure 4: Timeline for Coverage Follow-up Interviews and Census Coverage Measurement Interviews Using Actual Dates for 2000 and Estimated Dates for 2010: United States Government Accountability Office: Washington, DC 20548: April 15, 2008: The Honorable Henry A. Waxman: Chairman: Committee on Oversight and Government Reform: House of Representatives: Dear Mr. Chairman: An accurate decennial census relies on finding and counting people-- only once--in the right place and collecting complete and correct information. The Department of Commerce's U.S. Census Bureau (Bureau) faces the daunting challenge of cost effectively counting a population that is growing steadily larger, more diverse, and according to the Bureau, increasingly difficult to find and reluctant to participate in the census. These challenges are made even more difficult by the Bureau's April 3, 2008 decision to redesign certain aspects of the decennial census. Two basic errors can affect the count of the population: omission of persons who should have been counted and erroneous enumerations of persons who should not have been counted, such as persons counted more than once or persons counted in the wrong geographic place. To ensure accuracy, the Bureau, among other operations, will conduct a coverage follow-up operation in 2010 to determine if additional persons might have been missed (undercounted) from already enumerated households, or if persons might have been counted in error (overcounted). Further, apart from the enumeration process, the Bureau will conduct a census coverage measurement program to assess the accuracy of the 2010 Census. The census is a critical national effort mandated by the Constitution. Census data are used to apportion seats in the Congress, redraw congressional districts, allocate billions of dollars in federal assistance to state and local governments, and for numerous other public and private sector purposes. The Bureau estimates that it has generally undercounted the total population in prior censuses. For the first time, in 2000, although estimates showed that it undercounted certain population groups, the Bureau estimates that it somewhat overcounted the total population. For 2010, the Bureau plans to improve the accuracy of its population count by, among other things, expanding its coverage follow-up operation. Further, the Bureau and others questioned the accuracy of the data obtained from the Bureau's coverage measurement program in 2000. As a result, for 2010, the Bureau plans to revise its methodology for determining coverage error--the extent to which persons are under-or overcounted in the enumeration process. As agreed, we determined (1) how the Bureau plans to improve coverage through its coverage follow-up operation; (2) how the Bureau's census coverage measurement plans for assessing the accuracy of the 2010 Census compare with efforts in 2000 and what the potential impact will be of those changes, including the Bureau's plans for using its measurement program results to adjust population counts; and (3) how the Bureau plans to evaluate the coverage follow-up and measurement efforts after the 2010 Census. To meet these objectives, we reviewed information about the Bureau's coverage follow-up and coverage measurement plans and interviewed Bureau officials. We reviewed relevant evaluations from Census 2000 and studies conducted by the Bureau and others, including the National Academy of Sciences. Further, we interviewed 15 experts about the Bureau's coverage measurement methodology and plans. We identified these experts based on published reports as well as recommendations from the Bureau and the National Academy of Sciences. (For a list of experts we interviewed, see app. I.) We conducted this performance audit from January 2007 to March 2008 in accordance with generally accepted government auditing standards. Those standards require that we plan and perform the audit to obtain sufficient, appropriate evidence to provide a reasonable basis for our findings and conclusions based on our audit objectives. We believe that the evidence obtained provides a reasonable basis for our findings and conclusions based on our audit objectives. Results in Brief: The Bureau plans to improve coverage by expanding the techniques it uses to identify and select cases for follow-up. However, it is not clear on what basis the Bureau will decide which techniques to use to identify potential cases, and which of these cases it will select or prioritize for coverage follow-up. As in Census 2000, coverage follow- up is intended to resolve count discrepancies (when the number of household members reported on a completed questionnaire fails to match the number of persons for whom information is collected) and complete the enumeration of large households (more than six persons living in a housing unit). The Bureau plans to include all such count discrepancy cases and all large households in its coverage follow-up universe in 2010. To improve coverage for 2010, the Bureau researched new ways to identify a more comprehensive set of cases for follow-up. Specifically, to increase the number of ways a person or household could be selected for follow-up, the Bureau tested three new coverage follow-up techniques: (1) using coverage probes on the census form mailed to households and during the nonresponse follow-up interviews to identify instances where persons may be under-or overcounted; (2) using administrative records to identify households that include persons who may not have been counted; and (3) using computer matching to determine potential duplicate persons. The Bureau is uncertain how many cases for follow-up are likely to be found based on these new techniques. Nonetheless, the number of follow-up cases could be very large and very costly to resolve. Based on its experience with the 2008 Dress Rehearsal, the Bureau expects to decide what techniques it will use for identifying and selecting cases during the 2010 Census coverage follow- up operation and how many cases it can afford to select for follow-up. Bureau officials explained that they intend to consider cost and the percentage of cases resulting in a change relative to the cost--known as yield--but the Bureau has not fully specified how it will make decisions about expanding the coverage follow-up universe or how it will select or prioritize cases from those identified by the new techniques. Using results from the 2006 Census Test, the Bureau has already decided that coverage follow-up will only be conducted by telephone rather than personal visits because the yield was higher for telephone follow-up. This decision, however, does not necessarily reflect trade-offs in the limitations of each technique used to select cases for follow-up. For example, coverage may not be improved for groups for which the Bureau does not have telephone numbers--the very wealthy or the very poor. Given that the Bureau expects to use the dress rehearsal to learn more about the expanded selection techniques, and that the dress rehearsal has been delayed by a month, the Bureau will have little time to complete its final plans for coverage follow- up for the 2010 Census. Because of the potential for the coverage follow-up workload to expand, it will be important for the Bureau to be clear on how it will decide which techniques to use and cases to select to ensure that coverage follow-up may best improve coverage. Although the Bureau's plan for coverage measurement in 2010 is similar in many ways to its measurement design for 2000 and previous coverage evaluation programs, its plan differs in important ways, including its overall objectives, timing, statistical techniques used, and use of computer matching. Nonetheless, much of what the Bureau will do is yet to be decided, including most notably, when the Bureau will produce estimates of net coverage error and components of coverage error, and how it will meet one of its objectives for coverage measurement in 2010--improving future census operations. One important aspect of the Bureau's plans for 2010--specification of the logistic regression model--cannot be known until after the Bureau collects the 2010 coverage measurement data. Similarities between 2000 and 2010 include the Bureau's sample design for coverage measurement[Footnote 1] and that the Bureau once again intends to use a dual systems approach to produce net coverage error estimates--or the difference between the coverage measurement program's estimate of the total population and the decennial count of the total population.[Footnote 2] However, several of the Bureau's objectives for the 2010 coverage measurement program differ. For 2010, the Bureau places greater focus on improving future decennials. In contrast, for 2000, the Bureau designed coverage measurement to consider adjusting the count of the nation's population. The Bureau does not plan to use coverage measurement results to adjust the 2010 count of the nation's population because it believes that with a sufficiently correct count, adjustment would introduce as much or more error than it was designed to correct. The Bureau's measurement program in 2010 differs in other significant ways from that in 2000, for example, (1) starting the coverage measurement interview about 4 months later, to permit expansion of the Bureau's efforts to conduct the coverage follow-up operation;[Footnote 3] (2) estimating net coverage error using logistic regression, which is a standard statistical modeling technique; (3) estimating the components of net coverage error to better understand the details of erroneous enumerations and omissions; and (4) expanding computer matching and clerical matching to match persons in the coverage measurement program's postenumeration survey to the entire census to resolve or identify duplicates.[Footnote 4] Bureau officials said that the coverage measurement interview will begin later because they expect to expand the coverage follow-up operation, and that to maintain independence and minimize possible contamination between enumeration and measurement, the Bureau did not want to overlap coverage follow-up and coverage measurement. Experts we interviewed were generally concerned about the contamination issue but also viewed the time delay as a very significant challenge because respondent recall[Footnote 5] is vital to conducting the measurement program. The Bureau has simulated the use of its new method of estimating net error--logistic regression--by using data from 2000 and found it to work. The Bureau will be unable to fully specify the model it will ultimately use until after the data collection phase of its coverage measurement program. One aspect of its plans that the Bureau should be able to specify now is when it will produce net coverage error estimates and components of coverage error estimates for 2010. Further, though the Bureau has preliminary plans--using data table shells--to associate components of coverage error to census operations, the Bureau has not clearly explained how it plans to meet its objective of improving future decennials. For the most part, experts we interviewed, after they reviewed our summary of the Bureau's coverage measurement plans, stated that the Bureau had improved its research model and had taken significant and constructive steps in attempting to address key issues for coverage error, but several experts noted that more detailed planning is needed. Because of the Bureau's changes to its coverage measurement program, it is important for the Bureau to make its plans clear and share them with stakeholders to ensure that there is an agreement on what the program will produce and when the Bureau will produce its measurement results. The Bureau's plans for evaluating its coverage follow-up operation and census coverage measurement program for the 2010 Census are in early stages of development. In terms of evaluation, the Bureau suggested research on various aspects of coverage follow-up and measurement and contracted with the National Academy of Sciences to recommend proposed research for the 2010 Census. The academy's final report is due by September 2009, but an interim report was issued December 2007. The interim report contained recommendations for the Bureau's program of evaluations and experiments for the 2010 Census. Bureau officials have no timeline yet for making decisions about the evaluations of the Bureau's coverage follow-up operation or census coverage measurement program following the 2010 Census, and they noted that to determine which recommendations to implement, the Bureau will consider costs and staffing needs. We are recommending that the Secretary of Commerce direct the Bureau to provide Congress and other census stakeholders with more specific plans on conducting coverage follow-up and census coverage measurement and evaluating the results of those programs for the 2010 Census. To improve program operations, we recommend that the Bureau specify the criteria it will use to assess the techniques for identifying cases for coverage follow-up in the 2008 Dress Rehearsal and the timeline it plans to follow that will ensure that the results of the assessment may be used in planning for 2010. To enhance transparency and oversight for improving coverage measurement for the 2010 Census, we recommend that the Bureau describe when it will provide estimates of net coverage error and components of coverage error and how it plans to relate components of coverage error to census operations in order to improve future decennials. Finally, we recommend that the Bureau provide key decision points and plans for evaluating aspects of the 2010 Census coverage follow-up operation and census coverage measurement program. The Secretary of Commerce provided written comments on a draft of this report. The comments are reprinted in their entirety in appendix II and we have incorporated in the report, as appropriate, the Department of Commerce's technical comments. Commerce had no substantive disagreements with our recommendations and stated that it will develop formal action plans for each recommendation. While Commerce believes a timeline for producing estimates of net coverage error and components of coverage error is certainly a reasonable expectation, Commerce noted that it is unclear as to how it can at this time provide any specifics on how coverage results might improve future censuses before the coverage data are obtained and analyzed. We recognize that the Bureau has developed table shells that associate components of coverage error to census operations, but these table shells and plans for how the Bureau will conduct its analyses have not been shared with the Congress and other stakeholders. In addition, it is not clear from current Bureau plans how it will go about obtaining the data required to complete the tables. Background: Historically, the census has been affected by undercounts--those who were missed in the decennial count. Further, certain groups may be undercounted more than others--such as babies, minorities, and renters who move often. Seeking to obtain an accurate count has been a concern since the first census in 1790. Concern about undercounting the population continued through the decades. In the 1940s, demographers began to obtain a more thorough understanding of the scope and nature of the undercount. For example, the selective service registration of October 1940 showed 2.8 percent more men than the census count. According to the Bureau, operations and programs designed to improve coverage have resulted in the total undercount declining in all but one decade since the 1940s. These measures of coverage are based on demographic analysis, which compares the census count to birth and death certificates and other administrative data (see fig. 1). Figure 1: Decennial Census Population Net Undercount Rates from the Bureau's Demographic Analysis in Percentages, 1940-2000: This figure is a versicle bar graph showing the decennial census population net undercount rates from the bureau's demographic analysis in percentages, 1940-2000. The X axis is the year, and the Y axis is the percentage. Year: 1940; Percentage: 5.4%. Year: 1950; Percentage: 4.1%. Year: 1960; Percentage: 3.1%. Year: 1970; Percentage: 2.7%. Year: 1980; Percentage: 1.2%. Year: 1990; Percentage: 1.6%. Year: 2000; Percentage: 0.1%. [See PDF for image] Source: U.S. Census Bureau. [End of figure] In contrast to the long-standing use of demographic analysis to estimate census undercount, modern coverage measurement began with the 1980 Census, when the Bureau compared decennial figures to the results of an independent sample survey of the population. In using statistical methods such as these, the Bureau began to generate detailed measures of the differences among undercounts of particular ethnic, racial, and other groups. In 1990, the Bureau relied on its Post-Enumeration Survey to verify the data it collected through the 1990 Census. For this effort, the Bureau interviewed a sample of households several months after the 1990 Census, and compared the results to census questionnaires to determine if each sampled person was correctly counted, missed, or double counted in the census. Using the definition of those missed minus those double counted, the Bureau estimated whether there was a net undercount or net overcount. For the 1990 Census, the Bureau estimated a net undercount of about 4 million people. During Census 2000, for the first time, the Bureau's coverage measurement program measured a net population overcount. To estimate the accuracy of the 2000 Census, the Bureau conducted its Accuracy and Coverage Evaluation, expecting it could be used to adjust the Census 2000 results for all nonapportionment purposes if it improved the census data. The original March 2001 estimates of the Accuracy and Coverage Evaluation, which was an independent sample survey designed to estimate the number of people who were under-and overcounted in the census, a problem that the Bureau refers to as coverage error, became available in time to correct census files. However, in a revised March 2003 evaluation, the Bureau was unable to conclude that the adjusted data were more accurate and determined that the 2000 Census tabulations would not be adjusted for any purpose. The decision not to adjust was consistent with decisions made during the 1990 decennial, when the Department of Commerce decided, because of other problems, not to use the postenumeration survey to adjust the decennial count. During this decade, the Bureau has conducted a testing and evaluation program to assess certain activities that it believes can improve the accuracy of the census. In preparing for the 2010 Census, the Bureau has conducted two field tests--in 2004, the Bureau tested in the Queens Borough of New York City and southwest Georgia and, in 2006, tested at the Cheyenne River Reservation in South Dakota and parts of Austin, Texas, in Travis County. In 2003 and 2005, the Bureau also conducted national surveys to test various aspects of changes planned for 2010. We reported earlier about the importance of these earlier assessments leading to a design that would be sufficiently mature so that the dress rehearsal for the 2010 Census demonstrates the feasibility of the various operations and technologies planned for the decennial under conditions that are as close as possible to the actual census.[Footnote 6] The Bureau is currently conducting its dress rehearsal, one of the final planning activities for the decennial. Census Day for the 2008 Dress Rehearsal activities in sites located in North Carolina and California will be May 1, 2008, a delay of 1 month from the Bureau's initial plan. The first 2010 Census operation, building an accurate address list, overlaps with dress rehearsal activities and has also already begun.[Footnote 7] For the 2010 Census, the majority of the housing units in the country will receive paper census questionnaires delivered either by mail or by census field workers before April 1, 2010. Those households that do not return their questionnaires will be contacted by census field workers during the nonresponse follow-up operation to determine the number of people living in the housing unit on Census Day and other information. The Bureau also uses other techniques for counting persons. For example, people in group quarters (such as college dormitories, homeless shelters, and nursing homes) will be enumerated in separate operations at the facilities by enumerators who will then leave forms listing the names of the people living or staying there and then return to pick up the completed forms and, if necessary, conduct follow-up interviews. For the 2010 Census, the Bureau has tested and planned revisions to the census form with the goal of improving the census' coverage. The revisions are intended to improve methods for ensuring that persons are counted in the correct place and only once.[Footnote 8] These changes include revising residency rules, simplifying the instructions provided to respondents on residency, and including questions--called coverage probes--about persons who may be missing or incorrectly included in the response. For Census 2000, the Bureau relied on 31 residence rules to instruct respondents on who to include on the mail-back form. The Bureau revised these 31 rules to 1 rule with 13 broad living situations for 2010. For 2010, the Bureau is also planning on using two coverage probes, one asking about potentially missed people (undercounted) and one asking about potentially duplicated people (overcounted), on the census form. Although the Bureau used coverage probes in the nonresponse follow-up operation for 2000, it did not include probes in the 2000 Census mailed questionnaires as is now planned for 2010. The Bureau intends to use answers to these coverage probes and other techniques to determine if an enumeration is likely to be incorrect or incomplete and, therefore, if cases should be selected for follow-up. The Bureau uses the results of the coverage follow-up operation to correct the results of the initial enumeration. Coverage follow-up interviews with respondents determine if changes should be made to their household rosters as reported on their initial census returns. Specifically, these results lead to a correction of census information and counts, as coverage follow-up results are combined with the initial enumeration and are used as the official enumeration of the household. Bureau Plans to Expand Coverage Follow-up to Improve Census 2010 but Has Not Finalized Plans for Identifying Follow-up Cases: For Census 2010, the Bureau is planning to improve coverage by expanding its coverage follow-up operation to obtain more accurate enumeration data. To prepare for this, the Bureau has conducted tests of an expanded follow-up operation. As in Census 2000, coverage follow- up is intended to resolve count discrepancies and complete the enumeration of large households. For 2010, the Bureau also plans to use additional techniques to increase the number of ways a person or household could be selected for follow-up. However, the Bureau has not finalized plans for how to identify and select cases from the expanded follow-up universe. The expansion of selection criteria for coverage follow-up has the potential to dramatically increase the workload for this operation and, as a result, increase costs. While the Bureau plans to budget for coverage follow-up in order to achieve the best results or highest yield of changes to household rosters, it is unclear on what basis the Bureau will assess the additional techniques for identifying potential cases or the basis for selecting cases for follow-up using the expanded techniques. The Bureau plans to assess its experiences with coverage follow-up, including important interfaces within the Bureau and between the Bureau and its contractor, during the 2008 Dress Rehearsal in making further decisions on follow-up for 2010. For example, three separate groups within the Bureau have responsibility for selecting follow-up cases, but one of those groups eliminates overlap in cases before sending them to the contractor for telephone follow-up. Recent decisions to delay the dress rehearsal, including follow-up activities, increase the risk that the results of additional testing of these techniques in 2008 will not be sufficiently timely or complete to improve operations for 2010, and the Bureau has not stated when it will complete its assessment of coverage follow-up for the 2008 Dress Rehearsal. Bureau Plans to Expand Techniques for Identifying Coverage Follow-up Cases: The Bureau is planning to improve coverage by expanding its coverage follow-up operation. To meet this objective, the Bureau expects to rely on previously used techniques as well as techniques it is now developing to identify cases for follow-up. As in Census 2000, coverage follow-up for 2010 will include: * resolving count discrepancies, when the number of household members reported on a completed questionnaire fails to match the number of persons for whom information is collected, and: * completing the enumeration of large households, in which there are more than six persons living in a housing unit. For 2010, the Bureau also plans to use additional techniques for selecting a person or household for follow-up. The techniques planned include: * using questions--called coverage probes--on the census form itself and in the nonresponse follow-up interviews to identify where persons may be omitted or counted more than once, * using administrative records to identify persons who may be omitted, and: * using computer matching from the universe of census returns to identify persons who may have been counted more than once. Although the Bureau has not finalized coverage follow-up plans for the 2010 Census, in June 2007, the Bureau developed a work flow and narrative for the 2008 Dress Rehearsal coverage follow-up operation. This work flow shows the process for collecting enumeration data, selecting coverage follow-up cases, and conducting follow-up (see fig. 2). Once the Bureau selects a case for follow-up, it plans to work through its Decennial Response Integration System (DRIS) contractor, which will conduct coverage follow-up interviews by telephone. The telephone interviews determine if changes should be made to the household rosters as reported on the initial census returns. The questions in the follow-up interview are designed to identify if persons were omitted or counted in error because they should have been counted at a different address or not at all--for example, because a person had been born after or died before Census Day. Figure 2: Operation Flowchart for the Bureau's 2008 Dress Rehearsal Coverage Follow-up Operation: This figure is a an operation flowchart showing the bureau's 2008 dress rehearsal coverage follow-up operation. [See PDF for image] Source: GAO analysis of U.S. Census Bureau data. [End of figure] As shown in figure 2, enumeration data will be captured by DRIS. Also, in selecting cases for follow-up, the Bureau will identify any overlap in cases represented by large households, count discrepancies, coverage probes, administrative records review, and computer matching. While the Bureau has tested new techniques for coverage follow-up, it has not yet tested these steps with the new processing interfaces within the Bureau--Decennial Systems and Processing Office, Decennial Statistical Studies Division, and Data Integration Division--and between the Bureau and DRIS. During the 2008 Dress Rehearsal, from May 27 to August 9, 2008, the Bureau will not only be assessing various aspects of the expanded follow-up, but it will also be testing for the first time the DRIS integration of data as well as the Bureau's process for identifying overlap cases. After the Bureau selects cases for follow- up, the DRIS contractor assigns an identification number to each case and oversees telephone interviews. Each case may receive up to seven follow-up telephone calls, and the DRIS contract includes such aspects as on-demand automated call distribution, call routing, reporting, and quality assurance monitoring. Resolving Count Discrepancies: For 2010, as in Census 2000, all count discrepancy cases identified on mailed paper census forms will be part of the coverage follow-up operation. The Bureau will check completed questionnaires for discrepancies between the number of persons reported as members of a household and the number of persons for whom census information was provided on the census forms. For example, questionnaires that fail edit checks might include cases where the respondent may indicate that five people were living in the housing unit on Census Day, but there is information for only two people. Enumerating Large Households: As in 2000, the Bureau is also aiming to improve coverage of people in housing units by completing the enumeration of large households, meaning those that indicate on their census forms that there are more than six persons in the housing unit and for which there was not enough room on the mailed questionnaire form for self-response. The questionnaire has space for only six household members to be fully listed. The nonresponse follow-up interview is not limited in the number of people that can be reported and all household members can be enumerated during nonresponse follow-up; therefore, no large household cases will need to be enumerated during the coverage follow-up operation as a result of the nonresponse follow-up interview. Using Coverage Probes to Identify Potential Under-or Overcounting: The Bureau plans to expand its universe for coverage follow-up in 2010 by including cases resulting from the use of coverage probes. For the dress rehearsal, the Bureau plans to contact anyone who affirmatively answers the undercount question (which is at the household level) and the overcount question (which is at the person level) (see fig. 3). If either or both coverage probes prove unproductive in the dress rehearsal, Bureau officials indicated that they might be eliminated from the follow-up operation. The coverage probes are included on the mail-back questionnaire and among the questions asked during the nonresponse follow-up. During nonresponse follow-up, for example, if a person responds in the affirmative to the coverage undercount probe, the interviewer collects the name and demographic information for the added person. All names of individuals added to the household roster[Footnote 9] through the undercount probe during nonresponse follow-up will be sent to the coverage follow-up operation for additional follow-up. Figure 3: Undercount and Overcount Probes on Census Questionnaire for the 2008 Dress Rehearsal: This figure is a copy of undercount and overcount probes on census questionnaire for the 2008 dress rehearsal. [See PDF for image] Source: GAO presentation of U.S. Census Bureau information. [End of figure] As shown in figure 3, for the 2008 Dress Rehearsal, the undercount probe asks if there are any additional people staying at the residence on Census Day that were not included earlier--children, such as newborn babies or foster children; relatives, such as adult children, cousins, or in-laws; nonrelatives, such as roommates or live-in baby sitters; or people staying at the residence temporarily. The overcount probe for the 2008 Dress Rehearsal asks if persons sometimes lived or stayed elsewhere--such as while attending college, while in the military, while staying at a seasonal or second residence, during a child custody arrangement, while in jail or prison, while in a nursing home, or for another reason. The overcount probe is asked of the first six household members on the mail-back questionnaire and of all persons in the nonresponse follow-up interview. Evaluations of the coverage follow-up operation for the Bureau's 2004, 2005, and 2006 tests examined the performance of both the undercount and overcount probes. In the Bureau's 2004 test, the overcount probe was found to be more effective in finding households with erroneous enumerations than the undercount probe was in finding persons omitted from the count. In this test several problems were identified with the undercount probe, most notably a low rate of added residents to the household roster. Results from the test showed that of the 3,919 respondents at the Georgia test site who answered the undercount probe affirmatively, only 53 persons were added to the household roster. The Bureau noted that the low rate of added persons was "perplexing." Other problems included difficulties with the interviewer reading the probe as worded and indications that use of the probe may have actually increased the number of count discrepancy cases. In 2005, the Bureau tested two different versions of both the undercount and overcount probes, and changes to both probes were adopted for the 2006 Census Test. During the 2006 test, the Bureau found that the overcount probe "performed as expected" and generated a higher rate of deleted persons from the census test roster for both the mail-back questionnaire and nonresponse follow-up operation than other new techniques tested to identify coverage follow-up cases. Although the Bureau designed the undercount probe to add persons, the undercount probe resulted in more deleted persons than added persons.[Footnote 10] The Bureau noted that over half of the deletions stemmed from cases in which the respondent either answered "yes" to both coverage probes or the respondent answered "yes" to the undercount probe and had a count discrepancy. Indeed, in a separate analysis that examined the rate of overlap among coverage follow-up cases--meaning households were flagged for coverage follow-up through more than one technique--the Bureau found that the highest significant rate of overlap was among cases where the respondent answered "yes" to both coverage probes. Additionally, there were some operational difficulties using the undercount probe during the nonresponse follow-up operation in the 2006 Census Test.[Footnote 11] For example, the Bureau found that although persons answered the undercount probe affirmatively (1,234 households) during nonresponse follow-up, very few persons (8) were added to the coverage follow-up roster.[Footnote 12] The Bureau referred to this number as "surprisingly low," because if a household replies "yes" to the undercount probe, the Bureau expected to see a person added to the roster for coverage follow-up. In its evaluation of coverage follow-up during the 2006 Census Test, the Bureau speculated that there could have been operational error during nonresponse follow-up that led to this discrepancy. Furthermore, of those 8 persons who were added and appeared in coverage follow-up, only 3 could be confirmed as residents during follow-up, leading the Bureau to conclude that the undercount question added persons incorrectly more than half of the time after nonresponse follow-up. Although the Bureau has made some slight modifications to the response options for the overcount probe it plans to use in the dress rehearsal, it plans to use the same undercount probe for the dress rehearsal as it did for the 2006 test. Using Administrative Records to Identify Potentially Omitted Persons: For the dress rehearsal, the Bureau plans one wave of matching with administrative records to identify duplicate or suspected incorrect data. For administrative matching, the Bureau will use the Enhanced Statistical Administrative Records System, which is a Bureau-wide resource that collects certain data from the Internal Revenue Service, the Social Security Administration,[Footnote 13] the Selective Service, the Department of Housing and Urban Development, as well as other federal agencies. The Bureau will match census returns against records from these sources using various algorithms to associate persons with housing units to identify households with potential undercounting.[Footnote 14] The Bureau plans to match names on census questionnaires with names in the administrative database--if a household is found to have more persons in the administrative records than on the census form, it would be a candidate for coverage follow- up. The Bureau first tested selecting cases for follow-up through comparison with administrative records in 2005. For the 2006 Census Test, all of the cases with matches from the administrative records search had personal visits; however, Bureau officials noted that personal visits were conducted because staff had sufficient time, given the smaller geographic area of a test site. Personal visits will not be part of the national 2010 coverage follow-up operation. The 2008 Dress Rehearsal marks the first time that cases selected through administrative record matching will be tested in the coverage follow-up telephone operation. Using Computer Matching to Identify Persons Who May Have Been Duplicated: For the dress rehearsal, the Bureau will conduct several waves of computer matching for its unduplication process. In 2000, the Bureau had difficulty with duplicate enumerations and did not have a planned operation for dealing with the problem; officials explained that in 2000 there was an "ad hoc" plan that they developed "on the fly" for identifying duplicates. However, Bureau officials noted that one of the lessons learned from 2000 was that the Bureau needed to expand the search for duplicates by computer matching. Thus, to improve the 2010 Census, along with the overcount probe specifically designed to try and find duplicates, the Bureau will also match census returns against the universe of all other census returns to identify duplicates nationwide; for the dress rehearsal, the matching will be conducted within the test site, rather than on a national scale. While the Bureau has been evaluating and testing this procedure, the limited likelihood of duplication of individuals that will occur within census test sites of limited geographic size, as compared to the actual national decennial, limits the Bureau's ability to forecast how the nationwide matching and unduplication system will affect workload for coverage follow- up.[Footnote 15] Although the Bureau has not finalized its coverage follow-up techniques, it plans to apply a computer-based record linkage system to the data collection files during the 2010 Census to match files against themselves to find potential duplicates based on housing unit and person information. Those households would then be selected for follow- up. For example, erroneous enumerations are created when duplicated people are counted correctly in one housing unit and incorrectly in another housing unit. Duplicated housing units, which occur when multiple addresses of the same housing unit are identified during census operations, are removed. Further, in contrast to Census 2000, the addresses that could be included in the census in both the housing unit and group quarter enumerations can no longer cause duplication because, according to the Department of Commerce, the Bureau now has one list for both housing units and group quarters. Consequently, according to Bureau officials, the Bureau faces the challenge of identifying the types of duplicates that can be most easily identified and addressed, which will result in the most cost-effective yield. The Bureau's 2006 test of computer matching found that potential duplicates generally were not deleted. Specifically, in its evaluation of computer matching during the 2006 Census Test, the Bureau found that 93 percent of respondents marked as possible duplicates of persons counted outside the census block were neither deleted nor linked to a deleted match. The Bureau considered this result disappointing. It noted that the coverage follow-up interviews did not resolve the cases very well because follow-up interviews were not completed for 24 percent of the cases and, in 61 percent of the cases, respondents did not confirm during the follow-up interview that they lived or stayed somewhere else. The interviewer cannot provide information about the suspected duplicate during the follow-up interview because of concerns related to confidentiality; the interview is limited in its potential for success in that the respondent has to volunteer information about the suspected duplicate case. Consequently, the Bureau is not only challenged in identifying suspected duplicate cases, but in actually resolving the cases as well. Bureau officials explain that for the unduplication process, the Bureau will assign a match score and identify links among suspected duplicates. The Bureau plans to develop a match score threshold that will vary by level of geographic distance between matches. The Bureau will then assess the quality of the match based on the match score and conduct further research on the cases exceeding this threshold. For example, once the Bureau identifies potential duplicates, it determines whether the potential duplication is a potential housing-level problem (duplicated persons identified within a census block) or person-level problem (more long distance-identified duplicates nationally). If it is a housing-level problem, then the Bureau would consider using personal visits by field staff from another of its operations, the field verification operation, to verify housing unit additions. Further, according to the Department of Commerce, field verification may be undertaken to determine if housing units should be deleted from the list. If it is a person-level problem, the Bureau would consider a coverage follow-up operation telephone interview to determine if the person is counted twice. Bureau Will Rely on Assessment of Dress Rehearsal to Complete Coverage Follow-up Plans for the 2010 Census: The Bureau is developing a number of techniques for selecting cases for follow-up and plans to expand the coverage follow-up operation overall. However, it has not determined which techniques it will use, what thresholds it will use in selecting cases for follow-up, or the resources it expects to devote to the coverage follow-up operation. The most pressing question remaining is how the Bureau will prioritize its expanded universe of follow-up cases identified through coverage probes, administrative record matching, and the unduplication process. Currently, households that display count discrepancies, are large, reply in the affirmative to coverage probes, are flagged through comparisons with administrative records, or are flagged as potential duplicates are all eligible for follow-up. Although there may be a large number of eligible cases for coverage follow-up, the Bureau is unsure how many cases it can financially afford for follow-up, and discussions are still under way within the Bureau about what units will be sent to the coverage follow-up operation. The Bureau is reviewing data from tests--such as the add and delete rates within each type of case--to aid in determining what types of techniques will be most productive in terms of identifying overcounted and missed persons. The Bureau plans to rely on additional data from its assessment of the coverage follow-up operation in the 2008 Dress Rehearsal in making decisions about which techniques, including coverage probes, administrative record matching, and the unduplication process, it plans to use for identifying and selecting cases for coverage follow-up in 2010. Dress rehearsal plans for coverage follow-up should mirror the plans intended for the decennial because the 2008 Dress Rehearsal will be the Bureau's last opportunity to assess the various procedures and systems for 2010 under as close to census-like conditions as possible. However, the expanded coverage follow-up operation conducted during the dress rehearsal will not necessarily be what the Bureau implements in 2010 because the Bureau plans to use the dress rehearsal to assess key aspects of a likely coverage follow-up operation, including (1) the potential coverage follow-up workload and how it may be distributed, (2) the extent to which the coverage follow-up operation improves census coverage through such factors as the percentage of added residents and deleted residents, and (3) problems administering the follow-up interview and managing work. The assessment will also address problems with the operational work flow and interfaces and provide recommendations to fix or improve these problems. While the Bureau's expectation is that it will have the necessary data from the dress rehearsal to inform the final plans for the 2010 coverage follow-up operation, Bureau officials have noted that the delayed dress rehearsal (a delay in the Census Day from April 1 to May 1, 2008) may make it more difficult to incorporate the findings from the dress rehearsal into the Bureau's plans for the 2010 Census. Bureau officials have stated in several forums that resource constraints are a factor in the final decision on the scope of follow- up in 2010. The expansion of selection criteria has the potential to dramatically increase the workload for coverage follow-up in 2010 and, as a result, increase costs. Bureau officials have cautioned that it is not necessarily cost-effective for the Bureau to follow up on the entire universe eligible to be included in the coverage follow-up operation. Thus, the Bureau will be using results--such as the percentage of cases resulting in a change relative to the cost or yield--from the dress rehearsal to decide which techniques to use and how many cases it can afford to include for follow-up. Evaluations of the impact of the criteria the Bureau will use in judging what techniques to use for identifying cases for follow-up or the thresholds for selecting specific cases for follow-up will be critical to determining the final plan for the coverage follow-up operation. The Bureau has made one judgment about how it will proceed for 2010 as a result of its 2006 test. Specifically, in its evaluation of the 2006 Census Test, the Bureau calculated that the cost was $155 per roster change for coverage follow-up cases done by telephone versus $766 per roster change for cases done by personal visits. Consequently, the Bureau concluded that it should conduct follow-up interviews by telephone. This decision, based partly on the percentage of cases with a change and partly on cost, represents two criteria. Effective program management, including assessment of risks,[Footnote 16] requires that agencies identify information that is critical to achieving important program objectives.[Footnote 17] In this case, the Bureau should clearly state the basis for deciding which techniques to use or the number of cases it will include for coverage follow-up. Because the Bureau has not specified these criteria, it is not clear what criteria--other than roster change and cost--the Bureau will consider. For example, the Bureau noted in its decision to conduct coverage follow-up by telephone that not adding a personal visit component to the operation would limit its ability to reduce coverage errors for households it could not reach by telephone. Bureau research indicated that households for which it did not have telephone numbers tended to be very wealthy or very poor. Thus, coverage may not be improved for these demographic groups through telephone follow-ups alone. Bureau Has Made Changes to Coverage Measurement for 2010 Intended to Benefit the 2020 Census, but Key Aspects of Program Plan Remain Unclear: The Bureau's overall design of its census coverage measurement program for the 2010 Census is similar in many respects to its measurement design for 2000 but differs in some important ways. The objectives, timing, and some techniques to be used for 2010 differ from those used in 2000; however, the Bureau has not made clear when it will produce estimates of net coverage error and components of coverage error, nor does it make clear how it will meet one of its objectives for coverage measurement in 2010--improving future census operations. In 2000, the Bureau focused on estimating net coverage error with the possibility of producing adjusted numbers at the census block level. In contrast, for 2010, while still intending to produce net coverage error estimates at higher levels of geography--the national and state levels--the Bureau has refocused its efforts in order to use its measurement program to help improve enumeration methodologies for the future. Importantly, the Bureau does not plan for the measurement program to prepare adjusted numbers for the 2010 Census. In addition to starting coverage measurement interviews 4 months later than it did in 2000, the Bureau has also made several other important changes to its measurement program. For example, to better estimate net coverage error, the Bureau plans to use logistic regression for the first time. The Bureau also plans to assess the association of components of coverage error to various census operations. Finally, to better measure coverage, the Bureau plans to better identify duplicates by expanding its computer and clerical matching. Expert statisticians and methodologists we interviewed generally agreed that the Bureau's changes for its 2010 measurement program are an improvement from the 2000 coverage measurement program; however, several experts noted that more detailed planning is needed. For the 2010 coverage measurement program, the Bureau expects to include multiple operations. First, the Bureau selects its sample of census block clusters to be included in the coverage measurement program. Using the sample, the Bureau collects information on the housing units in the census blocks to develop an address list independent from that used in the census enumeration. The Bureau then uses this address list to draw a sample of housing units to include in the person interview operation. During this operation, Bureau field staff interview--in person--residents of these housing units and collect demographic, Interview Day and Census Day residency information of all persons living at the housing unit. According to the Department of Commerce, additional information will be collected in 2010 to identify alternate addresses where persons on the coverage sample roster may have been counted on Census Day and this information will be used in person-based computer and clerical matching operations. Next, the Bureau compares data collected during the person interviews with all census enumerations. Where the Bureau is unable to make a match, the Bureau follows up by sending interviewers back to the housing unit to reinterview the person or persons in question. During the final operation, the Bureau produces estimates of coverage error for persons and housing units. For 2010, the Bureau plans to provide estimates of net coverage error at the national level by race/ethnicity, age/sex, and tenure--the status of an occupied housing unit as either owner occupied or renter occupied. The Bureau will also produce estimates of net coverage error for states. 2010 Sample Design for Census Coverage Measurement Is Similar to That for 2000: The Bureau's 2010 coverage measurement program sample design is similar to that used for measurement in 2000. As in 2000, the Bureau plans to use a 300,000 national housing unit sample. The methods are basically consistent with the 2000 coverage measurement program and are expected to yield similar levels of reliability, although slightly different methods are being used to determine state sampling allocation. The objective for 2010 is to have the sample allocated proportionally across states and the District of Columbia, with a minimum of 1,800 housing units in each state. Moreover, the Bureau is using a differential within state sample allocation in order to allow for oversampling in areas as needed. Notably, according to Bureau officials, it is possible that that the sampling numbers could still change depending on budget or operational constraints. Most experts we spoke with thought that the Bureau's sample design was appropriate for its objectives. Census Coverage Measurement Interviews Will Begin Later, and Estimates Will Take Longer to Complete in 2010: In 2000, the timeline for coverage measurement overlapped with the Bureau's coverage follow-up efforts, though it was conducted independently from the census enumeration; however, measurement and coverage follow-up are planned to be conducted sequentially for the 2010 Census. Specifically, in 2010, the census coverage measurement interview will begin after the coverage follow-up operation is completed (see fig. 4). According to the Bureau's 2010 Census Operations and Systems Plan, dated August 31, 2007, the Bureau expects to conduct the person interviews for coverage measurement from August 20 through October 16, 2010. In 2000, the Bureau conducted the coverage measurement person interviews from April 24 through September 11, 2000. Figure 4: Timeline for Coverage Follow-up Interviews and Census Coverage Measurement Interviews Using Actual Dates for 2000 and Estimated Dates for 2010: This figure is a timeline for coverage follow-up interviews and census coverage measurement interviews using actual dates for 2000 and estimated dates for 2010. [See PDF for image] Source: GAO analysis of U.S. Census Bureau data. [End of figure] According to the Bureau, the expansion of the coverage follow-up operation contributed to delaying the measurement program in order to maintain data independence and reduce the likelihood of data contamination resulting from the overlap of follow-up and measurement field operations.[Footnote 18] In past coverage measurement programs, the person interview phase overlapped with coverage follow-up conducted as part of the census enumeration. In 2010, the Bureau plans to delay the start of the person interview phase for coverage measurement to minimize the risk of data contamination. While many experts agreed that delaying the coverage measurement interviews will reduce the risk of data contamination, some experts also highlighted that the delay may also negatively affect the data quality because it could contribute to recall bias. Recall bias, as it pertains to coverage measurement, means that a person incorrectly recalls where he or she or other household members resided on Census Day and thus biases the coverage measurement data. Two experts also noted that delaying coverage measurement interviews could also affect quality of data on movers. Generally, experts said that decreasing the amount of time between the coverage follow-up interview and the coverage measurement interview would benefit the coverage measurement data. However, a few experts who were also familiar with the issue stated that they were unsure of the magnitude of contamination and, therefore, did not know if the Bureau should decrease the time. Several of these experts argued that the Bureau could make a better case by using results from its contamination studies to determine if the delay is warranted. In 2006, the Bureau conducted a contamination study using the results of the 2006 Census Test to determine if the coverage measurement person interviews caused a difference in responses to the coverage follow-up interviews.[Footnote 19] Although the Bureau stated that the number of sample cases involved in the study was small and that the result should be viewed with caution, it concluded that there might be some small impact on coverage follow-up results if the measurement interview is conducted at the same time as the coverage follow-up interview. For 2010, the Bureau also expects to complete its coverage error estimates later than was the case for the Bureau's initial 2000 estimates. In 2000, the Bureau produced initial estimates in March 2001; however, after evaluating the 2001 estimates, the Bureau released revised estimates in March 2003. According to Bureau officials, the Bureau has preliminary plans to complete coverage measurement estimation[Footnote 20] for the 2010 Census sometime after October 2011[Footnote 21]--6 months after the Bureau provides population counts for redistricting. Bureau officials have stated that the decision to delay measurement estimation will allow for more time to adequately evaluate the 2010 measurement data and estimates, noting that the Bureau did not have sufficient time for analysis in 2000. Objectives for the 2010 Census Coverage Measurement Differ from Those for 2000, but How the Bureau Will Meet New Objectives Is Unclear: The Bureau's 2010 coverage measurement objectives have changed from those of Census 2000. In 2000, the Bureau focused on estimating net coverage error with the possibility of adjusting the census count at the census block level of geography. Though the Bureau still intends to produce net error estimates for 2010--at the state level and higher--it has refocused the coverage measurement program on improving future enumeration methodologies; consequently, it does not plan to use the results of its coverage measurement program to improve or adjust the count of the nation's population in 2010. For 2010, the Bureau's goals for measuring coverage accuracy include the following aspects: * The Bureau plans to use a dual systems approach to estimate net coverage error by using logistic regression instead of the poststratification technique used in 2000. The Bureau believes its move to logistic regression will also allow more flexibility because it can incorporate a greater number and type of factors or predictors in the model. * The Bureau plans to improve future census operations by associating estimates of components of coverage error, including erroneous enumerations and omissions, to census operations. * To better resolve and identify duplicate enumerations, a major source of error in previous decennials and coverage measurement programs, the Bureau plans to expand computer and clerical matching to match persons in its postenumeration survey to the entire census from the more limited matching done in 2000. The Bureau plans to conduct a census coverage measurement program for the 2010 Census, but Bureau officials said that they did not know when the Bureau would produce detailed documentation describing its plans. The Bureau has developed preliminary plans for associating components of coverage error to census operations, but these plans also are not finalized. Bureau Does Not Plan to Adjust the 2010 Census in Time for Redistricting: According to the Bureau, it has no plans to use the results of its coverage measurement program to improve or adjust the 2010 Census count. In response to a May 12, 2003 letter from Representative William Lacy Clay asking about decennial plans, the Bureau explained that with a sufficiently correct count, adjustment would introduce as much or more error than it was designed to correct. If the Bureau were asked by the Congress to provide adjusted numbers, Bureau officials commented that it would be unable to quickly or easily do so because the current coverage measurement program's timeline produces estimates after October 2011, and the Bureau has not designed the software to produce estimates at lower geographic levels. Consequently, according to the Bureau, it cannot prepare adjusted numbers in time to meet the legal deadline for producing count data for redistricting, which is April 1, 2011. Furthermore, the Bureau is not planning to provide net error estimates below the state level; such estimates are necessary for redistricting. Some of the interviewed experts agreed that producing adjusted numbers would be challenging; however, several also said that the Bureau could still provide some adjusted numbers if asked to do so. Regarding the challenges, two experts stated that adjusted numbers could be produced at a higher geographic level, the national level, but not accurately at lower geographic levels, the state and local levels, without appropriate changes to the current design and methodology. For example, one expert explained that the Bureau would need to use poststratification to produce estimates at the state and local levels. Additionally, several experts noted that the 2010 coverage measurement timeline poses a challenge for producing adjusted numbers by the April 1, 2011, redistricting deadline. Bureau Plans to Use a Different Statistical Method for Estimating Net Error but Has Not Fully Specified Its Plans: The Bureau plans to use logistic regression to improve its estimation of net coverage error. Of the experts familiar with logistic regression, the majority thinks that the methodology is an improvement; however, a few experts question how the Bureau will explain a more complex method to its stakeholders. Previously, for the 2000 coverage measurement, the Bureau used a poststratification[Footnote 22] method. However, Bureau officials said that poststratification was limited because of the small sample sizes of many of the strata. They explained that logistic regression permits greater flexibility in dealing with small sample sizes because it allows for a greater number and type of factors or predictors in the model.[Footnote 23] By definition, poststratification is limited because it only allows for a small number of factors with defined categories to be included. In addition to greater flexibility, logistic regression can potentially reduce errors in estimation. According to Bureau officials, to verify estimates from logistic regression, the Bureau is also considering using some poststratification. Logistic regression provides greater flexibility in dealing with small samples, but it may be more difficult methodologically to use logistic regression to produce state-level estimates by race/ethnicity, age/sex, and tenure or to estimate lower than the state level. Some experts argued for continuing to produce estimates using poststratification to enable direct comparisons of error estimates to prior decennials and to verify that logistic regression is working. To enhance transparency and oversight for improving coverage measurement for the 2010 Census, the Bureau should provide the Congress with timely and accurate information to allow monitoring of agency activities--specifically, timely and accurate information about its plans for coverage measurement.[Footnote 24] The Bureau plans to refine its estimation methods for 2010 by using logistic regression to estimate net coverage error, but it does not have an operational or working plan that outlines what resources--such as the amount of time that it will take to run the data set(s)--will be used to complete this operation. Further, despite conducting some exploratory logistic regression work using the 2006 test data, as well as simulating its estimation model for logistic regression using decennial data from 2000 and finding workable models, the Bureau has not documented what approaches or processes it intends to apply in the further development of the regression model or models it plans to use. In addition, the Bureau does not have a written plan detailing how it will explain to stakeholders and Bureau data users the rationale for moving from poststratification to logistic regression and what effect this change may have on data interpretation--particularly as it relates to comparisons with past census data. Experts we interviewed also noted some challenges, including communicating the new method to users of census data and stakeholders, having an adequate number of staff to complete the analysis, and designing the best models--such as choosing the variables to include. Bureau's New Focus on Estimating Components of Coverage Error Is Intended to Benefit Future Decennials, but Plans Are Incomplete: The Bureau plans to estimate the components of coverage error for 2010 in order to associate the errors to census operations to improve future decennials; in 2000, the coverage measurement program focused on net coverage error, not components of coverage error. Although the Bureau will once again estimate net coverage error, Bureau documentation shows that estimating the components of coverage error is now a higher priority for the Bureau. Coverage error comprises two types of errors: errors of omission and errors from a person or housing unit mistakenly enumerated (erroneous enumerations). The impact of omissions and erroneous enumerations are combined to determine the net error. During Census 2000, the Bureau estimated the net coverage error of the census. However, net error can disguise significant count problems--the Bureau could have a large overcount in some areas and large undercount elsewhere, and it might balance out to a small net error nationwide. In addition, the coverage measurement definitions of erroneous enumerations used in 2000 were not as precise as those that will be used in 2010. For example, the 2000 coverage measurement program did not require Bureau officials to ascertain where a person should have been counted. Instead, some enumerations were treated as omissions or erroneous enumerations in an area where they should have been counted. According to the Department of Commerce, because of the Bureau's new goal to measure the components of coverage error, the Bureau will use a definition for component error estimation that is not as strict as that used for net error. For 2010, erroneous enumerations for the coverage measurement program are defined to include the following: duplicate enumerations, persons born after or who die before Census Day, or persons who are not residents of a housing unit in the United States on Census Day. Though the Bureau recognizes the potential benefits of estimating the components of coverage error to improve future decennials, it has not fully developed its plans for doing so in 2010. Providing the Congress and other key stakeholders with information about how the Bureau plans to use the 2010 coverage measurement program to improve future decennials would enhance transparency and oversight of coverage measurement.[Footnote 25] For Census 2000, the Bureau did not associate the estimates of erroneous enumerations and omissions to specific census operations. For the 2010 Census, the Bureau has not finalized plans to evaluate the quality of its estimates of components of coverage error (i.e., estimates of variance) and has only recently determined the level of geography for which estimates will be made. Moreover, although the Bureau has developed table shells that associate components of coverage error to census operations, the Bureau has not clearly explained how it plans to implement operations and data collection strategies that will support the analyses and lead to the completion of the tables and convert those analyses into plans to improve the census operation for 2020. Importantly, Bureau officials stated that it will be difficult to link estimates of the components of coverage error to specific census operations. Instead, they characterize the process of associating the components of error to census operations as an exploratory effort. A Bureau official explained that associating errors of omission to operations will be most challenging because omitting persons from the census count could be caused by errors in multiple operations. For example, a person could have been omitted because of a bad address or an incorrect enumeration. It may not be possible to determine which operation contributed to the omission. In contrast, she said that it may be more feasible to associate errors of erroneous enumerations to individual census operations because there is more that they know about the enumerations. In associating errors to census operations, a Bureau official stated that the Bureau may use a modeling approach to identify those factors with the highest probability and greatest significance in predicting specific erroneous enumeration errors. Additionally, Bureau officials noted that how the Bureau associates components of coverage error to census operations cannot be determined without first determining the final models for estimating the components of coverage error for the 2010 Census. These models cannot be specified without the actual 2010 Census data and therefore will be specified after data collection. Finally, as recommended by the National Academy of Sciences, Bureau officials acknowledged that there is ongoing experimental research involving the use of a master trace sample database in associating the components of coverage error to census operations. This database is intended to help the Bureau trace response and operational data through various stages of the 2010 Census--such as address listing and data collection. The National Academy of Sciences notes that such a database should allow the Bureau to retain a full history of relevant census operations for a designated sample of households in census blocks. The Bureau should then be able to use the database to investigate the source of a variety of potential deficiencies that can arise in a complicated undertaking like the decennial census. More specifically, the database could be used to determine what percentage of census omissions are from partially enumerated households and what percentage of omissions are from the merged administrative records database. According to a Bureau official, the Bureau plans to begin developing the database in 2008. However, the official noted several limitations with such a database. For example, the database consists of a sample of the U.S. population, as opposed to the whole census population. Still, Bureau officials noted that improving the master trace sample database in 2010 is important, citing the example of using the database for complex cluster analyses that link to coverage measurement. While most experts we contacted agreed that estimating the components of coverage error will likely help to improve future decennials, some experts questioned the Bureau's ability to link the actual errors to specific census operations. For example, one expert said that it "will be practically impossible to project errors to operations." Additionally, some experts discussed other challenges that the Bureau may face in accurately estimating the components of coverage error, including distinguishing between different types of erroneous enumerations and omissions as well as resolving duplicates and accurately matching the data. Still, some experts did say that it may be possible for the Bureau to link certain erroneous enumerations to actual operations. Because of the Bureau's new objective to estimate components of coverage error and relate them to census operations, it is important for the Bureau to solidify its plans and share them with stakeholders to ensure that there is an understanding on what the program will produce in order to improve future decennials. Bureau Plans Expanded Matching for Resolving and Identifying Duplicate Enumerations: Similar to the matching planned to expand the coverage follow-up operation, the Bureau's plans for its 2010 coverage measurement program include expanding the matching process in order to better resolve and identify duplicate enumerations--a major source of error in the 2000 Census and coverage measurement program. Some of the experts we contacted are confident that the plans will be an improvement. Shortly after the census enumeration is completed, a postenumeration survey will be conducted in a sample of block clusters. This re-enumeration sample--known as the P-sample--is used with the E-sample (the actual census enumeration in those sample areas) to derive a corrected population estimate.[Footnote 26] For 2010, the Bureau will also use computer matching to compare the P-sample with all census enumerations, with the objective of identifying and resolving duplicate enumerations. This is an expansion from 2000, when the P-sample was matched against enumerations from sample blocks and surrounding blocks only, although housing units are matched to census housing in the search area. As in 2000, the Bureau will also use clerical matching to do additional matching, but will also do nationwide matching.[Footnote 27] Despite the Bureau's effort to improve its matching process and minimize the number of duplicates in 2010, the Bureau does not yet have complete plans for the 2008 Dress Rehearsal or for the 2010 Census to help accomplish this goal. Bureau officials noted that materials are being developed for certain matching procedures, such as clerical matching. Further, Bureau officials said that budgetary constraints may also affect the matching operations and changes may occur before or following the 2008 Dress Rehearsal. For example, the Bureau may not conduct person matching during the dress rehearsal. Though a few experts cited the challenges the Bureau will continue to face in resolving duplicate enumerations, several also noted that the expanded matching process should help to reduce the number of duplicates considerably. An expert explained that by expanding the geography covered in the Bureau's matching operations, it will be easier to identify duplicates as well as households that may have been overlooked in previous coverage measurement matching operations because the search area was limited to respective census blocks and a few nearby surrounding geographic areas. Still, he also said that the expanded matching will be an improvement only if implemented well. Bureau Plans for Evaluating the 2010 Coverage Follow-up Operation and Census Coverage Measurement Program Are in Early Stages: The Bureau is in the early stages of developing plans for evaluating its 2010 Census coverage follow-up operation and census coverage measurement program. To assist in this effort, the Bureau has contracted with the National Academy of Sciences to recommend proposed research for the 2010 Census. The academy's final report is due in September 2009.[Footnote 28] In its December 2007 interim report, the academy recommended that the Bureau conduct certain experiments and evaluations, such as assessing the use of administrative records and coverage probes for identifying cases for follow-up.[Footnote 29] Bureau officials have also suggested research on various aspects of coverage follow-up and measurement for 2010. Officials said that the Bureau will consider costs and staffing needs in deciding what it will evaluate. However, the Bureau has not yet developed a timetable for making decisions about evaluating the 2010 coverage follow-up operation or census coverage measurement program. For the most recent decennial censuses, the Bureau has carried out experiments and evaluations. A census experiment usually involves field data collection during the census in which alternatives to current census processes are used and assessed for a subset of the population. An evaluation is usually an analysis conducted on data collected as part of the decennial process to determine whether individual steps in the census operated as expected. The Bureau plans to work with internal and external stakeholders on defining and developing its program for experiments and evaluations for the 2010 Census, with design work and implementation starting in 2008 and continuing through 2011. The final set of activities would include analysis, documentation, and presentation of the research, and these activities would start in 2009 and be completed by the end of the fiscal year 2013. According to the National Academy of Sciences' December 2007 interim report, its purpose was to reduce the possible subjects for census research from a list of over 50 research topics. The academy reviewed the Bureau's initial list of research topics to help reduce the possible subjects for census experimentation, and offered broad advice in its interim report on plans for evaluations of the 2010 Census. For this smaller number of research topics, the academy recommended several experiments and evaluations relating to coverage improvement and measurement. For example, the academy recommended that the Bureau assess the use of administrative records in the event that the Bureau opts to review administrative records to identify potential omissions. It also recommended that the Bureau include one or more alternate questionnaire experiments, such as an examination of the usefulness of the residence rules and concepts on the census questionnaire. Further, the academy recommended a possible experiment on comparing telephone interviews to personal interviews for the coverage follow-up operation because a telephone-only operation would follow up for only those households that provide a contact telephone number. However, at this point, the Bureau has not yet made definitive decisions about what it will evaluate and when it will make these decisions. In its interim report, the National Academy of Sciences suggested that the Bureau design a master trace sample database--a tool for investigating the source of potential deficiencies that can arise in such a complicated undertaking as the decennial census--that would help assess the benefits of the coverage follow-up interview. This database could link person, household, and housing unit characteristics; census processes; and the presence or absence of census component coverage error. The academy suggested that the use of this database would better identify the sources of high rates of census component coverage error, including erroneous enumerations, enumerations in the wrong place, duplications, and omissions. The academy also suggested that the Bureau use a reverse record check for coverage measurement. It noted that the reverse record check is an alternative method for estimating the completeness of census coverage of the population, which may have advantages over the methods of dual systems estimation and demographic analysis that have been used for this purpose to date. This procedure has not been used previously to evaluate census data, mainly because of the 10-year period between censuses, which complicates the need to trace people's addresses from one census to the next. However, the Bureau now uses the American Community Survey, a monthly survey of 250,000 housing units to collect and provide long-form data on an annual basis. A reverse check would facilitate comparing decennial enumerations to information collected more frequently during the American Community Survey. The National Academy of Sciences believes that it will be very important for the Bureau to finalize specific designs of its experiments by summer 2008 to meet the planning needs of the census. Because the data needed to support census evaluations are typically output files from the census itself, the exact structure of individual evaluations is not yet as time-sensitive as the experiments. However, the academy notes that some early planning for evaluations is critical so that the necessary data extracts can be prepared and retained, especially because much of the data collection in 2010 will be carried out by contractors, and thus data retention requirements need to be arranged with contractors as early as possible. It also suggests that the Bureau increase its in-house expertise in experimental design for the census. Some experts we interviewed about the Bureau's plans for coverage measurement also noted that the Bureau should plan its evaluations with more urgency. An expert noted that in 1990 and 2000, when the Bureau faced challenges with its adjustment process, there were fewer staff members available to work on evaluations; he said that perhaps this would not be a problem for the 2010 Census because the Bureau did not plan to adjust the census count. Further, a few experts suggested that the Bureau should ask outside researchers, in addition to the National Academy of Sciences, for assistance and feedback in its evaluation planning and development. A few experts also suggested that the Bureau should consider developing more qualitative evaluations, such as studies to look deeper into why certain populations, such as the homeless, are omitted from the census at greater rates. Conclusions: With only 2 years remaining before Census Day, the Bureau faces formidable challenges in successfully implementing and completing the 2010 Census. These challenges are even more formidable given the Bureau's recent decision to redesign certain aspects of the 2010 Census, and make more critical the need for the Bureau to have in place specific and concrete plans for ensuring and measuring the accuracy of the decennial census. To accurately count everyone residing in the United States, the Bureau must overcome significant challenges due to the nation's increasing diversity, as well as, according to the Bureau, a population increasingly reluctant to participate in surveys. In response to these challenges, the Bureau plans to expand its coverage follow-up operation--a final quality assurance method intended to help accurately enumerate the population--by focusing on resolving count discrepancies, enumerating large households, and following up on households with potential enumeration problems identified through techniques the Bureau plans to further test during its dress rehearsal. As in prior decennials, the Bureau plans to measure the accuracy of the 2010 decennial through a coverage measurement program. The Bureau intends to focus its coverage measurement efforts in 2010 on learning more about the components of census error in order to improve operations during the next decennial--2020--and future decennials. The Bureau does not plan to use its 2010 coverage measurement program to adjust the count of the nation's population. The Bureau has initiated efforts to improve the accuracy of the census count by expanding coverage follow-up and improving measurement; however, some areas bear enhanced monitoring, oversight, and management as the Bureau moves from its 2008 Dress Rehearsal to final plans for the 2010 Census. Providing internal and external stakeholders a full description of its plans for coverage follow-up and measurement will help the Bureau ensure that the plans for and expected outcomes of these important operations are understood not only by the Bureau but also by stakeholders. Importantly, many of the operational decisions for the 2010 Census depend on the upcoming 2008 Dress Rehearsal. For example, the Bureau has not finalized its plans for 2010 regarding the scope of its coverage follow-up operation and will base final decisions on how to select cases for follow-up in part on its experiences in the dress rehearsal. Coverage follow-up improvements have been tested on a site-level scale but have not yet been tested with new processing interfaces between the Bureau and its contractor for collecting and disseminating information. The Bureau plans to expand coverage follow- up but has not clearly stated the basis it will use for deciding which techniques to use or the number of cases it will include for follow-up. The Bureau will rely on the dress rehearsal to assess the effectiveness of certain coverage follow-up techniques, such as the undercount probe. Existing analyses suggest that this probe is of limited use. Overall, given the uncertainty surrounding key aspects of the Bureau's plans for coverage follow-up for 2010, as well as the delayed dress rehearsal, the content and magnitude of the 2010 coverage follow-up bear continued scrutiny. Operational plans for coverage measurement lack specificity, and the 2010 Census will be the first time that the Bureau can implement its expanded computer matching for both coverage follow-up and measurement on a national level. Importantly, although an announced purpose for revising the census coverage measurement program was to provide improvements for the 2020 Census and the Bureau is working toward associating components of error to specific operations, it has neither clearly explained how it will do this nor the expected dates for when its plans for this analysis will be complete. All of these unknown aspects increase the level(s) of risk of not meeting these important goals. By having plans and timelines in place, the Bureau increases the likelihood of completing its coverage measurement analyses in time to inform the planning for the 2020 decennial. Because the Bureau plans significant changes to its coverage follow-up and measurement activities, it should be planning well in advance of the decennial how it will evaluate and measure their success. Timely and informed oversight, monitoring, and management can promote the success of these evaluative activities. The Bureau's program for evaluations and experiments has great potential to help improve the next decennial. The National Academy of Sciences has made interim recommendations on various ways that the Bureau could assess its decennial data collection activities. Among other things, the academy recommends using a master trace sample because such a sample would improve the Bureau's efforts to relate census errors to census operations. We believe that such a master trace sample has the potential to help the Bureau better understand sources of error. Nonetheless, designing such a sample is a daunting task. The Bureau has begun to evaluate how to implement the academy's recommendation. The academy's interim report also suggested that the Bureau soon finalize specific designs of the experiments to meet its planning needs so that necessary data retention requirements could be arranged with contractors as early as possible. Recommendations for Executive Action: We are recommending that the Secretary of Commerce direct the Bureau to provide the Congress and other census stakeholders with more specific plans on conducting coverage follow-up, implementing coverage measurement, and evaluating the results of those programs for the 2010 Census. Specifically, we recommend that the Bureau provide: * the criteria it will use to assess the techniques for identifying cases for the coverage follow-up operation in the 2008 Dress Rehearsal and the timeline for completing this assessment to permit planning for 2010--the timeline should specify dates for preliminary assessments, final assessments, and when operational changes might be made to the 2010 Census; * its plans for conducting coverage measurement for the 2010 Census, which should include a description of when it will provide estimates of net coverage error and components of coverage error and how it plans to relate components of coverage error to census operations in order to improve future decennials; and: * key decision points and plans for evaluating aspects of the 2010 Census coverage follow-up operation and census coverage measurement program. Agency Comments and Our Evaluation: We provided a draft of this report to the Secretary of Commerce for his review and comment. Commerce had no substantive disagreements with our recommendations and stated that it will develop formal action plans for each recommendation. While Commerce believes a timeline for producing estimates of net coverage error and components of coverage error is certainly a reasonable expectation, Commerce noted that it is unclear as to how it can at this time provide any specifics on how coverage results might improve future censuses before the coverage data are obtained and analyzed. We recognize that the Bureau has developed table shells that associate components of coverage error to census operations, but these table shells and plans for how the Bureau will conduct its analyses have not been shared with the Congress and other stakeholders. In addition, it is not clear from current Bureau plans how it will go about obtaining the data required to complete the tables. As agreed with your office, unless you publicly announce the contents of this report earlier, we plan no further distribution until 30 days after the report date. We will then send copies of this report to the Secretary of Commerce, the Department of Commerce's Inspector General, the Director of the U.S. Census Bureau, and interested congressional committees. We will make copies available to others upon request. This report will also be available at no charge on our Web site at [hyperlink, http://www.gao.gov]. If you or your staff have any questions concerning this report, please contact me at (202) 512-6806 or sciremj@gao.gov. Contact points for our Offices of Congressional Relations and Public Affairs may be found on the last page of this report. GAO staff who made major contributions to this report are listed in appendix III. Sincerely yours, Mathew J. Scirč: Director, Strategic Issues: [End of section] Appendix I: Objectives, Scope, and Methodology: To determine (1) how the U.S. Census Bureau (Bureau) plans to improve coverage through its coverage follow-up operation, (2) how the Bureau's census coverage measurement plans for assessing the accuracy of the 2010 Census compare with efforts in 2000, and (3) how the Bureau plans to evaluate the coverage follow-up and measurement efforts after the 2010 Census, we requested and obtained source documents from the Bureau's headquarters in Suitland, Maryland. We also interviewed Bureau officials in several of its divisions, including the Decennial Management Division and the Decennial Statistical Studies Division. Further, we reviewed evaluations regarding coverage follow-up and coverage measurement. In addition, for the second objective, related to the Bureau's census coverage measurement program, we interviewed 15 experts about the Bureau's coverage measurement methodology and plans. Prior to our interviews, we summarized the Bureau's plans for coverage measurement and met with Bureau officials to obtain confirmation of our summary. We then identified experts based on published reports as well as recommendations from the Bureau and the National Academy of Sciences and contacted them about reviewing our summary of the Bureau's measurement plans and interviewing them about the Bureau's plans, including the extent to which the Bureau's plans addressed concerns and limitations of previous coverage evaluation programs, the extent to which using logistic regression instead of poststratification would improve the estimation of net coverage error, the main challenges in estimating components of coverage error, and the implications of delaying the coverage measurement interviews. Fifteen experts responded affirmatively to our request for interviews and we subsequently interviewed the following persons: Dr. Barbara Bailar: Statistical Consultant: Washington, D.C.: Dr. Thomas Belin: Professor of Biostatistics, Psychiatry, and Biobehavioral Sciences: Department of Biostatistics: School of Public Health: University of California, Los Angeles: Los Angeles, California: Dr. Michael Brick: Senior Statistician: Vice President, Director of Survey Methods: Associate Director of the Statistical Staff at Westat: Rockville, Maryland: Dr. James Brown: Senior Lecturer in Official Statistics: School of the Social Sciences: University of Southampton Southampton, United Kingdom: Mr. Dave Dolson: Director: Social Survey Methods Division: Statistics Canada: Ottawa, Canada: Dr. Stephen Fienberg: Maurice Falk: University Professor of Statistics and Social Science: Department of Statistics: Carnegie Mellon University: Pittsburgh, Pennsylvania: Dr. Tim Johnson: Director: Survey Research Laboratory: University of Illinois at Chicago: Chicago, Illinois: Mr. Don Royce: Director General: Methodology Branch: Statistics Canada: Ottawa, Canada: Dr. Duane Steffey: Director: Statistical and Data Sciences: Exponet, Inc.: Menlo Park, California: Dr. John Thompson: Executive Vice President of Survey Operations: National Opinion Research Center at the University of Chicago: Chicago, Illinois: Dr. Robert Tortora: Chief Methodologist: Gallup Washington: Washington, D.C.: Dr. Roger Tourangeau: Director: Joint Program in Survey Methodology: University of Maryland: College Park, Maryland: and: Acting Director: Program in Survey Methodology: University of Michigan: Ann Arbor, Michigan: Dr. Kenneth Wachter: Professor of Demography and Statistics: University of California at Berkeley: Berkeley, California: Dr. Martin Wells: Director of Research: Charles A. Alexander: Professor of Statistical Sciences: Cornell University: Ithaca, New York: Dr. Kirk Wolter: Head of the Center for Excellence in Survey Research: Professor in the Department of Statistics: Senior Fellow at the National Opinion Research Center: University of Chicago Chicago, Illinois: We conducted this performance audit from January 2007 to March 2008 in accordance with generally accepted government auditing standards. Those standards require that we plan and perform the audit to obtain sufficient, appropriate evidence to provide a reasonable basis for our findings and conclusions based on our audit objectives. We believe that the evidence obtained provides a reasonable basis for our findings and conclusions based on our audit objectives. [End of section] Appendix II: Comments from the Department of Commerce: Note: GAO comments supplementing those in the report text appear at the end of this appendix. The Secretary Of Commerce: Washington, D.C. 20230: April 3, 2008: Mr. Mathew J. Scire: Director: Strategic Issues: United States Government Accountability Office: Washington, DC 20548 Dear Mr. Scire: The U.S. Department of Commerce appreciates the opportunity to comment on the United States Government Accountability Office's Draft Report Entitled Census 2010: Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results, (GAO-08-414). I enclose the Department's comments on this report. Sincerely, Signed by: Carlos M. Gutierrez: Enclosure: U.S. Department of Commerce Comments on the United States Government Accountability Office Draft Report Entitled Census 2010: Bureau Needs to Specify How It Will Assess Coverage Follow-up Techniques and When It Will Produce Coverage Measurement Results, (GAO-08-414) March 2008: The U.S. Department of Commerce appreciates the United States Government Accountability Office's (GAO) efforts to review our coverage improvement and coverage measurement processes planned for the 2010 Census and this opportunity to review the draft report. We have no substantive disagreements with these recommendations. Once the report is issued in final form, we will develop formal action plans for each recommendation. While we believe a timeline for producing the estimates is certainly a reasonable expectation (Recommendation 2), we are unclear as to how we could at this time provide any specifics on how our coverage results might improve future censuses before the coverage data are obtained and analyzed by Census Bureau and other experts. Specific Comments on the Draft Report: Page - Highlights: Item 4 in the second paragraph reads (4) expand computer matching to match housing units and persons in its post enumeration survey to the entire census. (See comment 1.): Comment: The item should be revised to read (4) expand computer and clerical matching to match persons in its post enumeration survey to the entire census. The nationwide matching is not done for housing units. Page 4: The error above regarding expanding computer matching also appears on page 4 of your report and should be revised. (See comment 2.): Page 4: For clarification, there is important information that needs to be included about the coverage measurement interviewing starting four months later than in 2000. The telephone phase was dropped due to the increased complexity of the person interview instrument, and loss of time in the conversion from a telephone phase to personal-visit implementation. Part of the delay in starting the 2010 Person Interview operation is due to eliminating the telephone phase. (For Census 2000, the person field visit started on 6/1/00, while the telephone phase started on 4/24/00). Although the interviewing will start four months later compared to the Census 2000 phone phase, person interviewing ends only one and a half months later than in 2000 (9/1/2000 vs. 10/16/2010). This should reduce the concerns related to recall bias. (See comment 3.): Page 4, footnote 2: The footnote reads, The dual systems approach . estimates of the remaining error in the count after omissions and erroneous enumerations offset one another. Comment: The second sentence may give a wrong impression that a net error estimate requires estimates of omissions and erroneous enumerations. To be more accurate, we usually say "reflects" rather than "estimates." We would suggest correcting the sentence to read, The net error reflects the error remaining after omissions and erroneous enumeration offset one another. (See comment 4.): Page 4, footnote 3: The footnote reads, "Before computer matching is completed, the Bureau conducts . to estimate errors in the census. such as nonexistent persons or duplicates." (See comment 5. Now footnote 4, p4..): Comment: We offer several clarifications to this footnote. A post enumeration survey consists of two samples: the P and E samples. P- sample persons are matched not only to the E sample, but also to the entire census. E-sample persons are matched to the entire census to identify and resolve duplicates. E-sample housing units are matched to census enumerations in the block cluster and surrounding ring of blocks. Page 16: For clarification, as written, the third paragraph implies that when the nonresponse follow-up (NRFU) undercount probe tells us there may be an additional person who was not included on the NRFU roster, the coverage follow-up (CFU) interview will clarify the residency situation so that a person who should be included (but was not on the NRFU roster) will be added in CFU. The report describes this as "included in the CFU roster after NRFU," which is not accurate. We do not add these persons and collect these data during NRFU. If CFU roster clarifications indicate someone should be added, this happens during CFU. (See comment 6. Now footnotes 11, p. 16..): Page 18: For clarification of the first paragraph, the limitation with identifying duplicates in site tests is not so much the limited sample size of the test sites, but rather the limited likelihood of duplication of individuals that will occur within a site of limited geographic size. (See comment 7.): Page 18: We suggest two clarifications to the second paragraph: (1) duplicated housing units occur when multiple addresses or representations of an address for the same housing unit appear on the address list. The ones that are "identified during census operations" are removed from the list for subsequent operations and (2) the next sentence about units being included on both the housing unit list and group quarters list is no longer accurate, which represents a big change from Census 2000. We now have one list for both, with units identified as either housing units or group quarters. Therefore, in general, we can no longer have the situation described in the document that "addresses can be included in the census in both the housing unit and group quarters enumerations." (See comment 8. Now on p. 19..): Page 19: We suggest two clarifications to the second paragraph: (I) the description of the unduplication process is a bit reversed. During the automated matching process, a match score is assigned to all links based on the level of match. The Census Bureau assesses the quality of the match based on the match score and (2) the Field Verification operation verifies housing unit status of the units selected for follow- up in the operation. Units that are invalid are marked as deletes or duplicates, and valid units are marked as verified. There are no additions during the Field Verification. In some cases, the Field Verification operation verifies the existence of units added in other operations, in particular, the Be Counted program. For the housing unit unduplication cases, the units are already on the list, and Field Verification is undertaken in order to determine if housing units should be deleted from the list. Page 20: For clarification of the second paragraph, the Census Bureau plans to use all our CFU test results to make final decisions about what to implement in 2010. not just the 2008 results. In particular, we have results from the 2005 National Content Test and 2006 Census Test. The 2008 implementation was intended to be an effort to collect additional data. (See comment 10.): Page 22: Your statement reads, "Finally, to better measure coverage, the Bureau plans to better identify duplicates by expanding its computer matching." (See comment 11.): Comment: While true, we also will expand clerical matching. The following revision is suggested: "Finally, to better measure coverage, the Bureau plans to better identify duplicates by expanding its computer and clerical matching." Page 23: The first sentence reads, "During this operation, Bureau field staff interview” in person”residents of these housing units and collect demographic and Census Day residency information of all persons living at the housing unit." Comment: The sentence should be revised to read as follows: "During this operation, Census Bureau field staff interview”in person”residents of these housing units and collect demographics, Interview Day, and Census Day residency information of all persons living at the housing unit. Additional information will be collected in 2010 to identify alternate addresses where the coverage sample roster people may have been counted on Census Day. This information will be used in person- based computer and clerical matching operations." (See comment 12.): Page 23: The fourth sentence in the first paragraph reads, "Additionally, the Bureau conducts computer matching to further resolve cases." (See comment 13.): Comment: This statement is inaccurate. A computer match is not done after we conduct the follow-up. Page 23: The last sentence in the first paragraph reads, "However, unlike for Census 2000, the Bureau will not produce state-level estimates by race/ethnicity, age/sex, and tenure or estimates lower than the state level." (See comment 14.): Comment: This statement is incorrect. The Census Bureau did not produce any state- level estimates broken down by any characteristics in Census 2000. Page 26: The third bullet also reflects the same error as occurred on the Highlights page and page 4. Refer to the comment made on the Highlights page above. (See comment 15.): Page 28, first full paragraph: For clarification, the Census Bureau has not "redefined what constitutes an erroneous enumeration" because of any shortcomings. Instead, because of the new goal to measure the components of coverage error, it became important to use (for component error estimation) a definition that is not as strict as that used for net error. But the strict definition will continue to be used for net error in 2010. (See comment 16. Now p. 29..): Page 31, second paragraph: For clarification, technically speaking, the P-sample is not used to estimate the correct population, but is used to estimate the match rate, while the E-sample is used to estimate the correct enumeration rate. Both of these samples are needed to estimate the true population. Matching is performed to the census, not the E- sample, although housing units are only matched to census housing in the search area. (See comment 17. Now footnote 26, p. 31): Page 31, footnote 25: Your footnote reads, "Bureau officials have noted that the clerical matching procedures being planned for 2010 are comparable to those used for coverage measurement in the 2000 Census." (See comment 18. Now footnote 27, p. 32..): Comment: The clerical matching procedures for 2010 are much more complex than in 2000. We have to account for the duplicate searching at alternate respondent-provided addresses collected in the person interviews and person follow-up interviews. So they are not strictly comparable. Pages 31-32, last sentence: Your statement reads, "Although the Bureau will not include the coverage measurement person follow-up interviews in the dress rehearsal, for 2010 the Bureau plans to use information collected from those interviews as well as initial person interviews in its computer-matching operation." (See comment 19.): Comment: This statement is inaccurate. Information collected from Person Follow-up is not used in computer matching, but only in clerical matching. Page 33: The last paragraph references 52 research topics covering all aspects of its decennial operations that the Bureau provided for the National Academy of Sciences to review. Comment: The paragraph is misleading and should be clarified as follows: The Census Bureau provided a list of 54 research questions sorted among 11 research topics to the National Academy of Sciences for their review. For the most part, the research questions were not identified as either evaluations or experiments. The Academy did not reduce the list of subjects for census experimentation to about six. Rather, the Academy's interim report presents its ideas for 2010 Census experiments, which in some cases are similar to the Census Bureau's proposed research. (See comment 20.): The following are our comments on the Department of Commerce's letter dated April 3, 2008. GAO Comments: We clarified our recommendation on relating components of coverage error to census operations in order to improve future decennials. Specifically, we recognize that the Bureau has developed table shells that associate components of coverage error to census operations, but these table shells and plans for how the Bureau will conduct its analyses should be shared with the Congress and other stakeholders. In addition, it is not clear from current Bureau plans how it will go about obtaining the data required to complete the tables. 1. We have revised the Highlights page to reflect the Department of Commerce's clarification that for the census coverage measurement program, the Bureau will expand its computer and clerical matching to match persons in its postenumeration survey. 2. We have revised the report to reflect the Department of Commerce's clarification that the Bureau will expand its computer and clerical matching to match persons in its postenumeration survey. 3. In footnote 3, we have revised the report to clarify that the although the coverage measurement interview will start 4 months later compared to the Census 2000 telephone phase, person interviewing for the 2010 Census ends 1-1/2 months later than in 2000. 4. In footnote 2, we have revised the sentence to indicate that the net error reflects the error remaining after omissions and erroneous enumerations offset one another. 5. In footnote 4, we have revised the definition of the P-sample and E- sample based on the Department of Commerce's clarification. 6. On page 16, we have clarified information about using the undercount probe during nonresponse follow-up and coverage follow-up. 7. On page 18, we have clarified the information about the limited likelihood of duplication of individuals that will occur within a site of limited geographic size. 8. On page 19, we have incorporated the clarifying information provided by the Department of Commerce. 9. On page 19, we have clarified information about the matching process. 10. On page 20, we have revised the report to reflect that the 2008 Dress Rehearsal is intended to provide additional data to make final decisions about the Bureau's implementation of coverage follow-up in 2010. 11. On page 22, we have revised the report to reflect that the Bureau will expand its computer and clerical matching to match persons in its postenumeration survey. 12. On page 23, we have incorporated the Department of Commerce's revisions on information that is collected during the census coverage measurement program. 13. On page 23, we have deleted the incorrect sentence. 14. On page 23, we have deleted the incorrect sentence. 15. On page 26, we have revised the sentence to reflect that the Bureau will expand its computer and clerical matching to match persons in its postenumeration survey. 16. On page 29, we revised the sentence to clarify the reason for refining the Bureau's definition for erroneous enumeration. 17. On page 31, we clarified the definition of the P-sample and E- sample as suggested. 18. On page 31, for footnote 26, we acknowledged the change in clerical matching for the 2010 Census. 19. On page 31, we deleted the incorrect sentence. 20. On page 33, we provided attribution for the information to the National Academy of Sciences' December 2007 interim report. [End of section] Appendix III: GAO Contact and Staff Acknowledgments: GAO Contact: Mathew J. Scirč (202) 512-6806 or sciremj@gao.gov: Acknowledgments: In addition to the contact name above, Ron Fecso, Chief Statistician; Ernie Hazera, Assistant Director; Jenna Aurand; Betty Clark; and Jennifer Edwards made key contributions to this report. Tom Beall, Andrea Levine, Donna Miller, Lisa Mirel, and Elizabeth Wood provided significant technical support. [End of section] Footnotes: [1] The Bureau is planning to use a 300,000 national housing unit sample. [2] The dual systems approach uses a postenumeration survey to produce an estimate of the true population, which is then used to produce an estimate of the net coverage error. The net error reflects the error remaining after omissions and erroneous enumerations offset one another. See GAO, 1990 Census Adjustment: Estimating Census Accuracy-- A Complex Task, GAO/GGD-91-42 (Washington, D.C.: Mar. 11, 1991). [3] Although the interviewing will start 4 months later compared to the Census 2000 telephone phase, person interviewing for the 2010 Census ends 1-1/2 months later than in 2000. [4] The Bureau conducts a postenumeration survey, independent of the census, in a sample of block clusters. This sample is known as the P- sample and estimates the number of persons missed by the census. A postenumeration survey consists of two samples: the P-sample and the E- sample. P-sample persons are matched not only to the E-sample, but also to the entire census. E-sample persons are matched to the entire census to identify and resolve duplicates. E-sample housing units are matched to census enumerations in the block cluster and surrounding ring of blocks. [5] The coverage measurement interview depends upon the respondents' ability to recall the composition of their households on Census Day. The longer the period of time between Census Day and the day that the measurement interview is conducted, the harder it is for respondents to recall membership of their households on Census Day. [6] For example, see GAO, 2010 Census: Cost and Design Issues Need to Be Addressed Soon, GAO-04-37 (Washington, D.C.: Jan. 15, 2004); 2010 Census: Basic Design Has Potential, but Remaining Challenges Need Prompt Resolution, GAO-05-9 (Washington, D.C.: Jan. 12, 2005); and 2010 Census: Census Bureau Should Refine Recruiting and Hiring Efforts and Enhance Training of Temporary Field Staff, GAO-07-361 (Washington, D.C.: Apr. 27, 2007). [7] Starting in January 2007, the Bureau notified state and local governments that it would seek their help in developing a complete address file through the Local Update of Census Addresses program. Address canvassing--a field operation to build a complete and accurate address list in which census field workers go door to door verifying and correcting addresses for all households and street features contained on decennial maps--will begin in April 2009. For more information, see GAO, 2010 Census: Census Bureau Has Improved the Local Update of Census Addresses Program, but Challenges Remain, GAO-07-736 (Washington, D.C.: June 14, 2007). [8] See also National Academy of Sciences, Once, and Only Once, and in the Right Place: Residence Rules in the Decennial Census (Washington, D.C.: 2006). [9] A household that completes a coverage follow-up interview could either finish the interview with the same roster of household members listed at the beginning of the interview or make some change to the roster. A roster change occurs when a household adds at least one person to the original roster, deletes at least one person from the original roster, or both. [10] Of the 4,191 completed interviews, at least one person was added to 360 households while at least one person was deleted from 531 households. [11] During the nonresponse follow-up operation, if persons reply affirmatively to the undercount probe, the census field workers collect their names and demographic information. Then, these persons appear on the coverage follow-up roster and the coverage follow-up interview confirms the Census Day residency for these persons. [12] While only 8 persons were included in the coverage follow-up roster after nonresponse follow-up, the coverage follow-up added an additional 69 persons from households that were initially enumerated through nonresponse follow-up. [13] Bureau officials said that its administrative records system uses a unique identifier in place of the Social Security number. Social Security numbers are only used for the purpose of validating the Bureau's administrative records system. [14] If the records do not match, the Bureau does not inform the other agencies because of issues related to census confidentiality. [15] In the 2004 Census Test, the Bureau was successful in identifying duplicates using person matching, but it discovered operational complications with trying to resolve person and housing situations within the same operation. Basically, the Bureau concluded that with such a complicated interview, it was too much to ask of the interviewers that they understand the distinction between person and housing duplication. A major component of the 2006 Census Test was to find a way to determine before follow-up whether person duplication was resulting from housing-level issues or person-level issues. [16] See GAO, Standards for Internal Control in the Federal Government, GAO/AIMD-00-21.3.1 (Washington, D.C.: Nov. 1999). [17] See GAO, Internal Control Management and Evaluation Tool, GAO-01- 1008G (Washington, D.C.: Aug. 2001). [18] Data contamination occurs when a person's inclusion or exclusion from one census operation affects the probability of his or her inclusion in a second independent census operation. [19] The experts were not aware of these results because the documents were for internal Bureau use only. [20] According to Bureau officials, the dress rehearsal timeline is the best current indicator of the timeline for the 2010 Census. However, Bureau officials have told us that coverage measurement estimation will not be done for the 2008 Dress Rehearsal. In addition, the Bureau will not include its coverage measurement housing unit field operation, housing matching operation, or person follow-up. The Bureau is also considering dropping person matching from the dress rehearsal. [21] The exact date is yet to be determined. [22] Poststratification entailed calculating dual systems estimates separately for different demographic and geographic groups. Dual systems estimation requires that a postenumeration survey be conducted. It produces an estimate of the true population, which is then used to produce an estimate of the net coverage error. In the 2000 coverage measurement program, over 400 strata or groups were examined. [23] For instance, logistic regression allows for the use of continuous variables like age, instead of only the categorical variables used in poststratification. Additionally, unlike poststratification, logistic regression is able to include or omit interactions between variables. [24] See GAO-01-1008G. [25] See GAO-01-1008G. [26] According to the Department of Commerce, the P-sample is used to estimate the match rate, while the E-sample is used to estimate the correct enumeration rate. Both of these samples are needed to derive a corrected population estimate. Matching is performed to the census, not the E-sample. [27] Bureau officials have noted that clerical matching procedures being planned for 2010 are largely comparable to those used for coverage measurement in the 2000 Census although, with additional information being collected in 2010, the Bureau will have to account for duplicate searching at alternate respondent-provided addresses collected in the person interviews and person follow-up interviews. [28] The National Academy of Sciences has convened a panel of experts to review the Bureau's program of research, evaluation, and experimentation for the 2008 Dress Rehearsal and the 2010 Census. The panel will consider priorities for evaluation in the 2010 Census. The panel will conduct its work over a 3-year period, from September 2006 to September 2009. [29] National Academy of Sciences, Experimentation and Evaluation Plans for the 2010 Census: Interim Report (Washington, D.C.: Dec. 7, 2007). GAO's Mission: The Government Accountability Office, the audit, evaluation and investigative arm of Congress, exists to support Congress in meeting its constitutional responsibilities and to help improve the performance and accountability of the federal government for the American people. GAO examines the use of public funds; evaluates federal programs and policies; and provides analyses, recommendations, and other assistance to help Congress make informed oversight, policy, and funding decisions. GAO's commitment to good government is reflected in its core values of accountability, integrity, and reliability. Obtaining Copies of GAO Reports and Testimony: The fastest and easiest way to obtain copies of GAO documents at no cost is through GAO's Web site [hyperlink, http://www.gao.gov]. Each weekday, GAO posts newly released reports, testimony, and correspondence on its Web site. To have GAO e-mail you a list of newly posted products every afternoon, go to [hyperlink, http://www.gao.gov] and select "E-mail Updates." Order by Mail or Phone: The first copy of each printed report is free. Additional copies are $2 each. A check or money order should be made out to the Superintendent of Documents. GAO also accepts VISA and Mastercard. Orders for 100 or more copies mailed to a single address are discounted 25 percent. Orders should be sent to: U.S. Government Accountability Office: 441 G Street NW, Room LM: Washington, D.C. 20548: To order by Phone: Voice: (202) 512-6000: TDD: (202) 512-2537: Fax: (202) 512-6061: To Report Fraud, Waste, and Abuse in Federal Programs: Contact: Web site: [hyperlink, http://www.gao.gov/fraudnet/fraudnet.htm]: E-mail: fraudnet@gao.gov: Automated answering system: (800) 424-5454 or (202) 512-7470: Congressional Relations: Ralph Dawn, Managing Director, dawnr@gao.gov: (202) 512-4400: U.S. Government Accountability Office: 441 G Street NW, Room 7125: Washington, D.C. 20548: Public Affairs: Chuck Young, Managing Director, youngc1@gao.gov: (202) 512-4800: U.S. Government Accountability Office: 441 G Street NW, Room 7149: Washington, D.C. 20548:

The Justia Government Accountability Office site republishes public reports retrieved from the U.S. GAO These reports should not be considered official, and do not necessarily reflect the views of Justia.