Motor Carrier Safety

A Statistical Approach Will Better Identify Commercial Carriers That Pose High Crash Risks Than Does the Current Federal Approach Gao ID: GAO-07-585 June 11, 2007

The Federal Motor Carrier Safety Administration (FMCSA) has the primary federal responsibility for reducing crashes involving large trucks and buses that operate in interstate commerce. FMCSA decides which motor carriers to review for compliance with its safety regulations primarily by using an automated, data-driven analysis model called SafeStat. SafeStat uses data on crashes and other data to assign carriers priorities for compliance reviews. GAO assessed (1) the extent to which changes to the SafeStat model could improve its ability to identify carriers that pose high crash risks and (2) how the quality of the data used affects SafeStat's performance. To carry out its work, GAO analyzed how SafeStat identified high-risk carriers in 2004 and compared these results with crash data through 2005.

While SafeStat does a better job of identifying motor carriers that pose high crash risks than does a random selection, regression models GAO applied do an even better job. SafeStat works about twice as well as (about 83 percent better than) selecting carriers randomly. SafeStat is built on a number of expert judgments rather than using statistical approaches, such as a regression model. For example, its designers decided to weight more recent motor carrier crashes twice as much as less recent ones on the premise that more recent crashes were stronger indicators of future crashes. GAO estimates that if FMCSA used a negative binomial regression model, FMCSA could increase its ability to identify high-risk carriers by about 9 percent over SafeStat. Carriers identified by the negative binomial regression model as posing a high crash risk experienced 9,500 more crashes than those identified by the SafeStat model over an 18 month follow-up period. The primary use of SafeStat is to identify and prioritize carriers for FMCSA and state compliance reviews. FMCSA measures the ability of SafeStat to perform this role by comparing the crash rate of carriers identified as posing a high crash risk with the crash rate of other carriers. Using a negative binomial regression model would further FMCSA's mission of reducing crashes through the more effective targeting of compliance reviews to the set of carriers that pose the greatest crash risk. Late-reported, incomplete, and inaccurate data reported to FMCSA by states have been a long-standing problem. However, GAO found that late reported data had a small effect on SafeStat's ability to identify carriers that pose high crash risks in 2004. If states had reported all crash data within 90 days after occurrence, as required by FMCSA, a net increase of 299 carriers (or 6 percent) would have been identified as posing high crash risks of the 4,989 that FMCSA identified. Reporting timeliness has improved, from 32 percent of crashes reported on time in fiscal year 2000, to 89 percent in fiscal year 2006. Regarding completeness, GAO found that data for about 21 percent of the crashes (about 39,000 of 184,000) exhibited problems that hampered linking crashes to motor carriers. Having complete information on crashes is important because SafeStat treats crashes as the most important factor for assessing motor carrier crash risk, and crash information is also the crucial factor in the statistical approaches that we employed. Regarding accuracy, a series of studies by the University of Michigan Transportation Research Institute covering 14 states found incorrect reporting of crash data is widespread. GAO was not able to quantify the effect of the incomplete or inaccurate data on SafeStat's ability to identify carriers that pose high crash risks because it would have required gathering crash records at the state level--an effort that was impractical for GAO. FMCSA has acted to improve crash data quality by completing a comprehensive plan for data quality improvement, implementing an approach to correct inaccurate data, and providing grants to states for improving data quality, among other things.

Recommendations

Our recommendations from this work are listed below with a Contact for more information. Status will change from "In process" to "Open," "Closed - implemented," or "Closed - not implemented" based on our follow up work.

Director: Team: Phone:

GAO-07-585, Motor Carrier Safety: A Statistical Approach Will Better Identify Commercial Carriers That Pose High Crash Risks Than Does the Current Federal Approach This is the accessible text file for GAO report number GAO-07-585 entitled 'Motor Carrier Safety: A Statistical Approach Will Better Identify Commercial Carriers That pose high Crash Risks Than Does the Current Federal Approach' which was released on June 11, 2007. This text file was formatted by the U.S. Government Accountability Office (GAO) to be accessible to users with visual impairments, as part of a longer term project to improve GAO products' accessibility. Every attempt has been made to maintain the structural and data integrity of the original printed product. Accessibility features, such as text descriptions of tables, consecutively numbered footnotes placed at the end of the file, and the text of agency comment letters, are provided but may not exactly duplicate the presentation or format of the printed version. The portable document format (PDF) file is an exact electronic replica of the printed version. We welcome your feedback. Please E-mail your comments regarding the contents or accessibility features of this document to Webmaster@gao.gov. This is a work of the U.S. government and is not subject to copyright protection in the United States. It may be reproduced and distributed in its entirety without further permission from GAO. Because this work may contain copyrighted images or other material, permission from the copyright holder may be necessary if you wish to reproduce this material separately. Report to Congressional Requesters: United States Government Accountability Office: GAO: June 2007: Motor Carrier Safety: A Statistical Approach Will Better Identify Commercial Carriers That Pose High Crash Risks Than Does the Current Federal Approach: GAO-07-585: GAO Highlights: Highlights of GAO-07-585, a report to congressional requesters Why GAO Did This Study: The Federal Motor Carrier Safety Administration (FMCSA) has the primary federal responsibility for reducing crashes involving large trucks and buses that operate in interstate commerce. FMCSA decides which motor carriers to review for compliance with its safety regulations primarily by using an automated, data-driven analysis model called SafeStat. SafeStat uses data on crashes and other data to assign carriers priorities for compliance reviews. GAO assessed (1) the extent to which changes to the SafeStat model could improve its ability to identify carriers that pose high crash risks and (2) how the quality of the data used affects SafeStat…s performance. To carry out its work, GAO analyzed how SafeStat identified high-risk carriers in 2004 and compared these results with crash data through 2005. What GAO Found: While SafeStat does a better job of identifying motor carriers that pose high crash risks than does a random selection, regression models GAO applied do an even better job. SafeStat works about twice as well as (about 83 percent better than) selecting carriers randomly. SafeStat is built on a number of expert judgments rather than using statistical approaches, such as a regression model. For example, its designers decided to weight more recent motor carrier crashes twice as much as less recent ones on the premise that more recent crashes were stronger indicators of future crashes. GAO estimates that if FMCSA used a negative binomial regression model, FMCSA could increase its ability to identify high-risk carriers by about 9 percent over SafeStat. Carriers identified by the negative binomial regression model as posing a high crash risk experienced 9,500 more crashes than those identified by the SafeStat model over an 18 month follow-up period. The primary use of SafeStat is to identify and prioritize carriers for FMCSA and state compliance reviews. FMCSA measures the ability of SafeStat to perform this role by comparing the crash rate of carriers identified as posing a high crash risk with the crash rate of other carriers. Using a negative binomial regression model would further FMCSA‘s mission of reducing crashes through the more effective targeting of compliance reviews to the set of carriers that pose the greatest crash risk. Late-reported, incomplete, and inaccurate data reported to FMCSA by states have been a long-standing problem. However, GAO found that late reported data had a small effect on SafeStat‘s ability to identify carriers that pose high crash risks in 2004. If states had reported all crash data within 90 days after occurrence, as required by FMCSA, a net increase of 299 carriers (or 6 percent) would have been identified as posing high crash risks of the 4,989 that FMCSA identified. Reporting timeliness has improved, from 32 percent of crashes reported on time in fiscal year 2000, to 89 percent in fiscal year 2006. Regarding completeness, GAO found that data for about 21 percent of the crashes (about 39,000 of 184,000) exhibited problems that hampered linking crashes to motor carriers. Having complete information on crashes is important because SafeStat treats crashes as the most important factor for assessing motor carrier crash risk, and crash information is also the crucial factor in the statistical approaches that we employed. Regarding accuracy, a series of studies by the University of Michigan Transportation Research Institute covering 14 states found incorrect reporting of crash data is widespread. GAO was not able to quantify the effect of the incomplete or inaccurate data on SafeStat‘s ability to identify carriers that pose high crash risks because it would have required gathering crash records at the state level”an effort that was impractical for GAO. FMCSA has acted to improve crash data quality by completing a comprehensive plan for data quality improvement, implementing an approach to correct inaccurate data, and providing grants to states for improving data quality, among other things. What GAO Recommends: GAO is recommending that FMCSA use a negative binomial regression model to identify carriers that pose high crash risks. In commenting on a draft of this report, the Department of Transportation agreed that the use of a negative binomial regression model looked promising for selecting carriers for compliance reviews, but expressed some reservation about the greater sensitivity of this approach to problems with reported crash data. [Hyperlink, http://www.gao.gov/cgi-bin/getrpt?GAO-07-585]. To view the full product, including the scope and methodology, click on the link above. For more information, contact Sidney H. Schwartz at (202) 512-7387 or schwartzsh@gao.gov, or Susan A. Fleming at (202) 512- 2834 or flemings@gao.gov. [End of section] Contents: Letter: Results in Brief: Background: A Statistical Approach Would Better Identify Carriers That Pose High Crash Risks Than Does FMCSA's Current Approach: Despite Quality Problems, FMCSA's Crash Data Can Be Used to Compare Methods for Identifying Carriers That Pose High Crash Risks: Conclusion: Recommendation for Executive Action: Agency Comments and Our Evaluation: Appendix I: Results of Other Assessments of the SafeStat Model's Ability to Identify Motor Carriers That Pose High Crash Risks: Assessments of SafeStat's Predictive Capability: Impact of Data Quality on SafeStat's Predictive Capability: Appendix II: Scope and Methodology: Appendix III: Additional Results from Our Statistical Analyses of the SafeStat Model: Overview of Regression Analyses: Technical Explanation of the Negative Binomial Regression Model: Tables: Table 1: SafeStat Categories and Their Priority for Compliance Reviews: Table 2: Size Distribution of Carriers Receiving a SafeStat Rating of A through G: Table 3: Results for SafeStat Model and Regression Models: Figures: Figure 1: Commercial Motor Vehicle Fatality Rate, 1975 to 2005: Figure 2: Percentage of Crashes Submitted to MCMIS within 90 Days of Occurrence: Abbreviations: FMCSA: Federal Motor Carrier Safety Administration: MCMIS: Motor Carrier Management Information System: SafeStat: Motor Carrier Safety Status Measurement System: United States Government Accountability Office: Washington, DC 20548: June 11, 2007: The Honorable James L. Oberstar: Chairman: The Honorable John L. Mica: Ranking Republican Member: Committee on Transportation and Infrastructure: House of Representatives: The Honorable Peter A. DeFazio: Chairman: The Honorable John J. Duncan: Ranking Republican Member: Subcommittee on Highways and Transit: Committee on Transportation and Infrastructure: House of Representatives: The Honorable Thomas E. Petri: House of Representatives: The Federal Motor Carrier Safety Administration (FMCSA) within the U.S. Department of Transportation has the primary federal responsibility for reducing crashes, deaths, and injuries involving large trucks and buses operating in interstate commerce. While it carries out a number of activities toward this end, an important tool at its disposal is the compliance review--a detailed inspection of a motor carrier's operations at its place of business. FMCSA decides which carriers to inspect primarily by using an automated, data-driven analysis system called the Motor Carrier Safety Status Measurement System (SafeStat). SafeStat uses data on crashes, vehicle and driver violations, and other information to develop numerical scores for carriers, and then SafeStat assigns each carrier a priority to receive a compliance review. Following an incident in which a bus company, with many driver violations and a low priority for compliance review from the SafeStat model, suffered a fire on one of its buses that resulted in 23 deaths, you were interested in whether SafeStat could better identify commercial motor carriers at risk for crashes. To address your interest, we assessed (1) the extent to which changes to the SafeStat model could improve its ability to identify these carriers and (2) how the quality of the data used affects SafeStat's performance. These two topics are the main focus of this report. We also examined the findings of other studies on how SafeStat's ability to identify carriers at risk for crashes can be improved. (See app. I.) To determine whether statistical approaches could be used to improve FMCSA's ability to identify carriers that pose high crash risks, we tested a number of regression models and compared their performance with SafeStat's results from June 2004. We chose 2004 because it allowed us to examine actual crash data for the 18-month period following June 2004 to determine the degree to which SafeStat successfully identified carriers that proved to be of high risk for crashes. It also allowed us to include crashes that occurred within the 18 months after June 2004 but had not yet been reported to FMCSA by December 2005. Using regression models, we compared the predictive performance of these statistical approaches to SafeStat's performance to determine which method best identified carriers that pose high crash risks. We also calculated crash rates from a series of random samples of all carriers to determine if the SafeStat model did a better job than random selection in identifying motor carriers that pose high crash risks. To assess whether changes could be made to the SafeStat model to improve its identification of carriers that pose high crash risks, we tested changes to selected portions of the SafeStat model and investigated the effect of changing decision rules used to construct the four safety evaluation areas.[Footnote 1] To assess the extent to which data quality affects SafeStat's ability to identify carriers that pose high crash risks, we carried out a series of analyses and surveyed the literature to identify findings from other studies. To address timeliness, we measured the number of days it took states to report crashes. We also added late-reported crashes to FMCSA's June 2004 data and recalculated SafeStat scores to determine the effect of late-reported crashes on carriers' rankings. For completeness, we attempted to match crash records in FMCSA's Motor Carrier Management Information System (MCMIS) crash master file to motor carriers listed in the MCMIS census file and reviewed studies on state reporting. To address accuracy, we reviewed a report that tested the accuracy of electronic data on a sample of paper records and studies that identified the impact of incorrectly reported crashes in individual states on MCMIS data quality. While there are known problems with the quality of the crash data reported to FMCSA for use in SafeStat, we determined that the data were of sufficient quality for our use, which was to compare the ability of regression models to identify carriers that pose high crash risks to the current approach, which is largely derived through professional judgment. We conducted our work in accordance with generally accepted government auditing standards from May 2006 through May 2007. Appendix II provides further information on our scope and methodology. Shortly, we expect to issue a related report that examines how FMCSA identifies and takes action against carriers that are egregious safety violators. In addition, that report examines how thoroughly and consistently FMCSA conducts compliance reviews. Results in Brief: While SafeStat does a better job of identifying motor carriers that pose high crash risks than does a random selection, regression models we applied do an even better job. SafeStat works about twice as well as (about 83 percent better than) selecting carriers randomly and, therefore, has value for improving safety. SafeStat is built on a number of expert judgments. For example, SafeStat's designers used their judgment and experience to weight more recent crashes involving a motor carrier twice as much as less recent crashes on the premise that more recent crashes were stronger indicators that a carrier may have crashes in the future. Using similar reasoning, fatal crashes were weighted more heavily than less serious crashes. We found that if a negative binomial regression model was used instead, FMCSA could increase its ability to identify carriers that pose high crash risks by about 9 percent over SafeStat.[Footnote 2] Moreover, according to our analysis, this 9 percent improvement would enable FMCSA to identify carriers with twice as many crashes in the following 18 months as those carriers identified under its current approach.[Footnote 3] Carriers identified by the negative binomial regression model as posing a high crash risk experienced 9,500 more crashes than those identified by the SafeStat model over an 18 month follow-up period. The primary use of SafeStat is to identify and prioritize carriers for FMCSA and state safety compliance reviews. FMCSA measures the ability of SafeStat to perform this role by comparing the crash rate of carriers identified as posing a high crash risk with the crash rate of other carriers. In our view, using a negative binomial regression model would further FMCSA's mission of reducing crashes through the more effective targeting of safety improvement and enforcement programs to the set of carriers that pose the greatest crash risk. Applying a regression model would be easy to adapt to the existing SafeStat model and, in our opinion, would be beneficial even if FMCSA makes major revisions to its compliance and enforcement approach in the coming years under its Comprehensive Safety Analysis 2010 initiative.[Footnote 4] Crash data reported by the states from December 2001 through June 2004 have problems in terms of timeliness, accuracy, and completeness that potentially hinder FMCSA's ability to identify high risk carriers. Regarding timeliness, we found that including late-reported data had a small impact on SafeStat--including late-reported data added a net of 299 (or 6 percent) more carriers to the original 4,989 carriers that the SafeStat model ranked as highest risk in June 2004.[Footnote 5] The timeliness of crash reporting has shown steady and marked improvement: the percentage of crashes reported by states within 90 days of occurrence jumped from 32 percent in fiscal year 2000 to 89 percent in fiscal year 2006. Regarding completeness, data for about 21 percent of the crashes (about 39,000 of 184,000) exhibited problems that hampered linking crashes to motor carriers. Thirteen percent of the crashes (about 24,000) involving interstate carriers reported to FMCSA from December 2001 through June 2004 are missing the unique identifier that FMCSA assigns to each carrier when the agency authorizes the carrier to engage in interstate commerce. Crashes without a unique identifier to link to a company are excluded from use in SafeStat. An additional 8 percent of the crashes (about 15,000) that were reported had an identification number that could not be matched to a motor carrier in the FMCSA database that contains census information on motor carriers. Linking crashes to carriers is important because the current SafeStat model treats crashes as the most important factor in assessing motor carrier crash risk. Crash information is also the crucial factor in the regression models that we employed. Regarding accuracy, a series of University of Michigan Transportation Research Institute's reports on crash reporting shows that, among the 14 states studied, incorrect reporting of crash data is widespread. For example, in recent reports, the researchers found that, in 2005, Ohio incorrectly reported 1,094 (22 percent) of the 5,037 cases it reported, and Louisiana incorrectly reported 137 (5 percent) of the 2,699 cases it reported. In Ohio, most of the 1,094 crashes did not qualify because they did not meet the crash severity threshold.[Footnote 6] We were not able to quantify the actual effect of the incomplete or inaccurate data on SafeStat's ability to identify carriers that pose high crash risks, because it would have required us to gather crash records at the state level--an effort that was impractical. FMCSA has acted to improve the quality of SafeStat's data by completing a comprehensive plan for data quality improvement, implementing an approach to correct inaccurate data, and providing grants to states for improving data quality, among other things. We could not quantify the effects of FMCSA's efforts to improve the completeness or accuracy of the data for the same reason as mentioned above. This report contains a recommendation to the Secretary of Transportation aimed at applying a negative binomial regression model to the four SafeStat safety evaluation areas that would result in better identification of commercial motor carriers that pose high crash risks. Because FMCSA has initiated efforts to improve the quality of SafeStat's data, we are not making a recommendation in this area. In commenting on a draft of this report, the department agreed that it would be reasonable to consider the use of the negative binomial regression model in order to better target compliance reviews to carriers posing high crash risks, but expressed some concerns about placing more emphasis on crash information and less on other factors, such as driver, vehicle, or safety management issues. In addition, FMCSA noted that, while it has devoted considerable efforts to improving the quality of crash data submitted by states, the negative binomial regression model is more sensitive than SafeStat to problems with the crash data. Background: The interstate commercial motor carrier industry, primarily the trucking industry, is an important part of the nation's economy. Trucks transport over 11 billion tons of goods annually, or about 60 percent of the total domestic tonnage shipped.[Footnote 7] Buses also play an important role, transporting an estimated 631 million passengers annually. There are approximately 711,000 commercial motor carriers registered in MCMIS,[Footnote 8] about 9 million trucks and buses, and more than 10 million drivers. Most motor carriers are small; about 51 percent operate one vehicle, and another 31 percent operate two to four vehicles. Carrier operations vary widely in size, however, and some of the largest motor carriers operate upwards of 50,000 vehicles. Carriers continually enter and exit the industry. Since 1998, the industry has increased in size by an average of about 29,000 interstate carriers per year. In the United States, commercial motor carriers account for less than 5 percent of all highway crashes, but these crashes result in about 13 percent of all highway deaths, or about 5,500 of the approximately 43,000 highway fatalities that occur nationwide annually. In addition, about 106,000 of the approximately 2.7 million highway injuries per year involve motor carriers. The fatality rate for trucks has generally been decreasing over the past 30 years, but this decrease has leveled off, and the rate has been fairly stable since the mid-1990s. The fatality rate for buses has improved slightly from 1975 to 2005 but has more annual variability than the fatality rate for trucks due to a much smaller total vehicle miles traveled. (See fig. 1.) Figure 1: Commercial Motor Vehicle Fatality Rate, 1975 to 2005: [See PDF for image] Source: GAO analysis of Department of Transportation data. [End of figure] Congress created FMCSA through the Motor Carrier Safety Improvement Act of 1999 to reduce crashes, injuries, and fatalities involving commercial motor vehicles. To accomplish this mission, FMCSA carries out a number of enforcement, education, and outreach activities. FMCSA uses enforcement as its primary approach for reducing the number of crashes, fatalities, and injuries involving trucks and buses. Some of FMCSA's enforcement programs include compliance reviews, which are on- site reviews of carriers' records and operations to determine compliance with regulations; safety audits of new interstate carriers; and roadside inspections of drivers and vehicles. FMCSA's education and outreach programs are intended to promote motor carrier safety and consumer awareness. One of the programs is the New Entrant program, which is designed to inform newly registered motor carriers about motor carrier safety standards and regulations to help them comply with FMCSA's requirements. Other programs are designed to identify unregistered carriers and get them to register, promote increased safety belt use among commercial drivers, and inform organizations and individuals that hire buses how to make safe choices. FMCSA plans to make major revisions to its compliance and enforcement approach under an initiative called Comprehensive Safety Analysis 2010. Compliance reviews are an important enforcement tool because they allow FMCSA to take an in-depth look at carriers that have been identified as posing high crash risks because of high crash rates or poor safety performance records. Motor carriers may be identified as high risk from SafeStat or through calls to FMCSA's complaint hotline. Carriers are given a satisfactory, conditional, or unsatisfactory safety rating. A conditional rating means the carrier is allowed to continue operating, but FMCSA may schedule a follow-up compliance review to ensure that problems noted in the first compliance review are addressed. An unsatisfactory rating must be addressed or the carrier is placed out of service, meaning it is no longer allowed to do business, and the carrier may face legal enforcement actions undertaken by FMCSA. Compliance reviews can take several days to complete, depending on the size of the carrier, and may result in enforcement actions being taken against a carrier. FMCSA uses both its own inspectors and state inspectors to carry out its enforcement activities. In total, about 750 staff are available to perform compliance reviews, and more than 10,000 staff do vehicle and driver inspections at weigh stations and other points. Together, FMCSA and its state partners perform about 16,000 compliance reviews a year, which cover about 2 percent of the nation's 711,000 carriers.[Footnote 9] Because the number of inspectors is small compared with the size of the motor carrier industry, FMCSA prioritizes carriers for compliance reviews. To do so, it uses SafeStat to identify carriers that pose high crash risks. SafeStat is a model that uses information gathered from crashes, roadside inspections, traffic violations, compliance reviews, and enforcement cases to determine a motor carrier's safety performance relative to that of other motor carriers that have similar exposure in these areas. A carrier's score is calculated on the basis of its performance in four safety evaluation areas: * Accident safety evaluation area: The accident safety evaluation area reflects a carrier's crash history relative to other motor carriers' histories. The safety evaluation area is based on state-reported crash data, vehicle data from MCMIS, and data on reportable crashes and annual vehicle miles traveled from the most recent compliance review. A carrier must have two or more reportable crashes within the last 30 months to have the potential to receive a deficient value and thus be made a priority for a compliance review. * Driver safety evaluation area: The driver safety evaluation area reflects a carrier's driver-related safety performance and compliance relative to other motor carriers. The driver safety evaluation area is based on violations cited in roadside inspections that have been performed within the last 30 months and compliance reviews that have occurred within the last 18 months, together with the number of drivers listed in MCMIS. A carrier must have three or more driver inspections, three or more moving violations, or at least one acute or critical violation of driver regulations[Footnote 10] from a compliance review to have the potential to receive a deficient value and thus be made a priority for a compliance review. * Vehicle safety evaluation area: The vehicle safety evaluation area reflects a carrier's vehicle-related safety performance and compliance relative to other motor carriers. The vehicle safety evaluation area is based on violations identified during vehicle roadside inspections that have occurred within the last 30 months or vehicle-related acute and critical violations of regulations discovered during compliance reviews that have occurred within the last 18 months. A carrier must have either three or more vehicle inspections or at least one acute or critical violation of vehicle regulations from a compliance review to have the potential to receive a deficient value and thus be made a priority for a compliance review. * Safety management safety evaluation area: The safety management safety evaluation area reflects a carrier's safety management relative to other motor carriers. It is based on the results of violations cited in closed enforcement cases in the past 6 years or violations of regulations related to hazardous materials and safety management discovered during a compliance review performed within the last 18 months. A carrier must have had at least one enforcement case initiated and closed or at least two enforcement cases closed within the past 6 years, or at least one acute, critical, or severe violation of hazardous material or safety management regulations[Footnote 11] identified during a compliance review within the last 18 months to have the potential to receive a deficient value and thus be made a priority for a compliance review. A motor carrier's score is based on its relative ranking, indicated as a value, in each of the four safety evaluation areas. For example, if a carrier receives a value of 75 in the accident safety evaluation area, then 75 percent of all carriers with sufficient data for evaluation performed better in that safety evaluation area, while 25 percent performed worse. The calculation used to determine a motor carrier's SafeStat score is as follows: SafeStat Score = (2.0x accident value) + (1.5x driver value) + vehicle value + safety management value: As shown in the formula, the accident and driver safety evaluation areas have 2.0 and 1.5 times the weight, respectively, of the vehicle and safety management safety evaluation areas. Safety evaluation area values less than 75 are ignored in the formula used to determine the SafeStat score. For example, a carrier with values of 74 for all four safety evaluation areas has a SafeStat score of 0. FMCSA assigned more weight to these safety evaluation areas because, according to FMCSA, crashes and driver violations correlate relatively better with future crash risk. In addition, more weight is assigned to fatal crashes and to crashes that occurred within the last 18 months. In consultation with state transportation officials, insurance industry representatives, safety advocates, and the motor carrier industry, FMCSA used its expert judgment and professional knowledge to assign these weights, rather than determining them through a statistical approach, such as regression modeling. FMCSA assigns carriers categories ranging from A to H according to their performance in each of the safety evaluation areas. A carrier is considered to be deficient in a safety evaluation area if it receives a value of 75 or higher in that particular safety evaluation area. Although a carrier may receive a value in any of the four safety evaluation areas, the carrier receives a SafeStat score only if it is deficient in one or more safety evaluation areas. Carriers that are deficient in two or more safety evaluation areas and have a SafeStat score of 225 or more are considered to pose high crash risks and are placed in category A or B. (See table 1.) Carriers that are deficient in two safety evaluation areas but have a SafeStat score of less than 225 are placed in category C and receive a medium priority for compliance reviews. Carriers that are deficient in only one of the safety evaluation areas are placed in category D, E, F, or G. Carriers that are not deficient in any of the safety evaluation areas do not receive a SafeStat score and are placed in category H. Table 1: SafeStat Categories and Their Priority for Compliance Reviews: Category: A; Condition: Deficient in all four safety evaluation areas; or; Deficient in three safety evaluation areas that result in a weighted SafeStat score of 350 or more; Priority for compliance review: High. Category: B; Condition: Deficient in three safety evaluation areas that result in a weighted SafeStat score of less than 350; or; Deficient in two safety evaluation areas that result in a weighted SafeStat score of 225 or more; Priority for compliance review: High. Category: C; Condition: Deficient in two safety evaluation areas that result in a weighted SafeStat score of less than 225; Priority for compliance review: Medium. Category: D; Condition: Deficient in the accident safety evaluation area (accident safety evaluation area value between 75-100); Priority for compliance review: Low. Category: E; Condition: Deficient in the driver safety evaluation area (driver safety evaluation area value between 75-100); Priority for compliance review: Low. Category: F; Condition: Deficient in the vehicle safety evaluation area (vehicle safety evaluation area value between 75-100); Priority for compliance review: Low. Category: G; Condition: Deficient in the safety management safety evaluation area (safety management safety evaluation area value between 75-100); Priority for compliance review: Low. Category: H; Condition: Not deficient in any of the safety evaluation areas (value below 75 in each of the safety evaluation areas); Priority for compliance review: Low. Source: GAO summary of FMCSA data. [End of table] Of the 622,000 motor carriers listed in MCMIS as having one or more vehicles in June 2004, about 140,000, or 23 percent, received a SafeStat category A through H. There are several reasons why a small proportion of carriers receive a score. First, approximately 305,900, or about 42 percent, of the carriers have crash, vehicle inspection, driver inspection, or enforcement data of any kind. SafeStat relies on these data to calculate a motor carrier's score, so carriers without such data are not rated by SafeStat. It is likely that some of the carriers listed in MCMIS are no longer in business, but it is also possible that these carriers had no crashes, inspections, or compliance reviews in the 30-month period prior to June 2004. Second, a carrier must meet the minimum requirements to be assigned a value in a given safety evaluation area.[Footnote 12] If, for example, a carrier had only one reportable crash within the last 30 months, then the carrier would not be assigned an accident safety evaluation area value. Of the 305,900 carriers that have any safety data in SafeStat, 140,000 met the SafeStat minimum requirements in one or more safety evaluation areas. Of these 140,000 carriers, 45,000 were rated in categories A through G. The other carriers were placed in category H because they were not considered deficient, meaning they did not receive a value of 75 or more in any of the safety evaluation areas. The design of SafeStat and its data sufficiency requirements increase the likelihood that larger motor carriers will be deficient in one of the safety evaluation areas, in other words, rated in categories A through G, than are small carriers. About 51 percent of all carriers listed in MCMIS operate one vehicle, and about 3 percent of them received a SafeStat rating in categories A through G. (See table 2.) In contrast, fewer than 1 percent of the carriers listed in MCMIS have more than 100 vehicles, and nearly 25 percent of them received a SafeStat rating in categories A through G. Table 2: Size Distribution of Carriers Receiving a SafeStat Rating of A through G: Carrier size (number of vehicles): 1; Number of carriers (percentage[A] ): 317,037 (51%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 8,697 (3%). Carrier size (number of vehicles): >1 to 4; Number of carriers (percentage[A] ): 191,739 (31%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 14,430 (8%). Carrier size (number of vehicles): >4 to 10; Number of carriers (percentage[A] ): 66,422 (11%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 10,595 (16%). Carrier size (number of vehicles): >10 to 25; Number of carriers (percentage[A] ): 28,780 (5%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 6,504 (23%). Carrier size (number of vehicles): >25 to 100; Number of carriers (percentage[A] ): 14,148 (2%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 3,550 (25%). Carrier size (number of vehicles): >100; Number of carriers (percentage[A] ): 3,903 (1%); Number of carriers within size category receiving A through G SafeStat rating (percentage of carriers in size category): 909 (23%). Source: GAO analysis of FMCSA data. Note: The table only includes those carriers listed as having one or more vehicles. [A] Percentages do not equal 100 because of rounding. [End of table] A Statistical Approach Would Better Identify Carriers That Pose High Crash Risks Than Does FMCSA's Current Approach: We found that FMCSA could improve SafeStat's ability to identify carriers that pose high crash risks if it applied a statistical approach, called a negative binomial regression model, to the four SafeStat safety evaluation areas instead of its current approach. Through this change, FMCSA could more efficiently target compliance reviews to the set of carriers that pose the greatest crash risk. Applying a negative binomial regression model would improve the identification of high risk carriers over SafeStat's performance by about 9 percent,[Footnote 13] compared with the current approach, which incorporates safety data weighted in accordance with the professional judgment and experience of SafeStat's designers. Moreover, according to our analysis, this 9 percent improvement would enable FMCSA to identify carriers with almost twice as many crashes in the following 18 months as those carriers identified under its current approach. Targeting these high-risk carriers would result in FMCSA giving compliance reviews to carriers that experienced both a higher crash rate and, in conjunction with the higher crash rate, 9,500 more crashes over an 18- month period than those identified by the SafeStat model. Applying a negative binomial regression model approach to the SafeStat safety evaluation areas would be easy to implement and, in our opinion, would be consistent with other FMCSA uses for SafeStat beyond identifying carriers that pose high risks for crashes. In addition, adopting a negative binomial regression model approach would be beneficial even if FMCSA makes major revisions to its compliance and enforcement approach in the coming years under its Comprehensive Safety Analysis 2010 initiative. Overall, other changes to the SafeStat model that we explored, such as modifying decision rules used in the construction of the safety evaluation areas, did not improve the model's overall performance. Regression Models Identify Carriers That Pose High Crash Risks Better Than Expert Judgment: Although SafeStat is nearly twice as effective as (83 percent better than) random selection in identifying carriers that pose high crash risks[Footnote 14] and, therefore, has value for improving safety, we found that FMCSA could improve SafeStat's ability to identify such carriers by about 9 percent if it applied a negative binomial regression model approach to its analysis of motor carrier safety data. The use of a regression model does not entail assigning the letter categories currently assigned by the SafeStat model. Rather, the model predicts carriers' crash risks, sorts the carriers according to their risk level, and assigns a high priority for a compliance review to the highest risk carriers. The improvement in identification of high-risk carriers, which we observed with the negative binomial regression model, is consistent with results obtained in an earlier analysis of MCMIS data performed by a team of researchers at Oak Ridge National Laboratory.[Footnote 15] To compare the effectiveness of regression models and SafeStat in identifying carriers that pose high crash risks, we applied several regression models to the four safety evaluation areas (accident, driver, vehicle, and safety management) used by the SafeStat model. We recalculated SafeStat's June 2004 accident safety evaluation area values because the data FMCSA provided on the number of crashes for each carrier differed in 2006 from the data used in the model in 2004.[Footnote 16] Using our accident safety evaluation area value and the original driver, vehicle, and safety management safety evaluation area values from June 2004, we selected the 4,989 carriers that our regression models identified as the highest crash risks,[Footnote 17] calculated the crash rate per 1,000 vehicles for these carriers over the next 18 months, and compared this rate with the crash rate per 1,000 vehicles for the 4,989 carriers identified by the SafeStat model as posing high crash risks (categories A and B). All of the regression models that we estimated were at least as effective as SafeStat in identifying motor carriers that posed high crash risks. (See app. III for these results.) Of these, the negative binomial regression approach gave the best results and proved 9 percent more effective than SafeStat, as measured by future crashes per 1,000 vehicles. The set of carriers in SafeStat categories A and B had a crash rate of 102 per 1,000 vehicles for the 18 months after June 2004 while the set of high-risk carriers identified by the negative binomial regression model had 111 crashes per 1,000 vehicles. Even though this 9 percent improvement rate seems modest, it translates into nearly twice as many "future crashes" identified. Specifically, the negative binomial regression model identified carriers that had nearly twice as many crashes (from July 2004 to December 2005) as the carriers identified by SafeStat--19,580 crashes compared with 10,076.[Footnote 18] SafeStat (categories A and B) and our negative binomial regression model identified many of the same carriers--1,924 of the 4,989 (39 percent)--as posing high crash risks. However, our model also identified a number of high-risk carriers that SafeStat did not identify, and vice versa. For example, our model identified 2,244 carriers as posing high crash risks, while SafeStat placed these carriers in category D (the accident area), assigning them a lower priority for compliance reviews. One reason for this difference is the decision rules that SafeStat employs. Under SafeStat, carriers must perform worse than 75 percent of all carriers to be considered deficient in any safety evaluation area. The regression approach identifies the carriers with the highest crash risks regardless of how they compare with their peers in individual areas. For example, we identified as posing high crash risks 482 carriers that SafeStat did not consider at all for compliance reviews because the carriers had not performed worse than 75 percent of their peers in any of the four safety evaluation areas. FMCSA Can Apply a Regression Model Approach in the Short Term, Even Though It Is Planning to Overhaul SafeStat: In the short term, FMCSA could easily implement a regression model approach for SafeStat.[Footnote 19] All the information required as input for the negative binomial regression model is already entered into SafeStat. In addition, a standard statistical package can be used to apply the negative binomial approach to the four SafeStat safety evaluation areas. Like SafeStat, the negative binomial regression model would be run every month to produce a list of motor carriers that pose high crash risks, and these carriers would then be assigned priorities for a compliance review. As with SafeStat, the results of the negative binomial model would change slightly each month with the addition of new safety data to MCMIS. In discussing the concept of adopting a negative binomial regression model approach with FMCSA officials, they were interested in understanding how the use of the negative binomial regression model results could be used to identify and improve the safety of those carriers that pose the greatest crash risks (much as the SafeStat categories of A and B do now) and how it could employ the proposed approach for current uses beyond identifying carriers that pose high crash risks. These uses include providing an understandable public display to shippers, insurers, and others who are interested in the safety of carriers; selecting carriers for roadside inspections; and trying to gain carriers' compliance with driver and vehicle safety rules, when these carriers may not have crashes, consistent with agency efforts. * Identifying and improving the safety of carriers that pose high crash risks. The negative binomial regression model approach would produce a rank order listing of carriers by crash risk and by the predicted number of crashes. For compliance reviews, FMCSA could choose those carriers with the greatest number of predicted crashes. FMCSA would choose the number of carriers to review based on the resources available to it, much as it currently does. Regarding improving the safety of carriers that pose high crash risks, FMCSA currently enrolls carriers that receive a SafeStat category of A, B, or C in the Motor Carrier Safety Improvement Program. This program aims to improve the safety of high-risk carriers through (1) a repetitive cycle of identification, data gathering, and assessment and (2) progressively harsher treatments applied to carriers that do not improve their safety. The use of a negative binomial regression model would not affect the structure or workings of this program, other than to better identify carriers that pose high crash risks. As discussed above, FMCSA would use the regression model's results to identify the highest risk carriers and then intervene using its existing approaches (such as issuing warning letters, conducting follow-up compliance reviews, or levying civil penalties) as treatment. * Providing an understandable display to the public. FMCSA could choose to provide a rank order listing of carriers together with the associated number of predicted crashes or it could look for natural breaks in the predicted number of crashes and associate a category-- such as "category A" to these carriers. * Selecting carriers for roadside inspections. Safety rankings from the SafeStat model are also used in FMCSA's Inspection Selection System to prioritize carriers for roadside driver and vehicle inspections. The negative binomial regression model optimizes the identification of carriers by crash risk using safety evaluation area information. The negative binomial regression model approach that we describe in this report retains SafeStat's basic design with four safety management areas (driver, vehicle, accident, and safety management). Therefore, FMCSA could use the negative binomial regression model results to identify carriers that pose a high crash risk, the results from the driver and vehicle safety evaluation areas, or both, to target carriers or vehicles for roadside driver and vehicle inspections. * Furthering agency efforts to gain compliance with driver and vehicle safety rules for carriers that do not experience crashes (or a sufficient number of crashes to pose a high risk for crashes). FMCSA was interested in understanding how, if at all, the negative binomial regression model approach would affect gaining compliance against carriers that may routinely violate safety rules (such as drivers' hours of service requirements), but where these violations do not lead to crashes. As discussed above, the negative binomial regression model approach retains SafeStat's four safety evaluation areas. Where it differs, is that it assigns different weights to those areas based on a statistical procedure, rather than having the weights assigned by expert judgment. As a result, FMCSA would still be able to identify carriers with many driver, vehicle, and safety management violations. Other opportunities also exist for FMCSA to improve the ability of regression models to identify carriers that pose high crash risks. In 2005, a FMCSA compliance review work group reported a positive correlation between driver hours of service violations and crash rates.[Footnote 20] Because FMCSA can link violations of specific regulatory provisions, including those limiting driver hours of service, to the crash experience of the carriers involved, it has the opportunity to improve the violation severity weighting used in constructing the driver and vehicle safety evaluation areas. FMCSA has detailed violation data from roadside inspections and can statistically analyze these data to find other strong relationships with carriers' crash risks. Changes made to the safety evaluation area methodology to strengthen the association with crash risk will improve the ability of the negative binomial regression model to identify carriers that pose high crash risks. FMCSA has expressed doubts in the past when analysts have proposed switching to a regression model approach. For example, Oak Ridge National Laboratory advocated using a regression model approach in place of SafeStat in 2004, but FMCSA was reluctant to move away from its expert judgment model because it believed that the regression model approach would place undue weight on the accident safety evaluation area in determining priorities for compliance reviews,[Footnote 21] thereby diminishing the incentive for motor carriers to comply with the many safety regulations that feed into the driver, vehicle, and safety management safety evaluation areas. In FMCSA's view, carriers would be less likely to comply with these regulations because violations in the driver, vehicle, and safety management areas would be less likely to lead to compliance reviews under a regression model approach that placed a heavy emphasis on crashes. Our view is that adopting a negative binomial regression model approach would better identify carriers that pose high crash risks and would thus further FMCSA's primary mission of ensuring safe operating practices among commercial interstate motor carriers. Over the longer term, FMCSA is considering a complete overhaul of its safety fitness determinations with its Comprehensive Safety Analysis 2010 initiative. This planned comprehensive review and analysis of the agency's compliance and enforcement programs may result in a new operational model for identifying drivers and carriers that pose safety problems and for intervening to address those problems.[Footnote 22] FMCSA expects to deploy the results of this initiative in 2010. In our opinion, given the relative ease of adopting the regression modeling approach discussed in this report,[Footnote 23] and the immediate benefits that can be achieved, there is no reason to wait for FMCSA to complete its initiative, even if the initiative results in major revisions to the SafeStat model. Modifications of SafeStat Did Not Improve Crash Identification: Besides investigating whether the use of regression models could improve SafeStat's ability to identify carriers that pose high crash risks, we explored whether the existing model could be improved by changing several of its decision rules. Overall, these changes did not enhance the model's ability to identify carriers that pose high crash risks. As long as FMCSA continues to estimate the safety evaluation area values with its present methodology, the rules we investigated help make the identification of high-risk motor carriers more efficient for both SafeStat and the negative binomial regression model. Because the SafeStat model is composed of many components, we selected three decision rules for analysis. We chose these three rules because they are important pillars of the SafeStat model's methodology for constructing the safety evaluation areas and because we could complete our analysis of them during the time we had to perform our work. A fuller exploration of areas with high potential to improve the identification of carriers that pose high crash risks would be a long- term effort, and FMCSA plans to address this work as part of the Comprehensive Safety Analysis 2010 initiative. * Removing comparison groups. As part of its methodology for calculating the accident, driver, and vehicle safety evaluation area values, SafeStat divides carriers into comparison groups. For example, in the driver safety evaluation area, SafeStat groups carriers by the number of moving violations they have, placing them in one of four groups (3 to 9, 10 to 28, 29 to 94, and 95 or more).[Footnote 24] SafeStat uses the comparison groups to control for the size of the carrier. We removed all the comparison groups in each of the three safety evaluation areas, recalculated their values, and compared the number of crashes in which the carriers were involved and their crash rates, for each of the SafeStat categories A through H, with the SafeStat results in which comparison groups were retained. * Removing minimum event requirements. SafeStat imposes minimum event requirements. For example, as noted, SafeStat does not consider a carrier's moving violations if, in the aggregate, its drivers had fewer than three moving violations over a 30-month period. FMCSA does not calculate a safety evaluation area value for carriers with fewer than three events in an attempt to control for carriers that have infrequent, rather than possibly systemic, safety problems.[Footnote 25] We removed the requirement to have a minimum number of events (such as moving violations, crashes, and inspections), recalculated the three safety evaluation values, and compared the number of crashes in which the carriers were involved and their crash rates, for each of the SafeStat categories A through H, with the SafeStat results in which minimum event requirements were retained. * Removing time and severity weights. The SafeStat formula weights more recent events and more severe events more heavily than less recent or less severe events in the accident, driver, and vehicle safety evaluation areas. For example, the results of vehicle roadside inspections performed within the latest 6 months receive three times the weight of inspections performed 2 years ago. Similarly, crashes involving deaths or injuries receive twice as much weight as those that resulted in property damage only. We removed the time and severity weights for the three safety evaluation areas, recalculated these values, and compared the number of crashes in which the carriers were involved and their crash rates, for each of the SafeStat categories A through H, with the SafeStat results in which time and severity weights were retained. * Simultaneous changes to comparison group, event, and time severity requirements. Finally, we simultaneously removed comparison groups, minimum event requirements, and time and severity weights and compared the number of crashes in which the carriers were involved and their crash rates, for each of the SafeStat categories A through H, with the SafeStat results in which comparison groups, minimum event requirements, and time and severity weights were retained. The results of each of our individual analyses and of making all changes simultaneously produced one of two outcomes, neither of which was considered more desirable. Relaxing the minimum data requirements greatly increased the number of carriers identified as high risk without increasing the overall number of predicted crashes over the subsequent 18 months, thus reducing the effectiveness of the SafeStat model. Removing comparison groups and removing time and severity weights had the effect of reducing the future crashes per 1,000 vehicles among those carriers identified as high risk, also reducing the effectiveness of the SafeStat model. As a result, we are not reporting on these results in detail. Trying to modify the decision rules used in SafeStat did highlight the balance that FMCSA has to strike between maximizing the identification of companies with the largest number of crashes (usually larger carriers) and those carriers with the greatest safety risk (which can be of any size). Despite Quality Problems, FMCSA's Crash Data Can Be Used to Compare Methods for Identifying Carriers That Pose High Crash Risks: The quality of crash data is a long-standing problem that potentially hindered FMCSA's ability to accurately identify carriers that pose high crash risks.[Footnote 26] Despite the problems of late-reported crashes and incomplete and inaccurate data on crashes during the period we studied, we determined that the data were of sufficient quality for our use, which was to assess how the application of regression models might improve the ability to identify high-risk carriers over the current approach--not to determine absolute measures of crash risk. Our reasoning is based on the fact that we used the same data set to compare the results of the SafeStat model and the regression models. Limitations in the data would apply equally to both results. FMCSA has recently undertaken a number of efforts to improve crash data quality. Late Reporting Had a Small Effect on SafeStat's Ability to Identify High-Risk Carriers: FMCSA's guidance provides that states report all crashes to MCMIS within 90 days of their occurrence. Late reporting can cause SafeStat to miss some of the carriers that should have received a SafeStat score. Alternatively, since SafeStat's scoring involves a relative ranking of carriers, a carrier may receive a SafeStat score and have to undergo a compliance review because crash data for a higher risk carrier were reported late and not included in the calculation. Late reporting affected SafeStat's ability to identify all high-risk carriers to a small degree--about 6 percent---for the period that we studied. Late reporting of crashes by states affected the safety rankings of more than 600 carriers, both positively and negatively. When SafeStat analyzed the 2004 data, which did not include the late- reported crashes, it identified 4,989 motor carriers as highest risk, meaning they received a category A or B ranking. With the addition of late-reported crashes, 481 carriers moved into the highest risk category, and 182 carriers dropped out of the highest risk category, resulting in a net increase of 299 carriers (6 percent) in the highest risk category. After the late-reported crashes were added, 481 carriers that originally received a category C, D, E, F, or G SafeStat rating received an A or B rating. These carriers would not originally have been given a high priority for a compliance review because the SafeStat calculation did not take into account all of their crashes. On the other hand, a small number of carriers would have received a lower priority if the late-reported crashes had been included in their score. Specifically, 182 carriers - or fewer than 4 percent of those ranked, fell from the A or B category into the C, D, E, F, or G category once the late-reported crashes were included.[Footnote 27] These carriers would not have been considered high priority for a compliance review if all crashes had been reported on time. This does not have a big effect on the overall motor carrier population, however, as only 4 percent of carriers originally identified as highest risk were negatively affected by late reporting. The timeliness of crash reporting has shown steady and marked improvement. The median number of days it took states to report crashes to MCMIS dropped from 225 days in calendar year 2001 to 57 days in 2005 (the latest data available at the time of our analysis).[Footnote 28] In addition, the percentage of crashes reported by states within 90 days of occurrence has jumped from 32 percent in fiscal year 2000 to 89 percent in fiscal year 2006. (See fig. 2.) Figure 2: Percentage of Crashes Submitted to MCMIS within 90 Days of Occurrence: [See PDF for image] Source: GAO analysis of FMCSA data. [End of figure] Incomplete Data from States Potentially Limit SafeStat's Identification of All Carriers That Pose High Crash Risks: FMCSA uses a motor carrier identification number, which is unique to each carrier, as the primary means of linking inspections, crashes, and compliance reviews to motor carriers. Approximately 184,000 (76 percent) of the 244,000 crashes reported to MCMIS between December 2001 and June 2004 involved interstate carriers. Of these 184,000 crashes, nearly 24,000 (13 percent) were missing this identification number. As a result, FMCSA could not match these crashes to motor carriers or use them in SafeStat. In addition, the carrier identification number could not be matched to a number listed in MCMIS for 15,000 (8 percent) other crashes that involved interstate carriers. Missing data or being unable to match data for nearly one quarter of the crashes during the period of our review potentially has a large impact on a motor carrier's SafeStat score because SafeStat treats crashes as the most important source of information for assessing motor carrier crash risk. Theoretically, information exists to match crash records to motor carriers by other means, but such matching would require too much manual work to be practicable. We were not able to quantify the actual effect of either the missing data or the data that could not be matched for MCMIS overall. To do so would have required us to gather crash records at the state level--an effort that was impractical. For the same reason, we cannot quantify the effects of FMCSA's efforts to improve the completeness of the data (discussed later). However, a series of reports by the University of Michigan Transportation Research Institute sheds some light on the completeness of the data submitted to MCMIS by the states.[Footnote 29] One of the goals of the research was to determine the states' crash reporting rates. Reporting rates varied greatly among the 14 states studied, ranging from 9 percent in New Mexico in 2003 to 87 percent in Nebraska in 2005. It is not possible to draw wide-scale conclusions about whether state reporting rates are improving over time because only two of the states--Missouri and Ohio---were studied in multiple years. However, in these two states, the reporting rate did improve. Missouri experienced a large improvement in its reporting rate, with 61 percent of eligible crashes reported in 2001, and 83 percent reported in 2005. Ohio's improvement was more modest, increasing from 39 percent in 2000 to 43 percent in 2005. The University of Michigan Transportation Research Institute's reports also identified a number of factors that may affect states' reporting rates. One of the main factors affecting reporting rates is the reporting officer's understanding of crash reporting requirements. The studies note that reporting rates are generally lower for less serious crashes and for crashes involving smaller vehicles, which may indicate that there is some confusion about which crashes are reportable. Some states, such as Missouri, aid the officer by explicitly listing reporting criteria on the police accident reporting form, while other states, such as Washington, leave it up to the officer to complete certain sections of the form if the crash is reportable, but the form includes no guidance on reportable crashes. Yet other states, such as North Carolina and Illinois, have taken this task out of officers' hands and include all reporting elements on the police accident reporting form. Reportable crashes are then selected centrally by the state, and the required data are transmitted to MCMIS. Inaccurate Data Potentially Limit SafeStat's Ability to Identify Carriers That Pose High Crash Risks: Inaccurate data, such as reporting a nonqualifying crash to FMCSA, potentially has a large impact on a motor carrier's SafeStat score because SafeStat treats crashes as the most important source of information for assessing motor carrier crash risk. For the same reasons as discussed in the preceding section, we were neither able to quantify these effects nor determine how data accuracy has improved for MCMIS overall. The University of Michigan Transportation Research Institute's reports on crash reporting show that, among the 14 states studied, incorrect reporting of crash data is widespread. In recent reports, the researchers found that, in 2005, Ohio incorrectly reported 1,094 (22 percent) of the 5,037 cases, and Louisiana incorrectly reported 137 (5 percent) of the 2,699 cases. In Ohio, most of the incorrectly reported crashes did not qualify because they did not meet the crash severity threshold. In contrast, most of the incorrectly reported crashes in Louisiana did not qualify because they did not involve vehicles eligible for reporting. Other states studied by the institute had similar problems with reporting crashes that did not meet the criteria for reporting to MCMIS. These additional crashes could cause some carriers to exceed the minimum number of crashes required to receive a SafeStat rating and result in SafeStat's mistakenly identifying carriers as posing high crash risks. Because each report focuses on reporting in one state in a particular year, it is not possible to identify the number of cases that have been incorrectly reported nationwide and, therefore, it is not possible to determine the impact of inaccurate reporting on SafeStat's calculations. As noted in the University of Michigan Transportation Research Institute's reports, states may be unintentionally submitting incorrect data to MCMIS because of difficulties in determining whether a crash meets the reporting criteria. For example, in Missouri, pickups are systematically excluded from MCMIS crash reporting, which may cause the state to miss reportable crashes. However, some pickups may have vehicle weights above the reporting threshold, making crashes involving them eligible for reporting. There is no way for the state to determine which crashes involving pickups qualify for reporting without examining the characteristics of each vehicle. In this case, the number of omissions is likely to be relatively small, but this example demonstrates the difficulty states may face when identifying reportable crashes. In addition, in some states, the information contained in the police accident report may not be sufficient for the state to determine if a crash meets the accident severity threshold. It is generally straightforward to determine whether a fatality occurred as a result of a crash, but it may be difficult to determine whether an injured person was transported for medical attention or a vehicle was towed because of disabling damage. In some states, such as Illinois and New Jersey, an officer can indicate on the form if a vehicle was towed by checking a box, but there is no way to identify whether the reason for towing was disabling damage. It is likely that such uncertainty results in overreporting because some vehicles may be towed for other reasons. FMCSA Has Undertaken Efforts to Improve Crash Data Quality: FMCSA has taken steps to try and improve the quality of crash data reporting. As we noted in November 2005, FMCSA has undertaken two major efforts to help states improve the quality of crash data.[Footnote 30] One program, the Safety Data Improvement Program, has provided funding to states to implement or expand activities designed to improve the completeness, timeliness, accuracy, and consistency of their data. FMCSA has also used a data quality rating system to rate and display ratings for states' crash and inspection data quality. Due to its public nature, this map serves as an incentive for states to make improvements in their data quality. To further improve these programs, FMCSA has made additional grants available to states and implemented our recommendations to (1) establish specific guidelines for assessing states' requests for funding to support data improvement in order to better assess and prioritize the requests and (2) increase the usefulness of its state data quality map as a tool for monitoring and measuring commercial motor vehicle crash data by ensuring that the map adequately reflects the condition of the states' commercial motor vehicle crash data. In February 2004, FMCSA implemented Data Q's, an online system that allows for challenging and correcting erroneous crash or inspection data. Users of this system include motor carriers, the general public, state officials, and FMCSA. In addition, in response to a recent recommendation by the Department of Transportation Inspector General, FMCSA is planning to conduct a number of evaluations of the effectiveness of a training course on crash data collection that it will be providing to states by September 2008. While the quality of crash reporting is sufficient for use in identifying motor carriers that pose high crash risks and has started to improve, commercial motor vehicle crash data continue to have some problems with timeliness, completeness, and accuracy. These problems have been well-documented in several studies, and FMCSA is taking steps to address the problems through studies of each state's crash reporting system and grants to states to fund improvements. As a result, we are not making any recommendations in this area. Conclusion: Interstate commerce involving large trucks and buses has been growing substantially, and this growth is expected to continue. While the number of fatalities per million vehicle miles traveled has generally decreased over the last 30 years, the fatality rate has leveled off and remained fairly steady since the mid-1990s. FMCSA could more effectively address fatalities due to crashes involving a commercial motor vehicle if it better targeted compliance reviews to those carriers that pose the greatest crash risks. Using a negative binomial regression model would further FMCSA's mission of reducing crashes through the more effective targeting of compliance reviews to the set of carriers that pose the greatest crash risks. In light of possible changes to FMCSA's safety fitness determinations resulting from its Comprehensive Safety Analysis 2010 initiative, we are not suggesting that FMCSA undertake a complete and thorough investigation of SafeStat. Rather, we are advocating that FMCSA apply a statistical approach that employs the negative binomial regression model rather than relying on the current SafeStat formula that was determined through expert judgment. In our view, the substitution of a statistically based approach would likely yield a markedly better ability to identify carriers that pose high crash risks with relatively little time or effort on FMCSA's part. Recommendation for Executive Action: We recommend that the Secretary of Transportation direct the Administrator of FMCSA to apply a negative binomial regression model, such as the one discussed in this report, to enhance the current SafeStat methodology. Agency Comments and Our Evaluation: We provided a draft of this report to the Department of Transportation for its review and comment. In response, departmental officials, including FMCSA's Director of the Office of Enforcement and Compliance and Director of the Office of Research and Analysis, noted that our report provided useful insights and offered a potential avenue for further improving the effectiveness of FMCSA's efforts to reduce crashes involving motor carriers. The agency indicated that it is already working to improve upon SafeStat as part of its Comprehensive Safety Analysis 2010 initiative. FMCSA agreed that it would be useful for it to consider whether there are both short and longer term measures that would incorporate the type of analysis identified in our report, as an adjunct to the SafeStat model, in order to better target compliance reviews so as to make the best use of FMCSA's resources to reduce crashes. The agency expressed some concerns with the negative binomial regression analysis, noting that its intent is to effectively target its compliance activities based on a broader range of factors than is considered in the negative binomial regression analysis approach described in our draft report, which increases reliance on past crashes as a predictor of future crashes while apparently de-emphasizing known driver, vehicle, or safety management compliance issues. FMCSA told us that it incorporates a broad range of information including driver behavior, vehicle condition, and safety management in an attempt to capture and enable the agency to act on accident precursors in order to reduce crashes. FMCSA is correct in concluding that the use of the negative binomial regression approach could tilt enforcement heavily toward carriers that have experienced crashes and away from other aspects of its problem areas, such as violation of vehicle safety standards, that are intended to prevent crashes. That is because the present SafeStat model does not statistically assign weights to the accident, driver, vehicle, and safety management areas. In addition, the negative binomial regression approach fully considers information on the results of driver and vehicle inspection data and safety management data. We used the same data that FMCSA used, with some adjustments as new information became available. While we found that the driver, vehicle, and safety management evaluation area scores are correlated with the future crash risk of a carrier, the accident evaluation area correlates the most with future crash risk. We recognize that FMCSA selects carriers for compliance reviews for multiple reasons, such as to respond to complaints, and we would expect that it would retain this flexibility if it adopted the negative binomial regression approach. FMCSA also indicated that greater reliance on crash data increases emphasis on the least reliable available data set, and one that is out of the organization's direct control--crash reporting. While our draft report found that crash reporting has improved, and that late reporting has not significantly impaired FMCSA's use of the SafeStat model, FMCSA noted that the reliance on previous crashes in the negative binomial regression analysis described in our draft report could result in greater sensitivity to the crash data quality issues. As FMCSA noted in its comments, our results showed that the effect of late-reported data was minimal. Also, as mentioned in our draft report and in this final report, it was not practical to determine the effect, if any, on SafeStat rankings of correcting inaccurate data or adding incomplete data. Since June 2004, FMCSA has devoted considerable efforts to improving the quality of the crash data it receives from the states. States are now tracked quarterly for the completeness, timeliness, and accuracy of their crash reporting. As FMCSA continues its efforts to have states improve these data, any sensitivity of results to crash data quality issues for the negative binomial regression approach should diminish. We are sending copies of this report to congressional committees and subcommittees with responsibility for surface transportation safety issues; the Secretary of Transportation; the Administrator, FMCSA; and the Director, Office of Management and Budget. We also will make copies available to others upon request. In addition, this report will be available at no charge on the GAO Web site at http://www.gao.gov. If you have any questions about this report, please either contact Sidney H. Schwartz at (202) 512-7387 or Susan A. Fleming at (202) 512- 2834. Alternatively, they may be reached at schwartzsh@gao.gov or flemings@gao.gov. Contact points for our Offices of Congressional Relations and Public Affairs may be found on the last page of this report. Staff who made key contributions to this report are Carl Barden, Elizabeth Eisenstadt, Laurie Hamilton, Lisa Mirel, Stephanie Purcell, and James Ratzenberger. Signed by: Sidney H. Schwartz: Director Applied Research and Methods: Signed by: Susan A. Fleming: Director Physical Infrastructure Issues: [End of section] Appendix I: Results of Other Assessments of the SafeStat Model's Ability to Identify Motor Carriers That Pose High Crash Risks: Several studies by the Volpe National Transportation Systems Center (Volpe), the Department of Transportation's Office of Inspector General, the Oak Ridge National Laboratory (Oak Ridge), and others have assessed the predictive capability of the Motor Carrier Safety Status Measurement System (SafeStat) model and the data used by that model. In general, those studies that assessed the predictive power of SafeStat offered suggestions to increase that power, and those studies that assessed data quality found weaknesses in the data that the Federal Motor Carrier Safety Administration (FMCSA) relies upon. Assessments of SafeStat's Predictive Capability: The studies we reviewed covered topics such as comparing SafeStat with random selection to determine which does a better job of selecting carriers that pose high crash risks, assessing whether statistical approaches could improve that selection, and analyzing whether carrier financial positions or driver convictions are associated with crash risk. Predictive Capability of SafeStat Compared with Random Selection: In studies of the SafeStat model published in 2004 and 1998,[Footnote 31] Volpe analyzed retrospective data to determine how many crashes the carriers in SafeStat categories A and B experienced over the following 18 months. The 2004 study used the carrier rankings generated by the SafeStat model on March 24, 2001. Volpe then compared the SafeStat carrier safety ratings with state-reported data on crashes that occurred between March 25, 2001, and September 24, 2002, to assess the model's performance. For each carrier, Volpe calculated a total number of crashes, weighted for time and severity, and then estimated a crash rate per 1,000 vehicles for comparing carriers in SafeStat categories A and B with the carriers in other SafeStat categories. The 1998 Volpe study used a similar methodology. Each study used a constrained subset of carriers rather than the full list contained in the Motor Carrier Management Information System (MCMIS).[Footnote 32] Both studies found that the crash rate for the carriers in SafeStat categories A and B was substantially higher than the other carriers during the 18 months after the respective SafeStat run. On the basis of this finding, Volpe concluded that the SafeStat model worked. In response to a recommendation by the Department of Transportation's Office of Inspector General,[Footnote 33] FMCSA contracted with Oak Ridge to independently review the SafeStat model. Oak Ridge assessed the SafeStat model's performance and used the same data set (for March 24, 2001), provided by Volpe, that Volpe had used in its 2004 evaluation. Perhaps not surprisingly, Oak Ridge obtained a similar result for the weighted crash rate of carriers in SafeStat categories A and B over the 18-month follow-up period. As with the Volpe study, the Oak Ridge study was constrained because it was based on a limited data set rather than the entire MCMIS data set. Application of Regression Models to Safety Data: While SafeStat does better than simple random selection in identifying carriers that pose high crash risks, other methods can also be used to achieve this outcome. Oak Ridge extended Volpe's analysis by applying regression models to identify carriers that pose high crash risks. Specifically, Oak Ridge applied a Poisson regression model and a negative binomial model using the safety evaluation area values as independent variables to a weighted count of crashes that occurred in the 30 months before March 24, 2001. (For more information on statistical analyses, see app. III.) In addition, Oak Ridge applied the empirical Bayes method to the negative binomial regression model and assessed the variability of carrier crash counts by estimating confidence intervals. Oak Ridge found that the negative binomial model worked well in identifying carriers that pose high crash risks. However, the data set Oak Ridge used did not include any carriers with one reported crash in the 30 months before March 24, 2001. Because data included only carriers with zero or two or more reported crashes, the distribution of crashes was truncated. Since the Oak Ridge regression model analysis did not cover carriers with safety evaluation area data and one reported crash, the findings from the study are limited in their generalizability. However, other analyses of crashes at intersections and on road segments have also found that the negative binomial regression model works well.[Footnote 34] In addition, our analysis using a more recent and more comprehensive data set supports the finding that the negative binomial regression model performs better than the SafeStat model. The studies carried out by other authors advocate the use of the empirical Bayes method in conjunction with a negative binomial regression model to estimate crash risk. Oak Ridge also applied this model to identify motor carriers that pose high crash risks. We applied this method to the 2004 SafeStat data and found that the empirical Bayes method best identified the carriers with the largest number of crashes in the 18 months after June 25, 2004. However, the crash rate per 1,000 vehicles was much lower than that for carriers in SafeStat categories A and B. We analyzed this result further and found that although the empirical Bayes method best identifies future crashes, it is not as effective as the SafeStat model or the negative binomial regression model in identifying carriers with the highest future crash rates. The carriers identified with the empirical Bayes method were invariably the largest carriers. This result is not especially useful from a regulatory perspective. Companies operating a large number of vehicles often have more crashes over a period of time than smaller companies. However, this does not mean that the larger company is necessarily violating more safety regulations or is less safe than the smaller company. For this reason, we do not advocate the use of the empirical Bayes method in conjunction with the negative binomial regression model as long as the method used to calculate the safety evaluation area values remains unchanged. If changes are made in how carriers are rated for safety, this method may in the future offer more promise than the negative binomial regression model alone. Relationship of Carrier Financial Data and Safety Risk: Conducted on behalf of FMCSA, a study by Corsi, Barnard, and Gibney in 2002 examined how a carrier's financial performance data correlate with the carrier's score on a compliance review.[Footnote 35] The authors selected those motor carriers from MCMIS in December 2000 that had complete data for the accident, driver, vehicle, and safety management safety evaluation areas. Using these data, the authors then matched a total of 700 carriers to company financial statements in the annual report database of the American Trucking Associations.[Footnote 36] The authors created a binary response variable for whether the carrier received a satisfactory or an unsatisfactory outcome on the compliance review. The authors then assessed how this result correlated with financial measures derived from the company financial statements. In general, the study found that indicators of poor financial condition correlated with an increased safety risk. Two practical considerations limit the applicability of the findings from this study to SafeStat. First, the 700 carriers in the study sample are not necessarily representative of the motor carriers that FMCSA oversees. Only about 2 percent of the carriers evaluated by the SafeStat model in June 2004 had a value for the safety management safety evaluation area. Of these carriers, not all had complete data for the other three safety evaluation areas. Second, FMCSA does not receive annual financial statements from all motor carriers.[Footnote 37] For these reasons, we did not consider using carrier financial data in our analysis of the SafeStat data. Relationship of Commercial Driver License Convictions and Crash Risk: A series of studies by Lantz and others examined the effect of incorporating conviction data from the state-run commercial driver license data system into the calculation of a driver conviction measure.[Footnote 38] The studies found that the driver conviction measure is weakly correlated with the crash per vehicle rate.[Footnote 39] However, the studies did not incorporate the proposed driver conviction measure into one of the existing safety evaluation areas and use the updated measure to estimate new SafeStat scores for carriers. While the use of commercial driver license conviction data may have potential for future incorporation into a model for identifying carriers that pose high crash risks, there is no assessment of its impact at this time. Impact of Data Quality on SafeStat's Predictive Capability: The 2004 Office of Inspector General report, the 2004 Oak Ridge study, and reports by the University of Michigan Transportation Research Institute on state crash reporting all examined the impact of data quality on SafeStat's ability to identify carriers that pose high crash risks. These studies looked at issues such as late reporting and incomplete or inaccurate reporting of crash data and found weaknesses. Late Reporting of Crash Data: To determine whether states promptly report SafeStat data, the Office of Inspector General conducted a two-stage statistical sample in which it selected 10 states for review and then selected crash and inspection reports from those states for examination. It sampled 392 crash records and 400 inspection records from July through December 2002. In 2 of the 10 states selected, Pennsylvania and New Mexico, no crash records were available for the sample period, so it selected samples from earlier periods. The Office of Inspector General also discussed reporting issues with state and FMCSA officials and obtained crash records from selected motor carriers. In addition, the Office of Inspector General used the coefficient of variation to analyze data consistency and trends in reporting timeliness across geographic regions.[Footnote 40] Our review of the study indicates that it was based on sound audit methodology. The study found that, as of November 2002, states submitted crash reports in fiscal year 2002 an average of 103 days after the crash occurred and that states varied widely in the timeliness of their crash data reporting. (FMCSA requires that states report crashes no more than 90 days after they occur.) In addition, the study found that 20 percent of the crashes that occurred in fiscal year 2002 were entered into MCMIS 6 months or more after the crash occurred. On the basis of this information, the Office of Inspector General concluded that the calculation of the accident safety evaluation area value was affected by the location of the carrier's operations but did not estimate the degree of this effect. We also assessed the extent of late reporting. We measured how many days, on average, it took each state to report crashes to MCMIS in each calendar year and found that the amount of time taken to report crashes declined from 2000 to 2005. Our findings were similar in nature to the Office of Inspector General's findings. However, our results are broader because they are based on all crash data rather than a sample. In addition, since our work is more recent, it reflects more current conditions. We both came to the conclusion, although to varying degrees, that late reporting of crash data by states negatively affects SafeStat's identification of carriers that pose high crash risks. Oak Ridge also examined the impact of late reporting. Using data provided by Volpe, Oak Ridge looked at the difference between the date a crash occurred and the date it was entered into MCMIS. The researchers found that after 497 days, 90 percent of the reported crashes were entered into MCMIS. The Oak Ridge study also reran the SafeStat model for March 2001 with the addition of crash data from March 2003 to see how more complete data changed SafeStat scores. The study found that the addition of late- reported data increased the number of carriers in the high-risk group by 18 percent. This late reporting affected the rankings of 8 percent of all the carriers ranked by SafeStat in March 2001. Of these affected carriers, 3 percent moved to a lower SafeStat category, and 5 percent moved to a higher category. Including the late-reported crash data available in March 2003 for the period from September 1998 through March 2001 resulted in a 35 percent increase in the available crash data. We performed the same analysis as the Oak Ridge study and obtained similar results. We used SafeStat data from June 2004, which include carrier safety data from December 2001 through June 2004. Using FMCSA's master crash file from June 2006, we found that, with the addition of late-reported crashes, 481 carriers moved into the highest risk category, and 182 carriers dropped out of the highest risk category resulting in a net increase of 299 carriers (6 percent) being added to the highest risk category. The University of Michigan Transportation Research Institute issued a series of reports examining crash reporting rates in 14 states. These reports looked at late reporting as a potential source of low crash reporting rates but did not specifically examine the extent of late reporting or the impact of late reporting on SafeStat scores. The institute looked at reporting rates in each of the states by month to determine if reporting rates were lower in the latter part of the year because of late reporting. It found that reporting rates were lower in the latter part of the year in 6 of the 14 states studied. This issue was not a focus of our efforts, so we did not conduct a similar analysis. Incomplete and Inaccurate Reporting of Crash Data: The Office of Inspector General's study found several instances of incomplete or inaccurate data on crashes and carriers. The study reviewed MCMIS reporting for all states and found that 6 of them did not report any crashes to FMCSA in the 6-month period from July through December 2002. In addition, the study found that MCMIS listed about 11 percent of carriers as having no vehicles, and 15 percent as having no drivers. Finally, from a sample of crash records, the study estimated that 13 percent of the crash reports and 7 percent of the inspection reports in MCMIS contained errors that would affect SafeStat results. In particular, the study concluded that the database identified the wrong motor carrier as having been involved in a crash or as having received a violation in 11 percent of the erroneous records. The University of Michigan Transportation Research Institute also examined the accuracy of states' crash data reporting. To determine if crashes were reported accurately, the institute compared information contained in the individual states' police accident reporting files with crash data reported to MCMIS. Some states, such as Ohio, had enough information captured in the police accident file to determine if individual crashes were eligible for reporting, and, therefore, the institute was able to use these data in its analyses. In other states, not enough information was available to make a determination, and the institute had to project results on the basis of other states' experience. The institute also carried out a number of analyses, such as comparing reporting rates for different reporting jurisdictions, in an attempt to identify reporting trends in the individual states. The institute identified several problems with the accuracy of states' crash reporting. All 14 states that it studied reported ineligible crashes to MCMIS. These crashes were ineligible because they either involved vehicles not eligible for reporting or they did not meet the crash severity threshold. In total, the 14 states reported nearly 5,800 ineligible crashes to MCMIS out of almost 68,000 crashes reported (9 percent). The states also failed to report a number of eligible crashes: the 14 states studied reported from 9 percent to 87 percent of eligible crashes. Our review of the institute's methodology indicates that its findings are based on sound methodology and that its analyses were very thorough. However, its studies are limited to the 14 states studied and to the particular year studied. (Not all studies covered the same year.) These states' experience may or may not be representative of the experiences of the entire country, and there is no way to determine if the reporting for this year is representative of the state's reporting activities over a number of years or if the results were unique to that particular year. The exceptions to this are the studies for Missouri, which covered calendar years 2001 and 2005, and Ohio, which covered calendar years 2000 and 2005. We did not attempt to assess the extent of inaccurate reporting in individual states, but we did find examples of inaccurate data reporting. To analyze the completeness of reporting, we attempted to match all crash records in the MCMIS master crash file for crashes occurring between December 26, 2001, and June 25, 2004, to the list of motor carriers in the MCMIS census file. We found that Department of Transportation numbers were missing for 30 percent of the crashes that were reported, and the number did not match a Department of Transportation number listed in MCMIS for 8 percent of reported crashes. We also compared the number of crashes in MCMIS with the number in the General Estimates System produced by the National Highway Traffic Safety Administration and found evidence of underreporting of crashes to MCMIS.[Footnote 41] [End of section] Appendix II: Scope and Methodology: To determine whether statistical approaches could be used to improve FMCSA's ability to identify carriers that pose high crash risks, we tested a variety of regression models and compared their results with results from the existing SafeStat model. The models we tested, using MCMIS data used by SafeStat in June 2004 to identify carriers that pose high crash risks, include the Poisson, negative binomial, zero-inflated negative binomial, zero-inflated Poisson, and empirical Bayes. We chose these regression models because crash totals for a company represent count outcomes, and these statistical models are appropriate for use with count data. In addition, we explored logistic regression to assess the odds of having a crash. Based on the results of the statistical models, we ranked the predicted means (or predicted probabilities in the logistic regression) to see which carriers would be at risk during the 18-month period after June 2004. We selected June 2004 because this date enabled us to examine MCMIS data on actual crashes that occurred in the 18-month period from July 2004 through December 2005.[Footnote 42] We used these data to determine the degree to which SafeStat identified carriers that proved to pose high crash risks. We then compared the predictive performance of the regression models with the performance of SafeStat to determine which method best identified carriers that pose high crash risks. Using a series of simple random samples,[Footnote 43] we also calculated the crash rates of all carriers listed in the main SafeStat summary results table in MCMIS for comparison with the crash rates of carriers identified by SafeStat as high risk. We did this analysis to determine whether the SafeStat model did a better job than random selection of identifying motor carriers that pose high crash risks. In addition, we tested changes to selected portions of the SafeStat model to see whether improvements could be made in the identification of high-risk motor carriers. In one analysis, we modified the calculation of the safety evaluation area values and compared the number of high-risk motor carriers identified with the number identified by the unmodified safety evaluation areas. For example, we included carriers with only one crash in the calculation of the accident safety evaluation area whereas the unmodified SafeStat model includes only carriers with two or more crashes. We also investigated the effect of removing the time and severity weights from the indexes used to construct the accident, driver, and vehicle safety evaluation areas. We then compared the result of using the modified and unmodified safety evaluation area values to determine if this modification improved the model's ability to identify future crash risks. To assess the extent to which the timeliness, completeness, and accuracy of MCMIS and state-reported crash data affect SafeStat's performance, we carried out a series of analyses with the MCMIS crash master file and MCMIS census file, as well as surveying the literature to assess findings on MCMIS data quality from other studies. To assess the effect of timeliness, we first measured how many days on average it was taking each state to report crashes to FMCSA by year for calendar years 2000 through 2005. We also recalculated SafeStat scores from the model's June 25, 2004, run to include crashes that had occurred more than 90 days before that date but had not been reported to FMCSA by that date. We compared the number and rankings of carriers from the original SafeStat results with those obtained by adding in data for the late-reported crashes. In addition, we reviewed the University of Michigan Transportation Research Institute's studies of state crash reporting to MCMIS to identify the impact of late reporting in individual states on MCMIS data quality. To assess the effect of completeness, we attempted to match all crash records in the MCMIS crash file for crashes occurring from December 2001 through June 2004 to the list of motor carriers in the MCMIS census file. In addition, we reviewed the University of Michigan Transportation Research Institute's studies of state crash reporting to MCMIS to identify the impact of incomplete crash reporting in individual states on MCMIS data quality. To assess the effect of accuracy, we reviewed a report by the Office of Inspector General that tested the accuracy of electronic data by comparing records selected in the sample with source paper documents. In addition, we reviewed the University of Michigan Transportation Research Institute's studies of state crash reporting to MCMIS to identify the impact of incorrectly reported crashes in individual states on MCMIS data quality. While the limitations in the data adversely affect the ability of any method to identify carriers that pose high crash risks, we determined that the data were of sufficient quality for our use, which was to assess how the application of regression models might improve the ability to identify high-risk carriers over the current approach--not to determine absolute measures of crash risk. Our reasoning is based on the fact that we used the same data set to compare the results of the SafeStat model and the regression models. Limitations in the data would apply equally to both results. Methods to identify carriers that pose high crash risk will perform more efficiently once the known problems with the quality of state-reported crash data are addressed. To understand what other researchers have found about how well SafeStat identifies motor carriers that pose high crash risks, we identified studies through a general literature review and by asking stakeholders and study authors to identify high-quality studies. Studies included in our review were (1) the 2004 study of SafeStat done by Oak Ridge National Laboratory, (2) the SafeStat effectiveness studies done by the Department of Transportation Office of Inspector General and Volpe Institute, (3) the University of Michigan Transportation Research Institute's studies of state crash reporting to FMCSA, and (4) the 2006 Department of Transportation Office of Inspector General's audit of data for new entrant carriers.[Footnote 44] We assessed the methodology used in each study and identified which findings are supported by rigorous analysis. We accomplished this by relying on information presented in the studies and, where possible, by discussing the studies with the authors. When the studies' methodologies and analyses appeared reasonable, we used those findings in our analysis of SafeStat. We discussed with FMCSA and industry and safety stakeholders the SafeStat methodology issues and data quality issues raised by these studies. We also discussed the aptness of the respective methodological approaches with FMCSA. Finally, we reviewed FMCSA documentation on how SafeStat is constructed and assessments of SafeStat conducted by FMCSA. [End of section] Appendix III: Additional Results from Our Statistical Analyses of the SafeStat Model: This appendix contains technical descriptions and other information related to our statistical analyses. Overview of Regression Analyses: To study how well statistical methods identify carriers that pose high crash risks, we carried out a series of regression analyses. The safety evaluation area values for the accident, driver, vehicle, and safety management areas served as the independent variables to predict crash risks.[Footnote 45] We used the state-reported crash data in MCMIS for crashes that occurred during the 30 months preceding June 25, 2004, as the dependent variable in each model. We used the results of the SafeStat model run from June 25, 2004, to benchmark the performance of the regression models with the crash records for the identified high- risk carriers over the succeeding 18 months. We matched the state-reported crashes that occurred from December 26, 2001, through June 25, 2004, to the carriers listed in SafeStat.[Footnote 46] We checked our match of crashes for carriers with those carriers used by FMCSA in June 2004 and found that the reported numbers had changed for about 10,700 carriers in the intervening 2 years. We found this difference even though we used only crashes that occurred from December 26, 2001, through June 25, 2004, and were reported to FMCSA before June 25, 2004. Because of this difference in matched crashes, we recalculated the accident safety evaluation area using our match of the crashes. This is discussed later in more detail. Using our recalculation of the accident safety evaluation area values and the original driver, vehicle, and safety management safety evaluation area values for the carriers, we fit a Poisson regression model and a negative binomial regression model to the crash counts. Both of these models are statistically appropriate for use when modeling counts that are positive and integer valued. The two models differ in their assumptions about the mean and variance. Whereas the Poisson model assumes that the mean and the variance are equal, the negative binomial model assumes the mean is not equal to the variance. The crash data in MCMIS fit the assumptions of the negative binomial distribution better than those of the Poisson.[Footnote 47] We also tried to estimate zero-inflated Poisson and zero-inflated negative binomial models with the SafeStat data. These models are appropriate when the count values include many zeros, as is the case with the values in this data set (because many carriers do not have crash records). However, we could not estimate the parameters for these models with the MCMIS data. We also considered using logistic regression to model the carrier's odds of experiencing a crash. However, the parameter estimates of the four safety evaluation area values could not be estimated, so we did not use the results of this model.[Footnote 48] Finally, we used the results from the negative binomial model to assess the expected carrier crash counts using the empirical Bayes estimate. In safety applications, the empirical Bayes method[Footnote 49] is used to increase the precision of estimates and correct for the regression- to-mean bias.[Footnote 50] In this application, the empirical Bayes method calculates a weighted average of the rate of crashes for a carrier from the prior 30 months with the predicted mean number of crashes from the negative binomial regression. This method optimizes the identification of carriers with the highest number of future crashes. This optimization of total crashes, however, resulted in the identification of primarily the largest companies. The crash rate (crashes per 1,000 vehicles per 18 months) was not as high for this group as for the carriers placed by the SafeStat model in its A and B categories. Technical Explanation of the Negative Binomial Regression Model: This section provides the technical details for the negative binomial regression model fit to the SafeStat data. This section also explains how we handled incomplete safety evaluation area data for carriers in the regression model analyses. The basic negative binomial probability distribution function for count data is expressed as: [See PDF for equation] for The term represents the dispersion parameter. It is not assumed to equal one, as in the Poisson distribution. The represents the crash count for the motor carrier, and the represents the observed safety evaluation areas. To formulate the negative binomial regression model and control for differences in exposure to events among the carriers, we can express the functional relationship between the safety evaluation areas and the mean number of crashes as: [See PDF for equation] With complete data for a motor carrier, where none of the safety evaluation area values are equal to missing, the regression model of interest is as follows: [See PDF for equation] This equation models the log of the expected mean number of crashes for each motor carrier using the four safety evaluation area values, but most commercial motor companies listed in MCMIS do not have values for all four safety evaluation areas.[Footnote 51] To account for this, it is necessary to define four indicator variables. Let: [See PDF for equations] The indicator variables will be used as main effects in the negative binomial regression model to indicate cases for which information is available. The effect of the safety evaluation area will be measured by the interaction of the indicator function with the safety evaluation area value. This gives us the following model specification: [See PDF for equation] With this parameterization, the estimate for the mean rate of crashes for a carrier with no safety evaluation area information is . For a carrier with information for just the accident safety evaluation area, the estimate for the mean number of crashes is . Note that the effect for each safety evaluation area will include a coefficient times the safety evaluation area value for the carrier plus an offset to the intercept for the indicator term (the coefficient for the indicator function). We used a similar parameterization to formulate the Poisson regression model. Evaluation of Regression Models' Performance: We estimated regression models using the same data FMCSA used in its application of the SafeStat model on June 25, 2004, with one exception for the accident safety evaluation area. For that area, we used our own match of crashes to carriers for December 26, 2001, through June 25, 2004. The MCMIS data we received in June 2006 produced different totals in the match of crashes to carriers for about 10,700 carriers. MCMIS data change over time because crash data are added, deleted, or changed as more information about these crashes is obtained. The discrepancies in matching arose even though we used the identical time interval and counted crashes only when the record indicated they had been reported to FMCSA before June 25, 2004. Because of these discrepancies, it was necessary to calculate the accident safety evaluation area values using our match of crashes and then recalculate the SafeStat carrier scores for June 25, 2004, using our accident safety evaluation area values and the original driver, vehicle, and safety management safety evaluation area values.[Footnote 52] We used our accident safety evaluation area values and the original driver, vehicle, and safety management safety evaluation area values in the regression model analysis. Using the revised accident safety evaluation area values and FMCSA's original driver, vehicle, and safety management safety evaluation area values, the SafeStat model identified 4,989 carriers that pose high crash risks. For each regression model, we input the safety evaluation area data for the carriers in our analysis data set and used the regression model to calculate the predicted mean number of crashes. We then sorted the predicted scores and selected the 4,989 carriers with the worst predicted values as the set of high-risk carriers identified by the regression model. Next, we used MCMIS to determine the crash history of these 4,989 carriers between June 26, 2004, and December 25, 2005, and compared the aggregate crash history with the aggregate crash history of the carriers identified by the SafeStat model during the same period of time. The regression models do not categorize carriers by letter; the regression models produce a predicted crash risk for each carrier. The regression models make use of the safety evaluation area values, but they differ from the SafeStat model in this respect. The results show that a negative binomial regression model estimated with the safety evaluation area values outperforms the current SafeStat model in terms of predicting future crashes and the future crash rate among identified carriers that pose high crash risks. (See table 3.) That is, our negative binomial and Poisson models show 111 and 109 crashes per 1,000 vehicles per 18 months, respectively, compared with the 102 crashes per 1,000 vehicles per 18 months estimated by the current SafeStat model. The Poisson model is not as appropriate since the crash counts for carriers have variability that is significantly different from the mean number of crashes.[Footnote 53] The empirical Bayes method optimizes the selection of future crashes; however, it does so by selecting the largest carriers. The largest carriers have a lower crash rate per 1,000 vehicles per 18 months than the carriers that pose high crash risks identified by the SafeStat model or by the negative binomial regression model. Since the primary use of SafeStat is to identify and prioritize carriers for FMCSA and state compliance reviews, the empirical Bayes method did not identify carriers with the highest safety risk. Table 3: Results for SafeStat Model and Regression Models: Method: SafeStat category A & B; Crash rate: 102; Number of crashes in 18 months: 10,076; Number of vehicles: 98,619. Method: Negative binomial; Crash rate: 111; Number of crashes in 18 months: 19,580; Number of vehicles: 175,820. Method: Poisson; Crash rate: 109; Number of crashes in 18 months: 21,532; Number of vehicles: 198,396. Method: Empirical Bayes; Crash rate: 59; Number of crashes in 18 months: 56,705; Number of vehicles: 965,070. Source: GAO analysis of FMCSA data. Note: As discussed in the text, the zero inflated Poisson, the zero inflated negative binomial, and the logistic regression approaches did not provide useful results. [End of table] FOOTNOTES [1] There are four safety evaluation areas--accident, driver, vehicle, and safety management. They are used by the SafeStat model to assess a carrier's safety. See the background section for a description of these four areas. SafeStat is built on a number of expert judgments rather than using statistical approaches, such as a regression model. [2] Negative binomial regression is often used to model count data (e.g., crashes). The results from this regression model can be interpreted as the estimated mean number of crashes per carrier. [3] The 9 percent improvement is in crash rate per 1,000 vehicles over an 18-month period. [4] The goal of this initiative is to develop an optimal operational model that will allow FMCSA to focus its resources on improving the safety performance of high-risk operators. [5] We applied the SafeStat model to retrospective data. Because of changes to the MCMIS crash file over the past 2 years, our number does not correspond exactly to the number of carriers identified by FMCSA as high risk on June 25, 2004. Had all crash data been reported within 90 days of when the crashes occurred, 182 of the carriers identified by SafeStat as highest risk would have been excluded (because other carriers had higher crash risks), and 481 carriers that were not originally designated as posing high crash risks would have scored high enough to be considered high risk, resulting in a net addition of 299 carriers. [6] A reportable crash is one that meets both a vehicle and a crash severity threshold. Generally, for a crash to be reported, it must involve a truck with a gross vehicle weight rating of over 10,000 pounds; a bus with seating for at least nine people, including the driver; or a vehicle displaying a hazardous materials placard. Reportable accidents involve a fatality, an injury requiring transport to a medical facility for immediate medical attention, or towing required because the vehicle sustained disabling damage. [7] This figure is for 2002, the most recent date for which data is available. [8] This includes an unidentified number of carriers that are registered but are no longer in business. [9] FMCSA completed 15,626 compliance reviews in 2006. The number of companies reviewed was less because some carriers received more than 1 compliance review. [10] Acute violations are violations so severe that FMCSA requires immediate corrective actions by a motor carrier regardless of the carrier's overall safety status. An example of an acute violation is a carrier's failing to implement an alcohol or drug testing program for drivers. Critical violations are serious, but less severe than acute violations, and most often point to gaps in carriers' management or operational controls. For example, a carrier may not maintain records of driver medical certificates. [11] Severe violations are violations of hazardous materials regulations. Level I violations require immediate corrective actions. An example of a level I violation is offering or accepting a hazardous material for transportation in an unauthorized vehicle. Level II violations indicate a breakdown in the management or operational controls of the facility. An example of a level II violation is failing to train hazardous materials employees as required. [12] Minimum requirements in this context mean that the carrier has enough safety data to receive a rating. Usually, the safety data are associated with adverse safety events. However it is possible for a carrier to have enough roadside inspections, even if none of the inspections resulted in violations, to qualify for a driver and vehicle safety evaluation area score. [13] The 9 percent improvement is in the crash rate per 1,000 vehicles over an 18-month period. [14] Applying the SafeStat model to June 2004 data identifies 4,989 carriers as high risk (categories A or B). Using 10,000 randomly selected samples of 4,989 carriers and considering the crashes that these carriers had between June 2004 and December 2005, we found that the crash rate per 1,000 vehicles in the ensuing 18 months was 83 percent higher among the carriers identified by the SafeStat model than among the randomly selected carriers. [15] Ken Campbell, Rich Schmoyer, and Ho-Ling Hwang, Review of the Motor Carrier Safety Status Measurement System (SAFESTAT), Oak Ridge National Laboratory, Final Report, October 2004. See appendix I for a more detailed discussion of the findings from this report. [16] This occurs because data were added, deleted, or modified as more information became known over time. See appendix III for a more detailed discussion. [17] The threshold could be increased or decreased to align with the resources that FMCSA and its state partners have available to perform compliance reviews. As discussed earlier, FMCSA and its state partners select carriers for these reviews because they pose high crash risks and for other reasons. [18] The carriers identified as high risk by SafeStat had a total of 98,619 vehicles while those identified by the negative binomial regression model had 175,820 vehicles. The identification of larger sized companies on average by the negative binomial regression model is how a 9 percent increase in the crash rate translated into 9,500 additional crashes. [19] FMCSA can use the current safety evaluation area values in SafeStat and the number of state-reported crashes for each carrier in the 30 preceding months in the negative binomial regression model. [20] Federal Motor Carrier Safety Administration Compliance Review Workgroup, Phase II Final Report: Proposed Operational Model for FMCSA Compliance and Safety Programs Report, February 2005. [21] Oak Ridge National Laboratory statistically measured the weights for the safety evaluation areas and estimated the accident safety evaluation area should have a weight of 57 in the SafeStat model formula. This compares with the present weight of 2 that SafeStat gives the accident safety evaluation area. Ken Campbell, Rich Schmoyer, and Ho-Ling Hwang, Review of the Motor Carrier Safety Status Measurement System (SAFESTAT), Oak Ridge National Laboratory, Final Report, October 2004. [22] We expect to issue a report shortly that provides additional discussion of FMCSA's initiative to identify and take action against carriers that are egregious safety violators. [23] Revisions to SafeStat are exempt from notice and comment under the Administrative Procedure Act if they relate to FMCSA's internal practices and procedures. [24] SafeStat does not consider carriers with fewer than three moving violations. [25] Carriers with one or zero state-reported crashes do not receive an accident safety evaluation area score unless the recordable accident indicator is available from a recent compliance review. Carriers with two or fewer driver inspections and two or fewer moving violations do not receive a driver safety evaluation area score unless the driver review indicator is available from a recent compliance review. Carriers with two or fewer vehicle inspections do not receive a vehicle safety evaluation area score unless the vehicle review indicator is available from a recent compliance review. In the data we reviewed, almost 2 percent of the carriers had undergone a compliance review within the 18 months prior to the SafeStat run on June 25, 2004. [26] For another assessment of data quality, see Office of Inspector General, Improvements Needed in the Motor Carrier Safety Status Measurement System, U.S. Department of Transportation, Report MH-2004- 034, 2004. [27] These 182 carriers were no longer in the worst 25 percent for the accident safety evaluation area after the addition of the late-reported crashes. [28] Part of the improvement in timeliness of reporting for the most recent year is that an unknown number of crashes that occurred in 2005 had still not been reported as of June 2006, the date we obtained these data. [29] The University of Michigan Transportation Research Institute's reports on state crash reporting can be found at http://www.umtri.umich.edu. State reports issued by the University of Michigan Transportation Research Institute cover California, Florida, Illinois, Iowa, Louisiana, Maryland, Michigan, Missouri, Nebraska, New Jersey, New Mexico, North Carolina, Ohio, and Washington. We included all of these reports in our review. [30] GAO, Highway Safety: Further Opportunities Exist to Improve Data on Crashes Involving Commercial Motor Vehicles, GAO-06-102 (Washington, D.C.: Nov. 18, 2005). [31] David Madsen and Donald Wright, Volpe National Transportation Systems Center, An Effectiveness Analysis of SafeStat (Motor Carrier Safety Status Measurement System), Paper No. 990448, November 1998 and John A. Volpe National Transportation Systems Center, Motor Carrier Safety Assessment Division, SafeStat Effectiveness Study Update, March 2004. [32] Volpe included only carriers with two or more crashes and/or three or more inspections during the preceding 30 months, and/or an enforcement action within the past 6 years, and/or a compliance review within the previous 18 months. This is consistent with the SafeStat minimum event requirements. [33] Office of Inspector General, Improvements Needed, 2004. [34] Ezra Hauer, Douglas Harwood, and Michael Griffith, The Empirical Bayes Method for Estimating Safety: A Tutorial. Transportation Research Record 1784, National Academies Press, 2002, 126-131. [35] Thomas Corsi, Richard Barnard, and James Gibney, Motor Carrier Industry Profile: Linkages Between Financial and Safety Performance Among Carriers in Major Industry Segments, Robert H. Smith School of Business at the University of Maryland, October 2002. [36] The American Trucking Associations is a membership organization with a mission to serve and represent the interests of the trucking industry. [37] The Annual Report Form M is required only for class 1 or class 2 carriers that have revenue exceeding $3 million for 3 consecutive years. [38] Brenda Lantz and David Goettee, An Analysis of Commercial Vehicle Driver Traffic Conviction Data to Identify Higher Safety Risk Motor Carriers, Upper Great Plains Transportation Institute and FMCSA, 2004. Brenda Lantz, Development and Implementation of a Driver Safety History Indicator into the Roadside Inspection Selection System, FMCSA, April 2006. [39] Correlation = 0.085. (See FMCSA, Development and Implementation of a Driver Safety History Indicator into the Roadside Inspection Selection System, April 2006, 14). [40] The Office of Inspector General used MCMIS data to estimate a standard deviation for days to report a crash and then divided the standard deviation by the average number of days. This number was multiplied by 100 to derive the coefficient of variation. The obtained value of about 77 indicates substantial variability relative to the average number of days to report a crash. [41] The General Estimates System collects all types of information from all types of crashes. It is based on a nationally representative probability sample from the estimated 6.4 million police-reported crashes that occur annually. While the crash eligibility definitions are not strictly comparable, the number of crashes reported to MCMIS is below the lower bound for the 95 percent confidence interval around the estimated total number of crashes for large trucks in 2004. [42] We obtained crash data for this period that were reported to FMCSA through June 2006. This allowed us to obtain data on late-reported crashes for the July 2004 through December 2005 period. [43] We drew 10,000 simple random samples of 4,989 carriers (the number of carriers that SafeStat identified as being at highest risk for crashes when we recalculated it) from the list of all carriers in the MCMIS master file used by SafeStat on June 25, 2004, and for each sample we calculated how many crashes the selected carriers reported to MCMIS between June 26, 2004, and December 25, 2005. [44] Campbell, Schmoyer, and Hwang, Review of The Motor Carrier Safety Status Measurement System (SAFESTAT), 2004; U.S. DOT Office of Inspector General, Improvements Needed In the Motor Carrier Safety Status Measurement System, 2004; Madsen and Wright, U.S. DOT-Volpe National Transportation Systems Center, An Effectiveness Analysis of SafeStat, 1998; John A. Volpe National Transportation Systems Center, SafeStat Effectiveness Study Update, 2004. University of Michigan Transportation Research Institute MCMIS State Reports; U.S. DOT Office of Inspector General, Significant Improvements in Motor Carrier Safety Program Since 1999 Act But Loopholes For Repeat Violators Need Closing, 2006. [45] In addition to the safety evaluation area scores, we included indicator variables to flag any missing safety evaluation area scores. [46] We used the carrier's Department of Transportation number recorded in the crash record to match to the carrier's Department of Transportation number listed in the SafeStat summary table. [47] We checked this by estimating the mean and variance of the crashes for the population of all carriers and determined that they were significantly different. [48] The coefficients in the model could not be reliably estimated (the maximum likelihood of the model did not converge). [49] Hauer, Harwood, Council, and Griffith, Estimating Safety by the Empirical Bayes Method: A Tutorial, 2001. [50] In the context of crashes, we wish to "treat" the most dangerous companies with a compliance review to make them safer. But, crashes are distributed with a fair degree of randomness. A company selected for a compliance review may have suffered an atypical random grouping of accidents in the preceding months. With or without a compliance review, it is likely that the random grouping will not exist next year, and the crash figures will improve. Statistical methods seek to control for this regression-to-mean bias in order to better identify the effect of a compliance review on a company's safety. [51] A carrier has to have two or more reported crashes in the past 30 months to receive an accident safety evaluation area value. A carrier has to have three or more roadside inspections to receive a driver or vehicle safety evaluation area value. A driver has to have had a compliance review in the past 18 months to receive a safety management safety evaluation area value. There are other ways a carrier can receive a value for one of these four safety evaluation areas, refer to the description of each one provided in the Background. [52] Our calculation of the accident safety evaluation area differed slightly from that used by FMCSA. We did not add 1 to the severity weights for crashes with an associated hazardous materials release due to the rarity of this event. [53] The equality of the variability in the number of crashes to the average number of crashes is an assumption of the Poisson regression model. This assumption does not hold for the MCMIS data we analyzed. GAO's Mission: The Government Accountability Office, the audit, evaluation and investigative arm of Congress, exists to support Congress in meeting its constitutional responsibilities and to help improve the performance and accountability of the federal government for the American people. GAO examines the use of public funds; evaluates federal programs and policies; and provides analyses, recommendations, and other assistance to help Congress make informed oversight, policy, and funding decisions. GAO's commitment to good government is reflected in its core values of accountability, integrity, and reliability. Obtaining Copies of GAO Reports and Testimony: The fastest and easiest way to obtain copies of GAO documents at no cost is through GAO's Web site (www.gao.gov). Each weekday, GAO posts newly released reports, testimony, and correspondence on its Web site. To have GAO e-mail you a list of newly posted products every afternoon, go to www.gao.gov and select "Subscribe to Updates." Order by Mail or Phone: The first copy of each printed report is free. Additional copies are $2 each. A check or money order should be made out to the Superintendent of Documents. GAO also accepts VISA and Mastercard. Orders for 100 or more copies mailed to a single address are discounted 25 percent. Orders should be sent to: U.S. Government Accountability Office 441 G Street NW, Room LM Washington, D.C. 20548: To order by Phone: Voice: (202) 512-6000 TDD: (202) 512-2537 Fax: (202) 512-6061: To Report Fraud, Waste, and Abuse in Federal Programs: Contact: Web site: www.gao.gov/fraudnet/fraudnet.htm E-mail: fraudnet@gao.gov Automated answering system: (800) 424-5454 or (202) 512-7470: Congressional Relations: Gloria Jarmon, Managing Director, JarmonG@gao.gov (202) 512-4400 U.S. Government Accountability Office, 441 G Street NW, Room 7125 Washington, D.C. 20548: Public Affairs: Paul Anderson, Managing Director, AndersonP1@gao.gov (202) 512-4800 U.S. Government Accountability Office, 441 G Street NW, Room 7149 Washington, D.C. 20548: