In Alaska, West Virginia and other mostly rural states, census takers relied more on the word of neighbors, landlords and others for information about a home’s residents. In New Jersey, New York and other more densely populated states in the Mid-Atlantic region, they were more likely to come away from a household lacking basic information on race, sex and ethnic background.
An Associated Press review of the first data-quality measurements released by the US Census Bureau last month shows some early patterns that may point to red flags in the data that could emerge when more detailed numbers from the 2020 census are released in August.
While it’s too early to reach any conclusions about the accuracy of the data gathered during the once-a-decade head count, these types of responses — a reliance on proxies for answers and just a head count with no basic demographic information — result in poorer quality data compared to other methods.
Poor quality data can diminish the political power and resources available to communities across the US: Children who are missed in the census deprive communities of money for building schools, and undercounting racial or ethnic minorities prevents them from forming minority-majority political districts.
The bureau released data quality measurements last month as part of an effort to engender confidence in the numbers following a head count challenged by the spread of the new coronavirus, concerns about politicization by the Trump administration and natural disasters. The bureau also is allowing a team of outside statisticians to perform quality checks.
The measurements include state-by-state breakdowns of rates of households that answered the census questionnaire on their own, the percentage of households where a member answered a census taker’s questions and the rate of households where information was gathered from administrative records from agencies like the IRS or Social Security Administration. Answers gathered from these methods are considered higher quality than proxies and population-only counts.
“We will learn more when smaller geography data is released,” said Jan Vink, a demographer at Cornell University.
Besides Alaska and West Virginia, other rural states that had the highest rates of household answers coming from proxies such as neighbors and landlords included Maine, Montana, Vermont and New Mexico. In these states, census takers relied on information from proxies for between almost a quarter and a third of households. Puerto Rico’s rate was 37.3 percent.
Nationally, the rate was 18.2 percent, a little less than the 2010 rate of 19.5 percent, but the bureau for the first time used administrative records in 2020, which helped fill in some of those information gaps.
These same states and Puerto Rico are also places where large numbers of households got counted from a bureau operation geared towards rural and remote communities that often lack mail delivery at their homes. Known as “Update/Leave,” it involved census takers dropping off paper questionnaires at homes. The operation’s launch coincided with the coronavirus outbreak in the US in March 2020, and was suspended for two months as the pandemic spread.
While it’s too early to reach any conclusions about the fairness or accuracy of the 2020 count, the correlation between the high use of proxies and the fact these states had large numbers of households covered by this operation “raises an important red flag that needs to be examined further,” said Steven Romalewski, director of the CUNY Mapping Service.
When it came to households where census takers only got a head count without demographic information, the national rate was a 5.9 percent. But it was higher for states in the Mid-Atlantic region, some with the nation’s highest population densities, as well as the District of Columbia. It ranged between almost 8 percent and 12 percent for the District of Columbia, New York, Illinois, New Jersey, Maryland and Delaware.
In a statement, the Census Bureau confirmed the findings of the AP review. In 2010, those mostly rural states also had higher rates of proxy responses compared to other states. Residents in these states tend to self-respond at lower rates, requiring census takers to visit their homes, said Mike Bentley, the bureau’s assistant division chief for Census Statistical Support.
Regarding the higher head count-only responses in the Mid-Atlantic states, Bentley said, “It is something that we are exploring further.”
One of the questions the bureau needs to ask is if the failed effort by the Trump administration to add a citizenship question to the 2020 census had a residual effect on residents refusing to provide personal information, said Thomas Saenz, president of the Mexican American Legal Defense and Educational Fund. Saenz also serves on a bureau advisory committee.
At the same time, because of the Trump administration’s attempts to politicize the 2020 census, the bureau needs to determine if people were more willing to provide false information about their neighbors or tenants when proxies were used, Saenz said.
“If you are a Latino family, and a census taker is asking about you, and your proxy is a Trump lover, is this the first census where we have to worry about whether a proxy lied?” Saenz said.
Follow Mike Schneider on Twitter at https://twitter.com/MikeSchneiderAP
Image: People walk on the street, Monday, April 26, 2021 in New York. The once-a-decade head count of the United States shows where the population grew during the past 10 years and where it shrank. New York will lose one seat in Congress as a result of national population shifts, according to census data released Monday. (AP Photo/Mark Lennihan)