Sex segregation propositions in 140 characters

In response to an annoying conversation on Twitter about this short paper, which felt very familiar, here is an argument about the sex segregation of work, in the form of unsourced propositions of 140 characters or less. You can find most of these in longer form in various posts under the segregation tag. It’s a tweetstorm, in one post!

Many studies show men and women have mean differences in personality and preferences, although there is overlap in the distributions; but

Every respondent in any such study was born and raised in a male-dominated society, because all societies are male dominated.

Most people in the debates I see, being elites, act like everyone is a college graduate who chose their job, or “field” of work; but

We know lots of people are in jobs they didn’t freely choose or didn’t get promoted out of, for reasons related to gender (like pregnancy).

No one knows how much segregation results from differences in choices of workers vs. parent/employer/educator pressure or constraints; and

The level of sex segregation varies across social contexts (across space and time), which means it is not all caused by biology; and

Because segregation causes inequality and constrains human freedom, and we have the means to reduce it, the biology theory is harmful; so

Go ahead and study the biology of sex differences, because society is interesting, but don’t use that as an excuse for inequality.

Family Inequality year-end review

This blog has enhanced my working life in so many ways, so let me start by thanking you for reading, sharing, and commenting here. The writing I do here — 909 posts so far — led to my textbook (working on the second edition now), and the new collection of essays I have under contract (and now under review at U. of California Press). Because of the visibility I have here, I got to be co-editor of Contexts magazine for the last two years (one more to go), and got elected to leadership positions in the American Sociological Association’s Family and Population sections. And the engagement I get here, with the discipline of sociology and academia generally, led to this year’s major initiative, SocArXiv, an open access archive of social science research (read about it, share your work, watch videos, @follow). This is all very rewarding work with an expanding group of great colleagues and collaborators.*

All of these things took me away from the daily work of writing here this year. As a result, I wrote fewer posts — 77, compared with an average of 130 per year since 2010. And for the first time this blog saw a decline in readership in 2016. Even with a self-serving new measure — visitors per new piece posted, which deflates the hit count by an indicator of effort — there was a drop this year:


As long as blog traffic was increasing, I was of course delighted to report on my success on that metric. Now that it’s not, I stress other key indicators, such as those I listed above. Obviously I won’t be measuring success by my interventions into politics. But more fundamentally, all of us in the knowledge and truth business have more serious problems to consider than impact metrics.

The most popular posts I wrote this year fall into four categories: Trump, the academic publishing problem, regular demography, and debunking.** This is a good reflection of my priorities over the year, and I have no strategic adjustments planned for 2017. But who knows?

Here are the top 10 posts written in 2016. Thanks again for reading!

  1. No, Black women are not the “most educated group in the US”. How do you debunk a false meme when it says something positive about people you want to support?
  2. Black men raping White women: BJS’s Table 42 problem. A lot of clicks on this post came from people Googling things like “black man rape white woman.” I hope they stay to read it.
  3. Life table says divorce rate is 52.7%. There is no one “divorce rate.” This is an underappreciated method, with a non-surprising result.
  4. How broken is our system (hit me with that figure again edition). And see also Eran Shor responds. Our academic publishing system once again revealed to be poorly designed for the task of providing information to people.
  5. Perspective on sociology’s academic hierarchy and debate. Follow up to the Shor et al. debate. Academics are going to have to get thicker skins.
  6. The one big thing that might doom Trump in November. Race, I figured. I stuck with this message all year. Maybe it helped a little.
  7. Must-know current demographic facts. Updating a list of the basics, especially for teaching.
  8. How the left can win the general election. Some suggestions for how to win in a two-party game. The focus on “social” versus “economic” issues was incorrect. (This one just made the list because Chris Hayes called it “fascinating” on Twitter, a quote I plan to put on the back of all my books from now on.)
  9. Looks like racist Southern Whites like Trump. Sure do.
  10. For (not against) a better publishing model. How the American Sociological Association is not getting it right. Written the day I registered the SocArXiv domains.

* Note this year I started posting data and code on the Open Science Framework, a collaboration and sharing platform on which SocArXiv also runs. Here are my public projects. I hope you’ll consider using it, or something like it.

** Not included on this list, but probably tops among my essays this year, is the post that was picked up by the LSE Impact blog about the formation of SocArXiv. 

Is there sex selection among Asian immigrants in the US?

There is a 2008 paper reported in the New York Times in 2009, which found skewed sex ratios among children of immigrants from China, Korea, and India, if their older siblings were girls, using the 2000 Census. The implication was that some parents were using IVF or abortion to select boy children if their first two were girls — as is the case in their home countries. There has been some other research on this from the early 2000s, but I haven’t seen it updated since then.

I took a quick stab at it, but don’t have time right now to pursue it more thoroughly. So here’s the quick answer I got, and I shared my data, code, and results in an Open Science Framework project, here. I hope someone will be interested and pursue it further (using my approach or not). The files there include all different ethnic/racial groups.

This is preliminary.

Using the American Community Survey data from 2010-2015, I took U.S.-born children ages 0-5 whose parents were both born in China, Korea, or India and were both present in the household. I counted the sex of any present siblings under age 15 (excluding step- and adopted children). Then I restricted the data to those with 2 older siblings, and compared the sex ratios among those who had 0 or 1 older sister to those who had 2 older sisters. I did this in a logistic regression controlling for individual years of age, and using ACS person weights. There are judgment calls to make about age, siblings, data and other issues. The older you get, the more likely you are to have kids moving out in a way that is not sex-neutral (for example, if parents with girls are more or less likely to divorce), and so on. Should parents be matched on immigration status? Should siblings born abroad be included? Why the years 2010-2015? And so on. This is what I mean by preliminary. But these results are interesting enough to prompt me to post them and encourage discussion and more analysis.
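For anyone who wants to try the comparison, here is a minimal sketch of the core idea in Python. It is not the code in my OSF project. It skips the age controls and computes a weighted odds ratio, which is what a weighted logistic regression with a single two-category predictor would report. The `Child` record, its field names, and every number in `sample` are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Child:
    """One U.S.-born child with exactly two older siblings (hypothetical record)."""
    male: bool
    older_sisters: int  # 0, 1, or 2
    weight: float       # stand-in for a survey person weight


def weighted_odds_of_boy(children, two_sisters):
    """Weighted odds (boys / girls) within one comparison group."""
    group = [c for c in children if (c.older_sisters == 2) == two_sisters]
    boys = sum(c.weight for c in group if c.male)
    girls = sum(c.weight for c in group if not c.male)
    return boys / girls


def odds_ratio(children):
    """Odds of a boy after two older sisters, relative to 0 or 1 older sister."""
    return weighted_odds_of_boy(children, True) / weighted_odds_of_boy(children, False)


# Toy data, invented for illustration only.
sample = [
    Child(True, 2, 150.0), Child(True, 2, 150.0), Child(False, 2, 200.0),
    Child(True, 0, 100.0), Child(True, 1, 150.0),
    Child(False, 1, 125.0), Child(False, 0, 125.0),
]

print(odds_ratio(sample))
```

An odds ratio above 1 means boys are relatively more common after two older sisters. A real analysis would add the age controls and proper standard errors for weighted survey data.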

Here’s what I got:

[Figure: sex selection.xlsx]

The sex differences between those with 0/1 older sister and 2 older sisters are not statistically significant at p<.05 in each of the three groups, but they are for the combined set (.046). These comparisons involve a few hundred cases. Here are the unweighted, unadjusted results:


As you can see, the difference rests on just a few families intervening to choose boys, or on some other force rearranging the living arrangements, or survival, of children and families. Without them, the difference would not hold. Still, I think it’s worth pursuing. Maybe someone already has. If you decide to get into it, feel free to use this stuff, and let me know what you come up with!
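To see how fragile a comparison like this is with a few hundred cases, here is a rough sketch of an unweighted two-proportion z-test in Python. It is a simple stand-in for the unadjusted comparison, not the weighted regression described above, and the counts in the example are hypothetical, not my results.

```python
import math


def two_proportion_z(boys_a, n_a, boys_b, n_b):
    """Two-sided z-test for a difference in the share of boys between two groups."""
    p_a = boys_a / n_a
    p_b = boys_b / n_b
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (boys_a + boys_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 100 children with two older sisters, 100 without.
z_base, p_base = two_proportion_z(60, 100, 50, 100)
# The same comparison if five more of the first group were boys.
z_more, p_more = two_proportion_z(65, 100, 50, 100)

print(round(p_base, 3), round(p_more, 3))
```

Moving only five children from one column to the other changes the p-value a lot, which is why I would not lean hard on any one threshold here.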


Advice for and about ASA

Last summer the incoming American Sociological Association President, Michèle Lamont, asked me to offer some advice to ASA about open access publishing issues. It was an open-ended request, and I didn’t know how to go about it. My understanding of ASA is that it is not well outfitted as a change agent; it’s much more likely to respond to external developments in its ecosystem than to take the lead, especially when its revenue stream is at stake. Nevertheless, lots of good people work in and around the association, and it has great capacity. (I am involved myself, as co-editor of the ASA magazine Contexts, as chair-elect of the Family Section, and as secretary-treasurer of the Population Section.) So I wrote a short essay on what ASA might do, or what its members might do or demand of it.

It’s not coincidental that this is posted on the SocArXiv blog, SocOpen, which is part of that changing external environment that I hope will lead to ASA adapting for the better. I believe that devoting my energy to this project is producing something tangible for research and scholarly communication, while also pressuring ASA (and maybe other associations) to move in the right direction.

I hope you’ll read it on SocOpen.

No paper, no news (#NoPaperNoNews)


In the abstract, the missions of science and science reporting align. But in the market arena they both have incentives to cheat, stretch, and rush. Members of the two groups sometimes have joint interests in pumping up research findings. Reporters feel pressure to get scoops on cutting edge research, research that they want to appear important as well as true — so they may want to avoid a pack of whining, jealous tweed-wearers seen as more hindrance than help. And researchers (and their press offices) want to get splashy, positive coverage of their discoveries that isn’t bogged down by the objections of all those whining, jealous tweed-wearers either.

Despite some bad incentives, the alliance between good researchers and good reporters may be growing stronger these days, with the potential to help stem the daily tide of ridiculous stories. Partly due to social media interaction, it’s become easier for researchers to ping reporters directly about their research, or about a problem with a story; and it’s become easier for reporters to find and contact researchers to cover their work, and for comment or analysis of research they’re covering. The result is an increase in research reporting that is skeptical and exploratory rather than just exuberant or exaggerated. Some of this rapid interaction between expert researchers and expert reporters, in fact, operates as a layer of improved peer review, subjecting potentially important research to more extreme vetting at just the right moment.

Those of us in these relationships who want to do the right thing really do need each other. And one way to help is to encourage the development of prosocial norms and best practices. To that end, I think we should agree on a No Paper No News pact. Let’s pledge:

  • If you are a researcher, or university press office, and you want your research covered, free up the paper — and insist that news coverage link to it. Make the journal open a copy, or post a preprint somewhere like SocArXiv.
  • If you are a reporter or editor, and you want to cover new research, insist that the researcher, university, or journal, provide open access to its content — then link to it.
  • If you are a consumer of science or research reporting, and you want to evaluate news coverage, look for a clear link to an open access copy of the paper. If you don’t see one, flag it with the #NoPaperNoNews tag, and pressure the news/research collaborators to comply with this basic best practice.

This is not an extremist approach. I’m not saying we must require complete open access to all research (something I would like to see, of course). And this is not dissing the peer review process, which, although seriously flawed in its application, is basically a good idea. But peer review is nothing like a guarantee that research is good, and it’s even less a guarantee that research as translated through a news release and then a reporter and an editor is reliable and responsible. #NoPaperNoNews recognizes that when research enters the public arena through the news media, it may become important in unanticipated ways, and it may be subject to more irresponsible uses, misunderstandings, and exploitation. Providing direct access to the research product itself makes it possible for concerned people to get involved and speak up if something is going wrong. It also enhances the positive impact of the research reporting, which is great when the research is good.

Plenty of reporters, editors, researchers, and universities practice some version of this, but it’s inconsistent. For example, the American Sociological Association currently has a news release up about a paper in the American Sociological Review, by Paula England, Jonathan Bearak, Michelle Budig, and Melissa Hodges. And, as is now usually the case, that paper was selected by the ASR editors to be the freebie of the month, so it’s freely available. But the news release (which also only lists England as an author) doesn’t link to the paper. Some news reports link to the free copy but some don’t. ASA could easily add boilerplate language to their news releases, firmly suggesting that coverage link to the original paper, which is freely available.

Some publishers support this kind of approach, laying out free copies of breaking news research. But some don’t. In those cases, reporters and researchers can work together to make preprint versions available. In the social sciences, you can easily and immediately put a preprint on SocArXiv and add the link to the news report (to see which version you are free to post — pre-review, post-review, pre-edit, post-edit, etc. — consult your author agreement or look up the journal in the Sherpa/Romeo database.)

This practice is easy to enforce because it’s simple and technologically easy. When a New York Times reporter says, “I’d love to cover this research. Just tell me where I can link to the paper,” most researchers, universities, and publishers will jump to accommodate them. The only people who will want to block it are bad actors: people who don’t want their research scrutinized, reporters who don’t want to be double-checked, publishers who prioritize income over the public good.



Why Heritage is wrong on the new Census race/ethnicity question

Sorry this is long and rambly. I just want to get the main points down and I’m in the middle of other things. I hope it helps.

Mike Gonzalez, a Bush-era speech writer with no background in demography (not that there’s anything wrong with that), now a PR person for the Heritage Foundation, has written a noxious and divisive op-ed in the Washington Post that spreads some completely wrong information about the U.S. Census Bureau’s attempts to improve data collection on race and ethnicity. It’s also a scary warning of what the far right politicization of the Census Bureau might mean for social science and democracy.

Gonzalez is upset that “the Obama administration is rushing to institute changes in racial classifications,” which include two major changes: combining the Hispanic/Latino Origin question with the Race question, and adding a new category, Middle Eastern or North African (MENA). Gonzalez (who, it must be noted, perhaps with some sympathy, recently wrote one of those useless books about how the Republican party can reach Hispanics, made instantly obsolete by Trump), says that what Obama has in mind “will only aggravate the volatile social frictions that created today’s poisonous political climate in the first place.” Yes, the “poisonous political climate” he is upset about (did I mention he works for the Heritage Foundation?) is the result of the way the government divides people by race and ethnicity. Not actually dividing them, of course (which is a real problem), but dividing them on Census forms. (I hadn’t heard this particular version of why Trump is Obama’s fault — who knew?)

How will the new reforms make the Trump situation he helped create worse? Basically, by measuring race and ethnicity, which Gonzalez would rather not do (as suggested by the title, “Think of America as one people? The census begs to differ,” which could have been written at any time in the past two centuries).

Specifically, Gonzalez claims, completely factually inaccurately, that Census would “eliminate a second question that lets [Hispanics] also choose their race.” By combining Hispanic origin and race into one question — on which, as before, people will be free to mark as many responses as they like — Gonzalez thinks Census would “effectively make ‘Hispanic’ their sole racial identifier.” He is upset that many Latinos will not identify themselves as “White” if they have the option of “Hispanic” on the same question, even if they are free to mark both (which he doesn’t mention). Some will, but that is not because anyone is taking away any of their choices.

The Census Bureau, of course, because they always do, because they are excellent, has done years of research on these questions, including all the major stakeholders in a long interactive process that is scrupulously documented and (for a government bureaucracy) quite transparent. Naturally not everyone is happy, but in the end the trained demographic professionals come down on the side of the best science.

Race that Latino

The most recent report on the research I found was a presentation by Nicholas Jones and Michael Bentley from the Census Bureau. This is my source for the research on the new question.

First, why combine Hispanic with race? You have probably seen the phrase “Hispanics may be of any race” on lots of reports that use Census or other government data. The figure below is from the first edition of my book, using 2010 data, in which I group all 50 million Hispanics, and show the races they chose: about half White, the rest other race or more than one race (usually White and other race). Notice that by this convention Hispanics are removed from the White group anyway, just because we don’t want to have people in the same picture twice (“non-Hispanic Whites” is already a common construction).


The “may be of any race” language is the awkward outcome of an approach that treats Hispanic as an “ethnicity” (actually a bunch of national origins, maybe a panethnicity), while White, Black, Asian, Pacific Islander, and American Indian are treated as “races.” The distinction never really made sense. These things have been measured using self-identification for more than half a century, so we’re not talking about genetics and blood tests, we’re talking about how people identify themselves. And there just isn’t a major categorical difference between race and ethnicity for most people: people of any race or ethnicity may identify with a specific national origin (Italian, Pakistani, Mexican), as well as a “race” or panethnic identity such as Asian or Latino. And now that the government allows people to select multiple races (since 2000), as well as answering the Hispanic question, there really is no good justification for keeping them separate. As you can see from my figure above, when we analyze the data we mostly pull all the Hispanics together regardless of their races. The new approach just encourages them to decide how they want that done, which is usually a better approach.

Of course, Asians and Pacific Islanders have been answering the “race” question with national origin prompts for several decades. There was no “Asian” checkbox in 2000 or 2010 (or on the American Community Survey). So they have been using their ethnicity to answer the race question all along. That’s because for some reason you just can’t get “Asian” immigrants, especially recent immigrants (that is, people from India, Korea, Japan, Vietnam, and so on), to see themselves as part of one panethnic group. Go figure, must be the centuries of considering themselves separate peoples, even “races.” So, a new question that combines the more ethnic categories (Mexican, Pakistani, etc.), with America’s racial identities (Black, White, etc.), just works better, as long as you let people check as many boxes as they want. This is what the “race” question looked like in 2014. Note there is no “Asian” checkbox:


As a general guide, the questionnaire scheme works best when (a) everyone has a category they like, and (b) few people choose “other.” That is the system that will yield the most scientifically useful data. It also will tend to match the way people interact socially, including how they discriminate against each other, burn crosses on each other’s lawns, and randomly attack each other in public. We want data that helps us understand all that.

Through extensive testing, it has become apparent that, when given a question that offers both race and Hispanic origin together, Latino respondents are much more likely to answer Hispanic/Latino only, rather than cluttering up the race question with “some other race” responses (often writing in “Hispanic” or “Latino” as their “other race”). If I read the presentation right, in round numbers, given the choice of answering the “race” question with “Hispanic,” in the test data about 70% chose Hispanic alone; about 20% chose White along with Hispanic, and 5% chose two races. In fact, the number of Latinos saying their only race is White probably won’t change much; the biggest difference is that you no longer have almost 40% of Latinos saying they are “some other race,” or choosing more than one race (usually White and Other), which usually just means they don’t see a race that fits them on the list.

In the end, the sizes of the major groups (Hispanics and the major races) are not changed much. Here’s the summary:


In fact, the only major group that will shrink is probably the non-group “multiracial” population, which today is dominated by Hispanics choosing White and “some other race.”

It’s really just better data. It’s not a conspiracy. It’s not eliminating the White race or discouraging assimilation of Hispanics. In short, keep calm and collect better data. We can fight about all that other stuff, too.

I’m sure Gonzalez doesn’t really think this will “eliminate Hispanics’ racial choices.” He’s dog-whistling to people who think the government is trying to reduce the number of Whites by not letting Hispanics be White. His statements are factually incorrect and the Washington Post shouldn’t have printed them. (I don’t know how the Post does Op-Eds; when I wrote one for the NY Times it was pretty thoroughly fact-checked.)


The Migration Policy Institute estimates there are about 2 million MENAs in the U.S. now, about half of them immigrants. This is a pretty small population, mostly Arabic-speaking immigrants and their descendants, and more Christian (relative to Muslim) than the countries they left. This is especially true of the more recent immigrants, who don’t include a lot of Iranians (who aren’t Arab).

Census could have instead defined them by linguistic origin (Arab), and captured most, but they instead are going with country of origin, which is consistent with how the other race/ethnic groups are identified (for better or worse). Their testing showed that this measure captures most people with MENA ancestry, encourages them to identify their ancestry, cuts down on them identifying as White, and cuts down on them using “some other race.”

The difference is dramatic for those identifying as White, which fell from 85% to 20% in the test once a MENA category was offered. Would it be better if they just identified as White? I’m really not trying to shrink the count of Whites, I just think this is more accurate. I don’t care about the biology of Whiteness and whether Iranians are part of it, for example (and don’t ever say “Caucasian,” please), I care about the experience and identity of the people we’re talking about — as well as the beliefs of the people who hate them and those who want to protect them from discrimination. Counting them seems better than shoehorning them into a category most of them avoid when given the chance.

Here’s one version of the proposed new combined question, from that Census presentation:



Why not Mike Gonzalez to run Census? Unbelievably, he probably knows more about it than Trump’s education and HUD department heads know about their new portfolios.

But that’s just one odious possibility. It makes me kind of sick to think of the possible idiots and fanatics Trump might put in charge of the Census Bureau, after all this work on research and testing, designed to get the best data we can out of a very messy and imperfect situation.

What else would they do? Will they continue to develop ways to identify and count same-sex couples? The Supreme Court says they can get married, but there is no law that says the Census Bureau has to count them. What about multilingual efforts to reach immigrant communities? This has been a focus of Census Bureau development as well. And so on.

It is absolutely in Trump’s interest, and the interests of those who he serves (not the people who voted for him), to reduce the quality and quantity of social science data the government produces and enables us to produce.


My, what dimorphic parents you have!

Quick note to add the new Disney princess movie Moana to the animated gender series.

As in the case of Hercules, Disney can claim that the giant male Maui is a demigod so it’s normal that he’s many times larger than the princess, Moana. (There are a lot of large-bodied people in some Polynesian societies, but I don’t think this is a sex-specific pattern.) So instead look at Moana’s parents.


His big toe has the same diameter as her wrist. His unflexed bicep is wider than her waist. (Sources say the voice actor for Maui has 20-inch biceps, while a real life-sized Barbie doll would have an 18-inch waist, compared with 31 inches for a typical 19-year-old woman.) Anyway, it’s ridiculous.

But this is not unusual for animated kids-movie parents. Here are the parents from Brave and How to Train Your Dragon:



So, extreme dimorphism among parents is common in this genre. Why? I can’t say for sure, but here’s a clue — the parents from Frozen:


My, how similar their bodies are! Sure, her eyes are bigger than his mouth, and his hand is a little engorged, but that’s because there’s a baby in the scene. In the scale of things, they’re practically twins.

If the difference is in the racial or ethnic context of the families, then maybe extreme dimorphism among parents helps signify the exoticism of the culture depicted. Of course Black men are often stereotyped as having superhuman bodies, but super petite women don’t usually go along with that particular trope, so I’m not sure how to interpret this. Ideas welcome.


