Marc F. Bellemare

“On the (Mis) Use of the Fixed Effects Estimator” Now Forthcoming at the Oxford Bulletin of Economics and Statistics

Marc F. Bellemare — Mon, 03 Nov 2025 11:00:00 +0000

My paper with Dan Millimet titled “On the (Mis) Use of the Fixed Effects Estimator” has been accepted and is now forthcoming at the Oxford Bulletin of Economics and Statistics. If you want a link to a .pdf of the accepted version of the paper or to a Stata .do file showing you how to use the alternative estimators we discuss in the paper, scroll all the way to the end of this post. If you want a bit of storytelling about how this paper came about, and what it does, read on.

I forget when Dan and I first discussed this, but this paper was born out of the two of us connecting and bonding over social media during the pandemic. Since about 2014, I had had in mind the idea that if you have, say, 10 years worth of longitudinal data on workers, why was the default way to deal with that to use one individual fixed effect per worker? Why not two fixed effects per worker–one for years 1 to 5, and one for years 6 to 10? Why not more than two? What is the optimal number of fixed effects per unit of observation when you have a “long” panel? At some point, Dan and I discussed this and realized we had both been thinking about the same thing, and so we set to work on this paper. (It’s really Dan’s paper, I was just along for the ride. And while Dan likes to joke that he’s not really an econometrician, he just plays one on TV, I think that’s just humility speaking.)

This is especially important for two reasons. First, if you have only a handful of observations over time per unit of observation (say, you follow workers over two, maybe three years), then yes, you can probably argue that individual fixed effects do a good job of purging the error term of unobserved heterogeneity that is correlated with the covariates because said heterogeneity is arguably time-invariant given that individuals do not change that much over the span of two or three years.

But we are not the 1990s anymore: We now have access to much longer panel data sets, and as one adds additional observations over time to a panel data set, there is a lot less that remains time-invariant, and fixed effects become much less useful for identification. In the limit, as the number of time periods goes to infinity, the fixed effects estimator does no better than a pooled OLS. As we note both in this article and in our article earlier this year in the Journal of Economic Perspectives on Yair Mundlak and the fixed effects estimator, this is something that Mundlak himself recognized in his seminal 1961 article in the Journal of Farm Economics (now the American Journal of Agricultural Economics), in which he brought to economics the first application of the fixed effects estimator.

So far so good. But this brings us to the second problem: Why did the fixed effects estimator become the de facto way to deal with heterogeneity in panel data among the reduced-form applied micro crowd, a crowd that is notoriously picky about and likely to cry foul at identification, especially when there are much better options (e.g., first differences) to account for the fact that, in this sad Heraclitean world of ours, although an individual today might be comparable to herself last year, that same individual today is much less likely to be comparable to herself ten years ago?

For whatever it is worth, this is a particularly egregious problem in political science, where scholars often rely on cross-country fixed effects over periods of time in excess of 25 years. But what does remain constant over time for an entire country? Climate and topography can change. Cultures certainly change as well. Even a country’s borders can change in 25 years! So if you always were skeptical of cross-country longitudinal studies but could never quite put your finger on why, here is your huckleberry.

(The funny thing is that when we submitted this to a leading political science journal, we were told by the editor, with my own emphasis: “[W]e feel that the contribution is not strong enough for a top general interest journal … [w]e also feel that the research design and empirical strategy remain underdeveloped.” I am no political scientist, so I cannot assess how valid the former statement is. But when I wrote to the editor in charge about the latter statement, asking whether we were talking about the same paper since our paper is not your usual application to a research question, and thus does not really have a research design or identification strategy, I never heard back from them…)

In this article, we show how the fixed effects estimator can (and often does) break down with ever-longer panels, and we discuss alternative (and often better) means of dealing with heterogeneity in panel data including first differences, of course, but also interactive fixed effects as well as novel rolling estimators. We then illustrate our point with Monte Carlo simulations and by replicating four sets of results published in leading journals—with only one such set of results turning out to be robust.

Here is the accepted version, and here is the abstract:

Data that span multiple units and time periods allow controlling for time-invariant heterogeneity correlated with the covariates. While researchers can do this in different ways, the fixed effects estimator—also known as the within estimator, and equivalent to the least squares dummy variable approach—has become the default choice. But when time-invariant attributes are not invariant to time—that is, when they are not invariant to the length of the panel—the fixed effects estimator can be considerably biased as researchers incorporate additional time periods. We show that, in finite samples, first-differencing and novel rolling estimators can offer researchers a practical alternative to the fixed effects estimator in this case. These estimators are simple to implement and can significantly reduce bias relative to the fixed effects estimator under certain data-generating processes. Most importantly, researchers should always provide results from multiple estimators. We illustrate this with simulations and four replications.

If you would like to use the alternative estimators to fixed effects we discuss in the paper, you can find a Stata template in this .zip file.

“Global Agricultural Value Chains and Food Prices” Now Forthcoming at the American Journal of Agricultural Economics

Marc F. Bellemare — Thu, 30 Oct 2025 16:55:25 +0000

My paper with Bernhard Dalheimer titled “Global Agricultural Value Chains and Food Prices” has been accepted and is now forthcoming at the American Journal of Agricultural Economics.

I am glad that this is finally accepted for publication: Bernhard and I first discussed it in 2022 when he was here for his postdoc, and he began presenting it when he was on the job market that same year, in early 2023. Since then, the paper has only gotten better as a result of comments from colleagues. I really like that it marries Bernhard’s interests in international trade and agricultural value chains with my own interest in food prices and agricultural value chains.

It also helps that the results tell an interesting story: As countries become more involved in global agricultural value chains, food prices go down, but food price volatility increases. Given what we know about how consumers and producers respond to food price volatility, then, it is no surprise that low- and middle-income countries may be reticent to liberalizing their agricultural sector even as high-income countries insist that they do so.

Here is the accepted version, and here is the abstract:

We study the relationship between the extent of participation in global agricultural value chains (GAVCs) and food prices at the country level. Using longitudinal data on a sample of 138 countries for the period 2000–2015 and a shift-share instrumental variable design, we study how the extent of a country’s participation in GAVCs in a given year relates to food price levels and volatility in that same country and in the same year. We document a mean–variance trade-off in food prices, finding that participation in GAVCs is associated with a decrease in consumer food price levels but an increase in food price volatility. Looking at a country’s upstream (i.e., closer to producers) or downstream (i.e., closer to consumers) positioning in GAVCs, we find that food price volatility is associated more strongly with downstream participation than with upstream participation.

One Important Thing Lost in Discussions of the Terrible, Horrible, No Good, Very Bad Econ Job Market

Marc F. Bellemare — Tue, 21 Oct 2025 19:43:20 +0000

The job market for econ PhDs is bad. It’s not just bad: It’s as-bad-if-not-worse-than-during-a-global-pandemic bad. Here is a screen capture of the data visualization the American Economics Association has on its Job Openings for Economists website:

The story here is not just that the job market is bad this year; the story is that, with the exception of an uptick in 2022, the job market has been going from bad to worse over the last six years. I knew this anecdotally from looking at how my PhD students on the job market have been doing over that time period, but it is sobering to see my intuition supported by actual job market data.

I have heard a variety of reasons for why this is happening. “The Trump administration doesn’t like experts!” Yeah? Doesn’t explain why the trend predates January 2025 by quite a few years. “AI is replacing us!” Be that as it may, ChatGPT, Claude, and Gemini weren’t exactly household names in 2023 or even early 2024. “There’s an enrollment cliff coming!” Sure, è pericoloso sporgersi, but since when do teaching needs directly dictate hiring needs, under the “if you build it, they will come” logic? And so on, and so forth.

#iykyk Adieu, Gotlib. Merci pour ces années de rire.

For all I know, it’s a question of… economics. See, for years there was this profession where you were all but guaranteed a really good job—one that would reward you handsomely with money, the freedom to work on whatever you felt like or both, in some rare and much-coveted jobs at top departments—if only you were willing to put in the hard work of getting a PhD. And for a long time, this was true year in, year out.

I know that economic theory is viewed as old-fashioned by and has fallen out of favor with younger people, but anyone who has done a PhD in economics (or even a PhD in something econ-adjacent like business or public policy) has heard of allocative efficiency, the idea whereby resources tend to flow to where their productivity (and thus their wage or rent) is the highest. Given that, is it any surprise that bright young people flocked to econ (and econ-adjacent) PhD programs? Is it any surprise that eventually the gap between demand and supply would be filled, and that there would then be an excess supply of workers in that lucrative profession, especially given the five-, six-, and even seven-year time lag between the start of a PhD and the job market’s verdict on a fresh PhD?

But there’s also another, no-less-important fact that seems to be getting lost in discussions of how bad the job market is. I alluded somewhat snarkily above to the fact that economic theory is viewed as old-fashioned by and has fallen out of favor with younger people. At best, it is seen as yet another entry barrier to be dispensed with via coursework and sundry qualifying exams. I cannot speak for macro or for more structural fields (e.g., industrial organization), but as someone who is working under the broad umbrella of reduced-form applied microeconomics, I have seen the field change from relying heavily on theory to derive testable predictions that were tested almost exclusively with observational data in the early 2000s to something much more atheoretical relying on causal inference methods, with or without experimental data.

One of my coauthors likes to joke that difference-in-differences is all younger people seem to know about nowadays. That is a bit unfair, but it does contain a kernel of truth: A lot of the work done in applied micro nowadays is almost entirely empirical, often with little to no discussion of the economics of the application—and that’s if and when people look at an application that is actually economic in nature rather than looking at questions that seem more relevant to other disciplines.

As I have been telling graduate students lately: The empirical methods you use—diff-in-diffs, for sure, but also RCTs, IV, RDD, synthetic control, shift-share, etc.—are tools that are accessible to people in other disciplines just as much as they are available to you. In fact, there are social scientists who do excellent work writing about econometrics from outside of economics (a lot of them were trained as political scientists and teach in political science departments; people like Matt Blackwell, Adam Glynn, Cyrus Samii, Maya Sen, and so on), and they often do a much better job of explaining their contributions to applied researchers than your friendly neighborhood Real Rigorous Econometrician. So at the end of the day, if you are going to be doing work that is hard to distinguish from the work done by applied researchers in other disciplines, you are only setting yourself up as a more expensive version of what those other disciplines can offer.¹

So what is to be done, as Lenin famously asked? For starters, one might want to bring economic theory back into what they are doing. The one thing that separates economists from other social scientists is that economic theory gives us a way to analyze the world—one that often leads to conclusions that are counterintuitive or surprising. I’m not arguing for full-blown structural work, just for bringing back economics in (reduced-form) applied micro research.²

“But we know about identification and do a better job of identifying causal relationships than other disciplines!” Bless your heart. During those Seven Blighted Years during which I taught at a policy school, first-semester MPP students were taught about causal inference. ︎
And again, I am not a macroeconomist nor am I a structural econometrician, and so I do not speak for what those guys do. ︎

Finding Your Research Niche in Agricultural Economics (and Beyond)

Marc F. Bellemare — Tue, 23 Sep 2025 15:29:40 +0000

This was the title of the talk I gave this morning to AgEconMeet, the network of (mainly) junior European agricultural economists “working to build a strong community of researchers ready to tackle today’s key challenges in agricultural economics.”

In my talk, I shared my experience finding my research niche as a grad student, pre-tenure, and post-tenure.

It was nice to prepare slides for this, as it gave me a chance to reflect on how I have approached the last few decades in my professional life. You can find my slides here in .pdf format.

Quality vs. Quantity in Publishing Redux

Marc F. Bellemare — Thu, 28 Aug 2025 13:34:44 +0000

My post earlier this year, titled “Quality vs. Quantity in Publishing,” has made waves, apparently. Colleagues as far as Asia, Europe, and Latin America have told me they brought it to the attention of grad students under their supervision.

I have never been under any illusion that my research would influence policy—a belief which has only become stronger these past eight months—and so it is good to know that what I write might at least have an influence on my chosen profession.

Unfortunately, from what I hear and from some of the comments I saw on LinkedIn, where many of us have now retreated,¹ it seems some people took a descriptive discussion of the trade-off between quality and quantity in publishing as a green screen on which to project their insecurities.

As far as I can tell, there was nothing offensive about my describing the quantity approach to being a successful agricultural and resource economist in my earlier post. As I wrote then, quantity has a quality all of its own, and with a greater quantity of articles comes a significantly greater likelihood that a high-quality article will emerge. The only thing I could maybe see as controversial if I squint hard enough was when I wrote that if a department wants to go up the rankings, it should invest in quality rather than quantity. But then again, I don’t know why that’s controversial because, well… just look around for proof.

(In what follows, I will be using originality to differentiate between a high- vs. low-quality article, ceteris paribus. It will soon become obvious why I do that.)

But there is one rather pernicious way in which quantity is a decreasingly useful strategy for success.

With the advent of and constant improvement in generative AI, the returns to quantity will decrease significantly, and the returns to quality will increase significantly.

What I mean by this is that thanks to generative AI, the quantity of unoriginal-but-competently-done-and-written articles will explode. To take an example I know well, there will be many more ho-hum articles looking at the “effects” of participation in contract farming on income with shaky identification strategies.

(This assumes that generative AI cannot come up with original and interesting research ideas on its own, something I believe will remain true for a good long while.)

As a result, we are likely to witness the academic equivalent of Steve Bannon’s “flood the zone with shit” strategy. Journals will see a firehose of unoriginal-yet-competently-done-and-written articles, and their editors (or their own algorithms) will not be able to tell the difference between unoriginal-yet-competently-done-and-written articles written by generative AI and submitted by unscrupulous authors on the one hand and unoriginal-yet-competently-done-and-written articles written by authors who submit their own work.

So what happens then? Journal editors will have to come up with a means of choosing what gets published that goes beyond whether an article is competently done and written well. And what they will use as a rationing device is likely to be whether a research question is original and interesting. In other words, they will look for whether something is of good quality since, in a world where everything is written well and competently done, originality will be the only mark of authenticity—and thus of quality.

My point is this: If I were a graduate student these days, not only would I choose not to do development economics, but I would also make triply sure that I invest in quality rather than quantity. Because many academics become obsolete as soon as they defend their dissertations, this might require bucking the advice of many advisors, but the payoff will in all likelihood be worth it.

Twitter long ago lost its usefulness to academics, and Bluesky never had any, because it turns out that the thing that is most hated about Twitter—its algorithm—is the thing that made everyone love Twitter in the first place. It’s almost as though there is money to be made selling outrage. More seriously, going from Twitter to Bluesky is like going from a right-wing Charybdis to a left-wing Scylla, and the last thing I want (or need) in my life are more polarized and polarizing viewpoints. But if you can get past hollow congratulations on your work anniversary, LinkedIn is a good way to share professional stuff. ︎

A Fool’s Errand? The Inverse Productivity Relationship Reconsidered

Marc F. Bellemare — Mon, 04 Aug 2025 10:00:00 +0000

That’s the title of a new working paper my brilliant student Ling Yao (she is on the market this year, and she will make a great hire for anyone looking for someone working on agricultural economics, labor economics, agribusiness, applied econometrics, or a combination thereof) and I put the finishing touch to this past weekend.

Here is the abstract, with what strikes me as the most exciting things about this paper in boldface font:

An inverse unconditional relationship between farm or plot size (e.g., hectares) and productivity (e.g., kilograms per hectare) is often observed in low- and middle-income countries that appears to be at odds with economic theory. The traditional approach to studying the inverse relationship regresses yield (i.e., output divided by size) on size as well as control variables, testing the null hypothesis that the coefficient on size is zero. We first show that in many circumstances, the relevant null hypothesis is misspecified because the estimand cannot be zero. Moreover, because size appears on both sides of the equation—indirectly on the left-hand side as denominator, and directly on the right-hand side as a measure of size—inherent issues arise with the identification of the relationship between size and productivity. Specifically, any unobserved production factor, even if independent from size, will introduce bias in the estimated coefficient. We next highlight persistent methodological flaws and contradictions in the literature on the inverse size–productivity relationship, discussing how better controls and more precise measurements are unlikely to ensure unbiased estimates. We further identify the stringent requirements that need to be satisfied to correctly estimate the relationship. Finally, we conduct a meta-analysis of the literature on the inverse relationship, discussing the evolution of empirical specifications and documenting evidence of publication bias in favor of negative and significant estimates of the relationship between size and productivity.

Ars longa, vita brevis. This paper is the fruit of several years of thinking. I remember working on early analytical derivations during the summer of 2018, trying to overcome jetlag while in Tokyo to teach a short course at Waseda University. This paper is also a contribution to a literature that is both old and new. Next year will mark the hundredth anniversary of A.V. Chayanov documenting the existence of an unconditional inverse relationship between farm size and productivity in Russia. But as we document in our meta-analysis, the number of studies on the inverse relationship has practically exploded since 2010. And over the last 100 years, the inverse relationship has captivated the attention of many researchers, including that of a Nobel laureate.

Survey Ordering and the Measurement of Welfare

Marc F. Bellemare — Wed, 30 Jul 2025 12:47:05 +0000

At long last, my article with Wahed Rahman and Jeff Bloem titled “Survey Ordering and the Measurement of Welfare” has been published open access in the Journal of the Economic Science Association.

Here is the abstract:

“Economic policy and research rely on the accurate measurement of welfare. In nearly all instances, measuring welfare requires collecting data via long household surveys. If survey response patterns change over the course of a survey to introduce measurement error, this measurement error can be either classical (i.e., changing distributions, leading to noise) or non-classical (i.e., changing expectations, leading to bias). We embed an experiment in a survey by randomly assigning a questionnaire with either the assets module near the beginning of the survey or the assets module at the end of the survey, delaying enumeration of assets by about 60 minutes. We find no evidence in the full sample that survey ordering introduces differential response patterns, either in the number of reported assets or the reported value of those assets. In exploratory analysis of heterogeneity, we find evidence of non-classical measurement error due to survey ordering within sub-samples of respondents who (i) are from larger households or (ii) have low levels of education. Our experimental design can be generalized to serve as an ex-post test of data quality with respect to questionnaire length.”

Writing Matters

Marc F. Bellemare — Fri, 06 Jun 2025 16:59:42 +0000

Having spent last weekend in in my hometown for the Canadian Economics Association/Canadian Agricultural Economics Society annual meetings, I was asked by a classmate from way back to give a short talk about writing in economics (especially for people for whom English is a second language) to the participants of a writing retreat for economics graduate students across all four of Montreal’s universities.

Here are the slides of that short talk, in which I tried to go beyond what I had already said in Doing Economics.

Yair Mundlak and the Fixed Effects Estimator

Marc F. Bellemare — Fri, 09 May 2025 16:26:31 +0000

That’s the title of my latest article, coauthored with Dan Millimet, and which just came out in the spring issue of the Journal of Economic Perspectives. Here’s the abstract:

We discuss Yair Mundlak’s (1927–2015) contribution to econometrics through the lens of the fixed effects estimator. We set the stage by discussing Mundlak’s life and his seminal 1961 article in the Journal of Farm Economics, showing how it was looking at the right application—the study of agricultural productivity, which had hitherto been thought to be marred by the presence of management bias—that led Mundlak to use the fixed effects estimator. After discussing Mundlak’s contribution, we briefly discuss the historical economic and statistical contexts in which he made that contribution. We then highlight the dialogue that took place between the proponents of fixed versus random effects and discuss how Mundlak settled the debate in his 1978 Econometrica article. We conclude by discussing how, between fixed and random effects, the fixed effects estimator won the day, becoming the de facto estimator of choice among applied economists because of the Credibility Revolution, culminating in the popularity nowadays of difference-in-differences designs and of two-way fixed effects estimators.

I have learned a lot while working on this article. Like many people, I thought Mundlak himself had developed the fixed effects estimator. But that turned out not to be true: The estimator was already available; what Mundlak did which forever associated his name with the fixed effects estimator was to find the right application for it. Having recently published an article in which my coauthors and I present the first application of an estimator to economics, this was very encouraging.

Quality vs. Quantity in Publishing

Marc F. Bellemare — Fri, 31 Jan 2025 21:49:18 +0000

As a result of having served as editor of Food Policy (2015-2019) and the American Journal of Agricultural Economics (2019-2023), I’ve been asked to write a lot of tenure and promotion letters these past ten years. This has given me a chance to reflect on how people can be successful in agricultural economics. Given that, I thought I should uncover some of the hidden curriculum behind how people’s research portfolios are evaluated on the job market, for tenure, for promotion, and so on.

But first, what does “successful” mean? This is where objective criteria come into play, but I can think of several proxies for success.¹ First, there is success at the extensive margin: Does someone get tenure? Do they get promoted? Are they considered for endowed chairs or named professorships? Second, there is success at the intensive margin: How many Google Scholar or Web of Science citations does someone have? What is their h-index? What is their salary relative to comparable matches? And then there is stuff like the kind of job offers they get when they go on the market.

“Quantity has a quality all of its own,” a mentor once told me. By that, he was alluding to the fact that while some researchers in our discipline (i.e., agricultural economics) are known for publishing high-quality articles, others are known for publishing a high quantity of articles, and that publishing a high quantity of articles can eventually add up to quality. I wanted to talk about quantity, quality, or even both can be leveraged in terms of having a scholarly impact.

Quality

I think there is broad agreement on what “quality” means in agricultural economics: Assuming a research article in the American Journal of Agricultural Economics (AJAE, i.e., the journal in agricultural economics) is the numéraire (i.e., the yardstick by which other publications are evaluated), we can express other publications in terms of AJAE equivalents.

While the precise value of each publication does remain subjective, there is widespread agreement that an article in a top-five economics journal is definitely worth more than one article in the AJAE. For an article in a general science journal like Nature, Science, of PNAS, opinions differ. Some people will view those as > 1 AJAE. Others will view them as ≈ 1 AJAE. Yet others will view them as < 1 AJAE. Articles in top field journals (e.g., Journal of Development Economics, Journal of the Association of Environmental and Resource Economists), will usually be worth 1 AJAE, if not more. Articles in regional agricultural economics journals will usually be worth < 1 AJAE. Predatory or near-predatory journals tend to be worth 0 AJAE.² Given that rough ranking, people typically have a subjective criterion for how much quality they want to see.

Quantity

Quantity might lead to quality a few different ways. First, black swans do crop up. If someone publishes five or six articles year in, year out, eventually one of them is bound to be in the far-right tail of the quality distribution. To see this, look at Woody Allen’s filmography here. Whatever one might think of him, in the 58 years between 1965 and 2023, there were only seven years during which Woody Allen did not release a film he either directed, wrote, or acted in. Of the 65 films he did release (did I forget to mention there were 10 years where he released more than one?), only three won an Oscar: Annie Hall, Hannah and Her Sisters, and Midnight in Paris. Unless you’re a film buff, odds are those are the ones you’ve heard of.

The same can happen with research. If you publish five to ten articles a year, the Law of Truly Large Numbers says that every once in a while, you’ll put out a real banger. If you’re econometrically minded, call that a within-article quality effect.

Second, if “quality” is defined as the number of citations to one’s work,³ quantity can generate citations-as-quality via spillover effects. To see this, consider the following anecdote told by Ryan Holiday in Perennial Seller, I think. Iron Maiden has never gotten much airplay. They’ve always been a band that few if any radio stations would touch. And yet they’ve been around since before I was born, they’ve put out 17 studio albums and 13 live ones, they keep touring and selling out hockey arenas and, for a time, they even had their own jet airplane.

Case in point: The view from my seats at the Xcel Energy Center on October 22, 2024. Have you seen the writing on the wall?

How did they do it? By releasing either a studio or live album every few years. As Holiday explains it: Every time they release a new album, not only do they sell a lot of that album, they sell a lot of their other albums, too, because people are reminded of Iron Maiden and of their other albums.

If you’re a researcher, this is the type of thing that happens if you become known for working on a given topic, since people will typically cite work by the same author in clusters.⁴ Again, if you’re econometrically minded, call that a between-article quality effect.

Quantity or Quality?

Which of quantity or quality should a researcher target? I think it is reasonable to say that one should avoid corner solutions at all costs. That is, one should avoid focusing purely on quality, and one should avoid focusing purely on quantity. I used the term “portfolio” above to describe the sum total of someone’s scholarly output. I used that term for a reason: Just like you would not want to put all your retirement savings in a single stock, you also would not want to spread your retirement savings across too many stocks. (Ignore mutual or index funds, since there are no such equivalents in publishing.)

What is the right balance of quality and quantity for you? That depends on the incentives you face. In other words: It depends on the job you have, or on the kind of job offer you would like to get if you are in grad school (or if you have a job but would like to get a competing offer).

Some departments will want to see quality above anything else. Others will be happy to let you substitute as much quantity as you want for quality. The one pattern I am aware of here is this: The closer you get to the top in terms of departmental rankings, the less quantity is acceptable as a substitute for quality. At the very top departments, the elasticity of substitution is zero: No amount of quantity can make up for a lack of quality.

Beyond individual-level strategies, this suggests the following institutional-level strategy: If a department wants to work its way up the rankings, it is probably wise to invest in significantly more quality at the expense of quantity. Conversely, a department that encourages quantity is unlikely to move up the rankings.⁵

Those are proxies for success since everyone has their own subjective criteria for what constitutes success, and so in the absence of data that allow comparing subjective outcomes, we can only go by external signals if we want to quantify “success.” ︎
For some, an article in a predatory or near-predatory journal may even reduce the quality of someone’s research portfolio, i.e., be worth < 0 AJAE, because it shows a lack of judgment. ︎
While some citations are clearly better than others, the vast majority of citations are neither good (e.g., a citation from a Nobel laureate, or from an article in a top journal) nor bad (e.g., a citation from an article that criticizes you for being wrong about something, or a citation from a predatory journal). At any rate, while you may not like citations as a measure of impact or quality, your dean, various funders, and colleagues in other departments certainly do, as citations are the only thing easily compared across disciplines. Even the RepEc ranking of economists relies on citations. ︎
For example, suppose someone is working on a topic Someone named Smith worked on. Very often, instead of citing the one or two most relevant paper by Smith, they’ll cite almost all or all of Smith’s output on the topic. They do so for two reasons. First, most people don’t spend much time reading. They’ll remember “Oh, right… Smith said that one thing about this topic. Wait, where did she say it? Ah, screw it. Let’s just cite everything.” Second, they may also do so strategically, to increase the likelihood that Smith will be one of their reviewers. ︎
For instance, I know a department where, in order to get tenure, someone on a 50-50 teaching-research split has to publish at least 2.4 articles per year. Given how much an article in a top field journal requires, this requirement is hardly conducive to that department going up in rankings. ︎