Researchers from the University of Basel quantified the impact of Generative AI adoption on academic performance in social and behavioral sciences, revealing substantial increases in publications (15% in 2023, 36% in 2024) and modest improvements in journal impact factors (1.3% in 2023, 2.0% in 2024). The study highlights particular benefits for non-native English speakers, early-career researchers, and those in technically demanding fields, suggesting a potential to reduce structural barriers in academic publishing.
Generative AI is transforming higher education, yet systematic evidence on student adoption remains limited. Using novel survey data from a selective U.S. college, we document over 80 percent of students using AI academically within two years of ChatGPT's release. Adoption varies across disciplines, demographics, and achievement levels, highlighting AI's potential to reshape educational inequalities. Students predominantly use AI for augmenting learning (e.g., explanations, feedback), but also to automate tasks (e.g., essay generation). Positive perceptions of AI's educational benefits strongly predict adoption. Institutional policies can influence usage patterns but risk creating unintended disparate impacts across student groups due to uneven compliance.
The COVID-19 pandemic caused widespread disruptions to education, with school closures affecting over one billion children. These closures, aimed at reducing virus transmission, resulted in significant learning losses, particularly in mathematics and science. Using data from TIMSS 2023, which assesses fourth and eighth-grade achievements across 71 education systems, this study analyzes the impact of school closure duration on learning outcomes. Mixed-effect models estimate deviations from pre-pandemic trends, adjusting for demographic factors. Results show a global average decline of 0.11 standard deviations (SD) in student achievement, with longer closures linked to more severe losses. The effects on low performers, girls, and linguistic minorities show effect sizes up to 0.22 SD. These findings highlight the lasting impact of school closures and emphasize the need for targeted recovery strategies and international cooperation to promote equitable educational outcomes post-pandemic.
Analysis of effect heterogeneity at the group level is standard practice in empirical treatment evaluation research. However, treatments analyzed are often aggregates of multiple underlying treatments which are themselves heterogeneous, e.g. different modules of a training program or varying exposures. In these settings, conventional approaches such as comparing (adjusted) differences-in-means across groups can produce misleading conclusions when underlying treatment propensities differ systematically between groups. This paper develops a novel decomposition framework that disentangles contributions of effect heterogeneity and qualitatively distinct components of treatment heterogeneity to observed group-level differences. We propose semiparametric debiased machine learning estimators that are robust to complex treatments and limited overlap. We revisit a widely documented gender gap in training returns of an active labor market policy. The decomposition reveals that it is almost entirely driven by women being treated differently than men and not by heterogeneous returns from identical treatments. In particular, women are disproportionately targeted towards vocational training tracks with lower unconditional returns.
Decision-making plays a pivotal role in shaping outcomes in various disciplines, such as medicine, economics, and business. This paper provides guidance to practitioners on how to implement a decision tree designed to address treatment assignment policies using an interpretable and non-parametric algorithm. Our Policy Tree is motivated on the method proposed by Zhou, Athey, and Wager (2023), distinguishing itself for the policy score calculation, incorporating constraints, and handling categorical and continuous variables. We demonstrate the usage of the Policy Tree for multiple, discrete treatments on data sets from different fields. The Policy Tree is available in Python's open-source package mcf (Modified Causal Forest).
Until the mid 1960s, the UK experienced regular measles epidemics, with the vast majority of children being infected in early childhood. The introduction of a measles vaccine substantially reduced its incidence. The first part of this paper examines the long-term human capital and health effects of this change in the early childhood disease environment. The second part investigates interactions between the vaccination campaign and individuals' endowments as captured using molecular genetic data, shedding light on complementarities between public health investments and individual endowments. We use two identification approaches, based on the nationwide introduction of the vaccine in 1968 and local vaccination trials in 1966. Our results show that exposure to the vaccination in early childhood positively affects adult height, but only among those with high genetic endowments for height. We find no effects on years of education; neither a direct effect, nor evidence of complementarities.
This study evaluates the macroeconomic effects of active labour market policies (ALMP) in Germany over the period 2005 to 2018. We propose a novel identification strategy to overcome the simultaneity of ALMP and labour market outcomes at the regional level. It exploits the imperfect overlap of local labour markets and local employment agencies that decide on the local implementation of policies. Specifically, we instrument for the use of ALMP in a local labour market with the mix of ALMP implemented outside this market but in local employment agencies that partially overlap with this market. We find no effects of short-term activation measures and further vocational training on aggregate labour market outcomes. In contrast, wage subsidies substantially increase the share of workers in unsubsidised employment while lowering long-term unemployment and welfare dependency. Our results suggest that negative externalities of ALMP partially offset the effects for program participants and that some segments of the labour market benefit more than others.
In this paper we develop a new machine learning estimator for ordered choice models based on the random forest. The proposed Ordered Forest flexibly estimates the conditional choice probabilities while taking the ordering information explicitly into account. In addition to common machine learning estimators, it enables the estimation of marginal effects as well as conducting inference and thus provides the same output as classical econometric estimators. An extensive simulation study reveals a good predictive performance, particularly in settings with non-linearities and near-multicollinearity. An empirical application contrasts the estimation of marginal effects and their standard errors with an ordered logit model. A software implementation of the Ordered Forest is provided both in R and Python in the package orf available on CRAN and PyPI, respectively.
Estimators that weight observed outcomes to form effect estimates have a long tradition. Their outcome weights are widely used in established procedures, such as checking covariate balance, characterizing target populations, or detecting and managing extreme weights. This paper introduces a general framework for deriving such outcome weights. It establishes when and how numerical equivalence between an original estimator representation as moment condition and a unique weighted representation can be obtained. The framework is applied to derive novel outcome weights for the six seminal instances of double machine learning and generalized random forests, while recovering existing results for other estimators as special cases. The analysis highlights that implementation choices determine (i) the availability of outcome weights and (ii) their properties. Notably, standard implementations of partially linear regression-based estimators, like causal forests, employ outcome weights that do not sum to (minus) one in the (un)treated group, not fulfilling a property often considered desirable.
Using administrative data on all induced abortions recorded in Spain in 2019, we analyze the characteristics of women undergoing repeat abortions and the spacing between these procedures. Our findings indicate that compared to women experiencing their first abortion, those who undergo repeat abortions are more likely to have lower education levels, have dependent children, live alone, or be foreign-born, with a non-monotonic relationship with age. We also report that being less educated, not employed, having dependent children, or being foreign-born are all strongly related to a higher number of repeat abortions. Lastly, we find that being less educated, foreign-born, or not employed is correlated with a shorter time interval between the last two abortions.
Despite the widespread use of graphs in empirical research, little is known about readers' ability to process the statistical information they are meant to convey ("visual inference"). We study visual inference within the context of regression discontinuity (RD) designs by measuring how accurately readers identify discontinuities in graphs produced from data generating processes calibrated on 11 published papers from leading economics journals. First, we assess the effects of different graphical representation methods on visual inference using randomized experiments. We find that bin widths and fit lines have the largest impacts on whether participants correctly perceive the presence or absence of a discontinuity. Our experimental results allow us to make evidence-based recommendations to practitioners, and we suggest using small bins with no fit lines as a starting point to construct RD graphs. Second, we compare visual inference on graphs constructed using our preferred method with widely used econometric inference procedures. We find that visual inference achieves similar or lower type I error (false positive) rates and complements econometric inference.
This study quantified the extent of taste-based discrimination against gay individuals in the Chilean labor market, revealing that direct questioning significantly underreports discomfort compared to anonymous list experiments. Researchers found that discomfort is highest in supervisory relationships and varies across demographic groups, suggesting the persistence of taste-based discrimination despite legal progress.
This paper provides quasi-experimental evidence on how income taxes affect gross hourly wages, utilizing Danish administrative data and a tax reform that introduced joint taxation. Exploiting spousal income for identification, we present nonparametric, difference-in-differences graphical evidence among husbands. For low-income workers, taxes have negative and dynamic effects on wages; their wage elasticity with respect to net-of-marginal-tax rates is 0.4. For medium-income workers, the effects are smaller and insignificant. Wages respond to taxes through promotions or job-to-job transitions. Neither daily nor annual hours worked respond significantly; consequently, annual earnings respond to taxes primarily through hourly wages, rather than through labor supply.
The Coase Theorem has a central place in the theory of environmental economics and regulation. But its applicability for solving real-world externality problems remains debated. In this paper, we first place this seminal contribution in its historical context. We then survey the experimental literature that has tested the importance of the many, often tacit assumptions in the Coase Theorem in the laboratory. We discuss a selection of applications of the Coase Theorem to actual environmental problems, distinguishing between situations in which the polluter or the pollutee pays. While limited in scope, Coasian bargaining over externalities offers a pragmatic solution to problems that are difficult to solve in any other way.
We study how the diffusion of broadband Internet affects social capital using two data sets from the UK. Our empirical strategy exploits the fact that broadband access has long depended on customers' position in the voice telecommunication infrastructure that was designed in the 1930s. The actual speed of an Internet connection, in fact, rapidly decays with the distance of the dwelling from the specific node of the network serving its area. Merging unique information about the topology of the voice network with geocoded longitudinal data about individual social capital, we show that access to broadband Internet caused a significant decline in forms of offline interaction and civic engagement. Overall, our results suggest that broadband penetration substantially crowded out several aspects of social capital.
Surveys are an indispensable source of data for applied economic research; however, their reliance on self-reported information can introduce bias, especially if core variables such as personal income are misreported. To assess the extent and impact of this misreporting bias, we compare self-reported wages from the German Socio-Economic Panel (SOEP) with administrative wages from social security records (IEB) for the same individuals. Using a novel and unique data linkage (SOEP-ADIAB), we identify a modest but economically significant reporting bias, with SOEP respondents underreporting their administrative wages by about 7.3%. This misreporting varies systematically with individual, household, and especially job and firm characteristics. In replicating common empirical analyses in which wages serve as either dependent or independent variables, we find that misreporting is consequential for some, but not all estimated relationships. It turns out to be inconsequential for examining the returns to education, but relevant for analyzing the gender wage gap. In addition we find that misreporting bias can significantly affect the results when wage is used as the independent variable. Specifically, estimates of the wage-satisfaction relationship are substantially overestimated when based on survey data, although this bias is mitigated when focusing on interpersonal changes. Our findings underscore that survey-based measures of individual wages can significantly bias commonly estimated empirical relationships. They also demonstrate the enormous research potential of linked administrative-survey data.
Binary treatments are often ex-post aggregates of multiple treatments or can be disaggregated into multiple treatment versions. Thus, effects can be heterogeneous due to either effect or treatment heterogeneity. We propose a decomposition method that uncovers masked heterogeneity, avoids spurious discoveries, and evaluates treatment assignment quality. The estimation and inference procedure based on double/debiased machine learning allows for high-dimensional confounding, many treatments and extreme propensity scores. Our applications suggest that heterogeneous effects of smoking on birthweight are partially due to different smoking intensities and that gender gaps in Job Corps effectiveness are largely explained by differential selection into vocational training.
In their IZA Discussion Paper 10247, Johansson and Lee claim that the main result (Proposition 3) in Abbring and Van den Berg (2003b) does not hold. We show that their claim is incorrect. At a certain point within their line of reasoning, they make a rather basic error while transforming one random variable into another random variable, and this leads them to draw incorrect conclusions. As a result, their paper can be discarded.
Fairness and interpretability play an important role in the adoption of decision-making algorithms across many application domains. These requirements are intended to avoid undesirable group differences and to alleviate concerns related to transparency. This paper proposes a framework that integrates fairness and interpretability into algorithmic decision making by combining data transformation with policy trees, a class of interpretable policy functions. The approach is based on pre-processing the data to remove dependencies between sensitive attributes and decision-relevant features, followed by a tree-based optimization to obtain the policy. Since data pre-processing compromises interpretability, an additional transformation maps the parameters of the resulting tree back to the original feature space. This procedure enhances fairness by yielding policy allocations that are pairwise independent of sensitive attributes, without sacrificing interpretability. Using administrative data from Switzerland to analyze the allocation of unemployed individuals to active labor market programs (ALMP), the framework is shown to perform well in a realistic policy setting. Effects of integrating fairness and interpretability constraints are measured through the change in expected employment outcomes. The results indicate that, for this particular application, fairness can be substantially improved at relatively low cost.
Pursuing educational qualifications later in life is an increasingly common phenomenon within OECD countries since technological change and automation continues to drive the evolution of skills needed in many professions. We focus on the causal impacts to economic returns of degrees completed later in life, where motivations and capabilities to acquire additional education may be distinct from education in early years. We find that completing an additional degree leads to more than \3000 (AUD, 2019) extra income per year compared to those who do not complete additional study. For outcomes, treatment and controls we use the extremely rich and nationally representative longitudinal data from the Household Income and Labour Dynamics Australia survey (HILDA). To take full advantage of the complexity and richness of this data we use a Machine Learning (ML) based methodology for causal effect estimation. We are also able to use ML to discover sources of heterogeneity in the effects of gaining additional qualifications. For example, those younger than 45 years of age when obtaining additional qualifications tend to reap more benefits (as much as \50 per week more) than others.
There are no more papers matching your filters at the moment.