Limits of Tax Policy
Åsa Hansson
Lund Economics Studies number 90
Contents
Acknowledgements
3Chapter 1 Introduction and Summary
1.1 Introduction 6
1.2 Marginal Tax Rates and Incentives 4
1.3 Welfare Dependency 7
1.4 Government Size and Growth 11
1.5 The Laffer Hypothesis 13
1.6 Summary of Findings 16
References 18
Chapter 2 How High are Marginal Effective Tax Rates in the U.S.?
2.1 Introduction 23
2.2 Method and Data 26
2.3 Estimation of Marginal Transfer Effects 30
2.4 Estimation of Marginal Tax Rates 43
2.5 Marginal Effective Tax Rates 52
2.6 Conclusions 59
References 61
Appendix A 66
Appendix B 68
Chapter 3 The Optimal Benefit-Reduction Rate
3.1 Introduction 73
3.2 General Approach 76
3.3 The Model 77
3.3.1 Individual Utility Maximization 78
3.3.2 Government Utility Maximization 80
3.4 Analytical Interpretation of the Tax Rates 83
3.5 Discussion 90
3.5.1 General Discussion 90
3.5.2 Application 96
3.5.3 Limitations 98
3.6 Conclusions 99
References 101
Appendix 104
Chapter 4 Government Size and Growth: An Empirical Study of 21 OECD Countries
4.1 Introduction 107
4.2 Data 112
4.3 Estimation 115
4.4 Results 120
4.5 Multicollinearity and Other Sensitivity Analyses 128
4.6 Conclusions 131
References 133
Appendix 137
Chapter 5 Measurement of Transfers and Peaking of Fiscal Sizes of Government
5.1 Introduction 139
5.2 A Theoretical Limit to the Fiscal Size of Government 143
5.3 Comparing Observed Fiscal Sizes of Government
in OECD Countries to Theoretical Limits 148
of Government 149
5.3.1.1 Detailed Description of Approach 150
5.3.2 Fiscal Sizes Averaged over 1972-1992 155
5.3.3 Fiscal Sizes in Individual Countries over Time 158
5.3.4 Comparing the Actual Peak Levels to Computed Theoretical Limits 161
5.4 Conclusions and Discussion 163
References 165
Appendix A 169
Appendix B 172
Appendix C 173
Acknowledgements
Writing a dissertation is at times a difficult and lonely job, and I would never have finished without the encouragement, stimulation, and help of a great many people. First, I would like to express my gratitude to Göte Hansson who encouraged me to enter the Ph. D program and who saw me through to the end. His support and boundless optimism led me through many moments of doubt. Second, I would like to thank Charles Stuart for introducing me to Public Economics; for providing endless intellectual stimulation, help, and encouragement; and not least for being a good friend. I would like also to thank Göran Skogh for all his support, encouragement, and assistance during his time as my advisor.
I have also benefited from the comments of seminar participants in the international and political economy seminar groups. In particular, I am indebted to Fredrik Andersson who made valuable comments about each of the chapters. I would also like to thank Joakim Gullstrand, Peter Jochumzen, Per-Magnus Lundgren, and Lars Söderstöm, each of whom made insightful comments at various seminars. In addition, I would like to extend thanks to everybody at the departments of economics at the University of California, Santa Barbara and the University of Massachusetts, Amherst, for providing the opportunity for me to conduct part of my research there and for providing a stimulating work environment. I am also very grateful to Cleve Willis, who found the time to read and comment on all my chapters and to David Edgerton for comments related to the econometrics.
I have also received invaluable help from Jörgen Norén, Agnetta Kretz, Ulla Olofsson, Jeanie Petersson and Ann-Charlotte Sahlin with practical matters and Jaya Reddy with my English grammar.
I gratefully acknowledge Jan Wallander and Tom Hedelius' Foundation for providing me with generous financial support. I would also like to thank the Scanian Chamber for Commerce, the Crafoord Foundation, the Siamon Foundation, and the faculty of Social Science for financing various parts of this dissertation.
I am also grateful to each of my colleagues in Lund as well as those at the Universities of California and Massachusetts for making it fun to go to work. I am especially indebted to Fredrik Gallo who made time at work pass quickly.
I am deeply indebted to my parents, Majken and Nils, and my brother, Per, for always supporting me, however crazy my undertakings may have seemed, and for all the help I have received over the years. I am also grateful to all my friends for reminding me that "all work and no play makes Åsa a dull girl". In the end, your diversions improved my productivity during work hours immeasurably.
Lastly, I want to thank Mike and Tor for all you have done for me. Mike, I am truly grateful for all your comments, encouragement and help - even though, at times, it wasn't obvious. Most of all, though, I am grateful to the two of you for being there. Spending time with the two of you has been a good way to end not-so-good days at work.
Lund, July 2000
Åsa Hansson
Chapter 1
Introduction and Summary
During the course of the last century government became a large component of the economy in virtually all developed countries (Peltzman [1980], Borcherding [1985], North [1985], Tanzi and Schuknecht [1997(a)]). In the U.S., for instance, total government expenditures as a fraction of GNP quadrupled from 8 percent in 1900 to 32 percent in 1997 (OECD [1998]). In many European countries, the growth has been even more dramatic. Total government expenditures in Sweden, for example, grew from 6 percent of GNP to 63 percent over the same time period. In the U.K, the government sector increased from 10 to 38 percent of the economy (OECD [1998]).
Figure 1 shows
that government expenditures, as a fraction of GDP, have increased nearly uninterruptedly for a century. The exceptions to this trend are a period after World War II in Germany and Japan, and a period during the 1980s in Sweden, Germany, the U.K., and Japan.This increase in government size has resulted primarily from the expansion of tax and redistribution policies. In general, government transfers have grown two or three times faster per year than government consumption (Peltzman [1980]). In Sweden in 1996, for instance, the average family with one child received 17 percent of their disposable income
Figure 1.
The Growth of General Government Expenditures over Time

The numbers in the figure are from Tanzi and Schuknecht [1997(a)].
Note that the intervals are not equal in length.
as government welfare payments (while income taxes amounted to 38 percent of their earnings alone) (RSV [1998]).
This growth in government, particularly in tax and redistribution, has been suspected of harmful effects on both the national economy and individual households in recent decades (e.g., Okun [1975], Tullock [1976], and Lindbeck [1986]). For instance, redistribution is often associated with efficiency losses and many government activities may crowd out more efficient private sector activities. Tanzi and Schuknecht [1997(b)] found that indicators of both social and economic performance are generally better in countries with medium-sized governments than in countries with large governments.
Aside from the sheer magnitude of government involvement in the economy, tax and redistribution policies are being used with increasingly regularity to shape economic behavior. For instance, some interventions are designed to reward particular actions (merit goods) and others are designed to discourage behavior (cigarette taxes, for example). Even when the effects are unintentional, government tax policies alter incentives and frequently have undesirable consequences for households and the economy.
Given the magnitude of government involvement in the economy and its potential to influence outcomes, whether intentionally or not, research that documents the effect of government size on the economy and offers guidelines can be quite valuable. This thesis addresses some of the problems and limitations that currently face government policy-makers. Chapters two and three deal with the hot topic of welfare reform. Specifically, the second chapter presents estimates of the marginal effective tax rates that support claims that current means-testing rules can trap the poor in dependency on welfare programs. In the third chapter, I develop a theoretical model of the optimal benefit-reduction rate. In the fourth chapter, I study the consequences of government size for growth of the aggregate economy. Finally, in the fifth chapter, I address whether the large government sizes that have evolved are sustainable or whether, in line with recent claims, government sizes have peaked in industrialized nations and are now decreasing.
1.2 Marginal Tax Rates and Incentives
Using neoclassical economic theory, it is straightforward to show that marginal tax rates alter the incentives of individuals to work and save. Consequently, the tax base, the amount of tax revenues that can be collected, and more importantly overall economic performance are affected. Worries that high marginal tax rates are causing high-skilled individuals to reduce their work effort - and thereby reducing the tax base and overall economic performance - have long been expressed. Not only high-income brackets face high marginal tax rates, however. Low-income earners, welfare recipients in particular, form another group that faces effectively high marginal tax rates. The means-testing that limits participation in many social welfare programs can generate high marginal effective tax rates, sometimes in excess of 100 percent, effectively trapping participants into continued government dependency. Many "safety nets", hence, defeat their purpose of providing temporary relief, and instead nurture life-long dependency on the government. The economy is deprived of potential productive capacity and the need for tax revenues is increased, in addition to any harm done to the individuals.
Concerns about growing welfare dependency and increasing caseloads have put welfare reforms on the top of the political agenda in many countries, including the U.S. (Keane [1995]). The principal objective of current welfare reform is to reduce welfare dependency and thereby the costs of welfare. Without reducing the high marginal effective tax rates that welfare recipients face and thereby giving them financial incentives to work and save, no reform is likely to be successful. Before adjusting marginal effective tax rates, it is important to know how high these rates actually are as well as how common high tax rates are.
In general, previous studies estimating marginal tax rates can be divided into three categories. The first category estimates the average marginal tax rate for an aggregate household over time (e.g., Joines [1981], Seater [1982], and Barro and Sahaskul [1983]). These studies focus on federal income taxes and omit other types of taxes and effects due to the means-testing nature of many welfare programs. The second category calculates benefit-reduction rates for welfare recipients but omits all forms of taxes (e.g., Lurie [1974], Hutchen [1978], Moffitt [1979], and Fraker et al. [1985]). The third category includes income taxes and effects of means-tested transfers in their estimates, but estimates marginal tax rates for welfare recipients only (e.g., Moffitt and Keane [1991], and Dickert et al. [1995]). Typically, these studies include only a few welfare programs, however.
The only study, to my knowledge, that includes a wide range of transfers and income taxes, and computes marginal effective tax rates over the whole income distribution is Browning and Johnson [1979]. Specifically, they divide the population into deciles and estimating the marginal effective tax rates for each decile. Their marginal effective tax rates are defined as the change in net tax payments over the change in gross income when a household moves from the mean in one decile to the mean of the following decile. Hence, their marginal effective tax rates do not measure the change in net tax payments when income increases by a unit.
In the second chapter in this thesis, How High are Marginal Effective Tax Rates in the U.S.?, I estimate marginal effective tax rates in the U.S. across the whole income distribution, separately for each of the three filing statuses (single households, married households, and heads of household). More specifically, I calculate the overall tax wedges on income from labor and unearned income that arise due to fiscal policies. These include any tax or public transfer programs that affect the amount of potential consumption obtainable from a unit increase in gross income. Marginal effective tax rates calculated in this way are of special interest because they can be modified directly by governments.
My measure differs from most earlier studies of marginal effective tax rates that focus on taxes and/or transfers that statutorily affect households. Often they have only calculated tax rates for part of the income distribution (Fraker et al. [1985], Moffitt and Keane [1991], and Dickert et al. [1995]), whereas I calculate them for the whole income distribution. Moreover, earlier studies include only one or possibly a few of the taxes and public transfers that affect a household’s potential consumption (Lurie [1974], Hutchens [1978], Moffitt [1979], and Fraker et al. 1985], Dickert et al. [1995]), while I include AFDC (Aid to Families with Dependent Children), Food Stamps, Medicaid, Lunch Subsidies, Energy Assistance, and various forms of public housing and rent subsidies. In addition, earlier studies do not distinguish households by tax status type (Browning and Johnson [1979], Joines [1981], Seater [1982], and Barro and Sahasakul [1983]), whereas I do. The latter is important because tax codes vary by tax status and the amounts of transfers received differ by household type.
My estimates are based on a cross-sectional sample from 1990 of roughly 55,000 U.S. households, the Current Population Survey (CPS). The CPS is designed to be nationally representative of the overall U.S. population. Most of the effort in the paper is devoted to estimating the marginal transfer effect, that is the rate at which transfers are lost when a household’s gross income increases by a unit. Specifically, I estimate marginal transfer effects by regressing the net value of actual transfers received by households on earned and unearned income. The net value of a transfer is defined as the consumption value of the transfer. To obtain the net value of transfers, the net value of in-kind transfers are evaluated. Marginal tax rates are then added to marginal transfer effects to obtain overall marginal effective tax rates.
The resulting marginal effective tax rates are extremely high for low-income households, and for low-income heads of household in particular. For instance, 30 percent of all heads of household face a rate on earned income well above 100 percent. The rates on unearned income are even higher. Concerns about disincentive effects of the current welfare system appear well founded. The rates for wealthier households are considerably lower. For instance, the highest rate on earned income is about 50 percent, and the highest rate on unearned income is about 40 percent for the wealthiest households.
1.3 Welfare Dependency
In the second chapter of this thesis, How High are Marginal Effective Tax Rates in the U.S.?, I document high marginal effective tax rates for welfare recipients, in some cases more than 100 percent. Because statutory tax rates are low for this group, most of the effective taxes consist of reductions in benefits that occur with increases in income. Clearly, if welfare reforms are to be successful, the rate of benefit reduction must be lowered. Lowering the marginal effective tax rates for welfare recipients, however, can be costly. If marginal tax rates are lowered for one group they may need to be increased for other groups to offset the loss in revenues. Given possible differences in productivity, it is unclear whether decreasing the marginal incentive for high-wage earners to work, in order to increase the incentive for welfare recipients to enter the labor force, is worthwhile from a social standpoint.
This leads to the second issue I address, namely what is the optimal benefit-reduction rate? The third chapter, The Optimal Benefit-Reduction Rate, develops a model of optimal income taxation and derives bounds for the optimal benefit-reduction rate, that is the rate at which welfare recipients lose welfare payments when their earned income increases by one unit. This paper derives bounds for the optimal benefit-reduction rate, the optimal marginal tax rate on labor income, and the optimal guarantee level using an expanded Sheshinski [1972] optimal income tax model. Specifically, Sheshinski’s [1972] model is augmented with the benefit-reduction rate.
Previous studies estimating the optimal benefit-reduction rate are scarce. Closely related is the optimal income taxation literature that started with Mirrlees’ [1971] seminal paper, however. The essential features of optimal income taxation models a’la Mirrlees are a social welfare function, the individuals’ utility function, the ability distribution, and a fiscal side where the government sets a tax function. In particular, the social welfare function captures society’s values about equity and fairness. The most critical feature of optimal income taxation models is the individual utility function that models how an individual’s behavior (in particular, how much labor to supply) depends on the tax system that he faces. It is assumed that individuals differ only in their ability to earn income and that the labor supplied by different individuals differs in productivity according to the ability of the individual. That is, an individual with more ability earns a higher wage than an individual with less ability. The last essential feature of the optimal income taxation models is the fiscal side that redistributes by choosing a tax function. The above features of the optimal income taxation models make these models well suited to study problems like the optimal benefit-reduction rate.
Without assuming a specific form of utility and ability distribution, Mirrlees was limited to only a few analytical conclusions. Importantly, though, he was able to demonstrate that the optimal marginal tax rate lies inclusively between 0 and 100 percent and the marginal tax rates for the very highest-income household and the very lowest-income household are zero. Further conclusions, however, rest on assumptions about the utility function and the ability distribution. When assuming a utilitarian social utility function, a log-linear utility function for consumption and leisure, and an ability distribution based on data from the U.K., Mirrlees concludes that the optimal tax schedule is approximately linear, not progressive throughout the income distribution, and with rather low tax rates.
Subsequent work has investigated the sensitivity of the optimal tax system to the parameters of the model. Atkinson [1973], for example, uses a Rawlsian social utility function to give more weight to the utility of the poor. He finds that the tax rates are no longer so low and depart from linearity. Stern [1976] explores how sensitive the results are to the choice of the individual utility function, and finds that the degree of labor supply responsiveness implied by a log-linear utility function is excessive and overstates the costs of increasing tax progressivity.
Sheshinski [1972] restrictes the Mirrlees model to be linear, and finds that the optimal linear income schedule is associated with a positive lump-sum at zero income and that the optimal marginal tax rate is bounded from above by a fraction that decreases with the lowest elasticity of the labor supply. His results are independent of the choice of utility function and ability distribution.
I generalize Sheshinski’s model from a linear tax to a tax schedule that is kinked with two linear segments. One segment is for welfare recipients, who face both marginal benefit-reduction and marginal tax rates, and the other is for welfare non-recipients, who face only a marginal tax rate. I believe that this generalization is more consistent with actual tax schedules.
A closely related paper in the optimal income taxation literature is one by Slemrod et al. [1994]. They also generalize Sheshinski’s model to a tax schedule with two linear segments. Specifically, one segment applies to income under certain levels and the other to marginal tax rates above these levels. They do not model the rate at which the subsidy should be reduced when income increases, however. Another difference between my study and Slemrod et al. is that they use a simulation technique and hence make assumptions about parameters, while I do not.
I find that the optimal tax problem is associated with a guarantee level at zero income that is reduced at the benefit-reduction rate when welfare recipients earn income. The upper bound of the benefit-reduction rate depends negatively on the magnitude of the optimal marginal tax rate on labor income, and either negatively on the average labor supply elasticity with respect to changes in net income of welfare recipients or positively on the average labor supply elasticity with respect to changes in the benefit-reduction rate of welfare non-recipients. The optimal benefit-reduction rate can be negative, in which case welfare recipients should be given a subsidy for working.
1.4 Government Size and Growth
The second and third chapters suggest that fiscal policy can have negative impacts on the economic behavior of individuals. Government interventions may also have far-reaching ramifications for the overall economy. The Public Choice School of Thought, for instance, often criticizes the government for weakening economic performance. Economic theory, however, is ambiguous on whether these government interventions affect economic performance negatively or positively. Clearly, some interventions may promote growth while others may discourage it. To date, empirical studies of the relationship between government spending and economic performance have found mixed results.
Now, when government interventions cover such a large share of the economy, improving our understanding of the relationship between government size and economic growth may be more valuable than ever. The fourth chapter in this thesis, Government Size and Growth: An Empirical Study of 21 OECD Countries, is an attempt to explore how the size of government affects economic growth. To do this I estimate the relationship between the size of government and the growth rate in the OECD countries from 1973 to 1992 using multivariate statistical techniques.
A large number of studies have estimated this relationship previously, though there has been no consensus in the results. Some of the studies conclude that government size has a negative impact on the growth rate (e.g., Landau [1983] and Barro [1991]). Other studies, though, find support for a positive relationship (e.g., Ram [1986] and Devarajan et al. [1996]). A sizable number of studies, however, have been unable to find a statistically significant relationship between the two variables (e.g., Kormendi and Meguire [1985]). Levine and Renelt [1992], for instance, state that they "could not find a robust cross-section relationship between a diverse collection of fiscal-policy indicators and growth".
The present study differs from earlier studies in a number of important ways which may help to explain the discrepancies in earlier results. The first difference is that I use a fixed effects estimation model that controls for country- and time- specific effects. Earlier studies have typically used cross-section data and thereby been unable to control for such important factors as, for example, culture, history, and environmental differences that likely influence the growth rate. When these factors are correlated with the regressors, the cross-section parameter estimates are biased.
The second difference is that I, in contrast to many previous studies that include both developed and developing countries, limit my sample to OECD countries. It has been shown that results are sensitive to the countries included, and limitation to the relatively homogenous OECD members helps to reduce some of the potential estimation biases.
The third difference is that my measure of government size is more inclusive in the sense that interest payments on pre-existing debt are included. Interest payments on debt are a substantial component of government spending and, thus, may have an impact on economic performance. I also define the various spending categories, that make up total government size, differently. Typically, definitions of the spending categories have been based on the expected impact, negative or positive, that various expenditures, a priori, will have on the growth rate. My definitions fit better with economic theory, however. That is, I define as public goods those spending categories that are associated with zero, or low, marginal cost and where it is costly to exclude individuals from consumption. Government spending categories that could be provided equally well privately are defined as transfers.
Two results are noteworthy. First, fixed effects estimation yields different results than traditional estimation techniques. This may suggest that omitted variables bias the results of most earlier studies. Second, government spending seems to affect the growth rate negatively, ceteris paribus.
1.5 The Laffer Hypothesis
Government involvement has increased dramatically over time, but can this upward trend continue indefinitely? Excessively high tax rates may generate sufficiently large disincentive effects, so that tax revenues may actually decline in response to tax rate increases. That is, if enough individuals reduce their work effort, the tax base can decline proportionally more than the tax rate rises. The Laffer Hypothesis [1979] implies that the upward trend in government interventions can not continue uninterruptedly forever. Eventually, a future increase in the tax rate required to finance increased government spending will actually decrease tax revenues available to fund government involvement, and hence limit the amount to which governments can intervene.
In the fifth chapter, Measurement of Transfers and Peaking of Fiscal Sizes of Government, we look at whether the upward trend in fiscal size of government has been broken and whether this break may be a result of limits to taxation. Specifically, we first compute theoretical limits to fiscal sizes in a Laffer curve model, and then compare maxima derived in the theoretical model to actual measures of fiscal sizes of government in 22 OECD countries.
To see whether fiscal sizes of government in some countries have been close to or above theoretical maxima, we start out by deriving maxima in a long-run Laffer curve model. Calculations of long-run theoretical limits are based on plausible assumptions about labor supply elasticities and parameterized to fit a stylized "average OECD economy". The model maximizes tax revenues, but for given levels of public goods, interest on debt, non-tax revenues, and deficit, it also calculates maximum levels of spending and transfers.
Results from the theoretical model suggest that fiscal sizes may reach a maximum when tax revenues equal approximately 70 percent of GNP, spending equals approximately 75 percent of GNP, and transfers equal approximately 60 percent of GNP.
To make comparisons between numbers derived in our theoretical model and actual data meaningful, we introduce a measure of fiscal size that is more consistent with our theoretical tax model. It can be misleading to compare numbers derived in our theoretical model with numbers reported in official budgets because countries use different accounting practices to transfer resources to households. The size of government will appear different, thus, even if households are effectively being given the same subsidies. Another drawback of numbers reported in official budgets is that they, in some instances, are for national government and, hence, miss expenditures made by subnational governments. This can also distort comparisons when government organization varies between countries. Our alternative measure makes comparisons across countries and time more informative.
Specifically, we add tax expenditures, that is subsidies to tax payers in the form of tax breaks, to and subtract indirect taxes paid on transfers from the traditional measure. The importance of including tax expenditures is easily illustrated. The government in a country that subsidizes households by giving them a tax break will "appear" smaller using the traditional measure than the government in an otherwise identical country which subsidizes the same households through transfers, though the inhabitants are equally well-off. This occurs because tax expenditures do not show up in official budgets as expenditures but rather as a reduction in tax revenues.
Another difference between our measure and traditional measures is that we net out indirect taxes obtained from transfer payments. The reason we net out indirect taxes is that households typically pay sales and value-added taxes on cash and equivalent transfers, gross transfers as reported in official budgets, hence, overstate the net claim to private goods provided by government. Our measure, instead, defines transfers as net claims to private goods that arise directly from government fiscal decisions about budgeted spending and its funding, which is more consistent with the definition of transfers used in the theoretical model.
Looking at actual data, we find that most countries have experienced breaks in the upward trend of government size. Breaks typically occurred when tax revenues reached approximately 70 percent of GNP, spending approximately 75 percent of GNP, and transfers approximately 60 percent of GNP - levels remarkably close to theoretical limits. Fiscal sizes have exceeded these levels only for periods of a few years around times of peak fiscal sizes. Moreover, we find that the countries with the greatest peak fiscal sizes experienced the largest declines in fiscal sizes. This may suggest that some combination of political and economic forces may make it difficult for a country to sustain tax revenues above 70 percent of GNP, transfers above 60 percent of GNP, and spending above 75 percent of GNP. Thus, theoretical limits to raising tax revenues may help explain the downward trend of fiscal sizes in countries with large fiscal sizes.
1.6 Summary of Findings
Among the major findings in this thesis are:
References
Atkinson, A.B., 1973, "Maxmin and the Optimal Income Taxation," Discussion Paper no. 47, Department of Economics, University of Essex.
Barro, R., 1991, "Economic Growth in a Cross-Section of Countries," Quarterly Journal of Economics, 106(2), 407-443.
Barro, R. and C. Sahasakul, 1983, "Measuring the Average Marginal Tax Rates from the Individual Income Tax," Journal of Business, 56(4), 419-452.
Borcherding, T., 1985, "The Causes of Government Expenditure Growth: A Survey of the U.S. Evidence," Journal of Public Economics, 28, 359-382.
Browning, E. and W. Johnson, 1979, "The Distribution of the Tax Burden," American Enterprise Institute for Public Policy Research, Washington D.C.
Devarajan, S., Swaroop, V. and H. Zou, 1996, "The Composition of Public Expenditure and Economic Growth, "Journal of Monetary Economics, 37, 313-344.
Dickert, S., Houser, S. and J. K. Scholz, 1995, "The Earned Income Tax Credit and Transfer Programs: A Study of Labor Market and Program Participation," Tax Policy and the Economy, NBER, 19, 1-50.
Fraker, T., Moffitt, R. and D. Wolf, 1985, "Effective Tax Rates and Guarantees in the AFDC Programs, 1967-1982, " Journal of Human Resources, 20(2), 251-263.
Fullerton, D., 1982, "On the possibilities of an Inverse Relationship Between Tax Rates and Government Revenues," Journal of Public Economics, 19, 3-22.
Hutchens, R., 1978, "Changes in AFDC Tax Rates 1967-1971," Journal of Human Resources, 13(1), 60-74.
Joines, D., 1981, "Estimates of the Effective Marginal Tax Rates on Factor Income," Journal of Business, 54(2), 191-226.
Karras, G., 1997, "On the Optimal Government Size in Europe: Theory and Empirical Evidence," The Manchester School, 65(3), 280-294.
Keane, M., 1995, "A New Idea for Welfare Reform," Federal Reserve Bank of Minneapolis Quarterly Review, 19, 2-28.
¾ and R. Moffitt, 1991, "A Structural Model of Multiple Welfare Program Participation and Labor Supply," Brown University Department of Economics Working Paper 91-26.
Kormendi, R. and P. Meguire, 1985, "Macro Economic Determinants of Growth: Cross-Country Evidence," Journal of Monetary Economics, 16, 141-163.
Laffer, A., 1979, Statement prepared for the Joint Economic Committee, May 20, Reprinted in A. Laffer and J. Seymour, eds., The economics of the tax revolt: A reader, Harcourt, Brace, Jovanovich, New York, 35-42.
Landau, D., 1983, "Government Expenditures and Economic Growth: A Cross Country Study, Southern Economic Journal, 49, 783-792.
Levine, R. and D. Renelt, 1992, "A Sensitivity Analysis of Cross-Country Growth Regressions," American Economic Review, 84(4), 942-963.
Lindbeck, A., 1986, "Limits to the Welfare State," Challenge, 28(6), 31-36.
Lurie, I., 1974, "Estimates of Tax Rates in the AFDC Program," National Tax Journal, 27(1), 93-107.
Mirrless, J., 1971, "An Exploration in the Theory of Optimal Income Taxation, Review of Economic Studies, 38, 135-208.
Moffitt, R., 1979, "Cumulative Effective Tax Rates and Guarantees in Low-Income Transfer Programs," Journal of Human Resources, 14(1), 122-129.
North, D., 1985, "The Growth of Government in the United States: An Economic Historian’s Perspective," Journal of Public Economics, 28, 383-399.
Okun, A., 1975, Equality and Efficiency: The Big Trade-Off, Washington D.C., Brooking Institute.
Organisation of Economic Cooperation and Development, 1998, "Economic Outlook 1998, Paris.
Peltzman, S., 1980, "The Growth of Government," Journal of Law and Economics, 23, 209-287.
Ram, R., 1986, "Government Spending and Economic Growth: A New Framework and Some Evidence from Cross-Section and Time-Series Data, American Economic Review, 76(1), 191-203.
Riksskatteverket (RSV) (The National Tax Board), 1998, Tax Statistical Yearbook of Sweden 1998, Centraltryckeriet, Borås.
Rodrik, D., 1998, "Why Do More Open Economies Have Bigger Governments?," Journal of Political Economy, 106(5), 997-1032.
Seater, J., 1982, "Marginal Federal Personal and Corporate Income Tax Rates in the U.S., 1909-1975," Journal of Monetary Economics, 10(3), 361-381.
Slemrod, J., Yitzhaki, S., Mayshar, J. and M. Lundholm, 1994, "The Optimal Two-Bracket Linear Income Tax," Journal of Public Economics, 53, 269-290.
Stern, N.H., 1976, "On the Specification of Models of Optimum Income Taxation," Public Economics, 6, 123-162.
Tanzi, V. and L. Schuknecht, 1997(a), "Reforming Government: An Overview of Recent Experiencies," European Journal of Political Economy, 13, 395-417.
¾ , 1997(b), "Reconsidering the Fiscal Role of Government," American Economic Review, 87(2), 164-168.
Tullock, G., 1976, The Vote Median, The institute of Economic Affairs.
Chapter 2
How High Are Marginal Effective Tax Rates in the U.S.?
2.1 Introduction
Economists believe widely that marginal tax rates affect the economic behavior of individual agents by altering their incentives. For instance, marginal tax rates are often thought to be related negatively to the amount of labor a person will supply to the market and the amount of earnings that he or she will save and invest. Predictably, the wealthiest - who typically face the highest statutory tax rates - have expressed great concern about the deleterious effects of high marginal tax rates. The poor, welfare recipients in particular, also face high marginal tax rates, however, because the benefit entitlements in many welfare programs decrease when a recipient’s income increases. Clearly, thus, it is important to include all sources of taxation, not just statutory tax rates, when effective marginal tax rates are determined. Because high marginal tax rates may affect the production and investment decisions of such a broad group of people (the rich, the poor, and many in between), far-reaching ramifications for the overall economy are likely.
In this paper, I estimate the structure of effective marginal tax rates over the distribution of incomes in the U.S. using data from the 1990 Current Population Survey (CPS). Specifically, I calculate schedules of marginal effective tax rates (henceforth METRs) for both earned and unearned income across the income distribution for a typical representative household of each of the three U.S. filing statuses: head of household, single household, and married household. METRs are measured as the change in net payments to government from all taxes (personal income, payroll, and indirect taxes) and means-tested welfare programs that result from a unit rise in a household’s earned and unearned income, respectively. These measures are particularly interesting at a time when numerous countries worldwide are struggling to reform their welfare systems, often with the aim of weaning welfare recipients off welfare. To be effective, however, high marginal tax rates that discourage welfare recipients from working must be lowered. Naturally, then, it is important to understand how high these rates actually are.
The work here differs from previous studies of METRs in several ways. First, I estimate entire METR schedules for both earned and unearned income for 1990, taking account of filing statuses. Second, most previous studies of METRs failed to include many forms of public welfare and instead measured only Aid to Families with Dependent Children (AFDC) and possibly Food Stamps. I include the CPS Public Assistance Income variable - of which AFDC is only one component - to measure cash transfers and Food Stamps, Medicaid, public housing, rent subsidies, Lunch Subsidies, and Energy Assistance to measure in-kind transfers. Moreover, I include the Earned Income Tax Credit. Third, I include only the net value of transfers because indirect tax receipts induced by transfers are returned to the government and hence not available for household consumption. Fourth, I measure overall tax wedges on labor income and unearned income, whereas other studies focused only on taxes and/or transfers that statutorily affect households. Finally, I use regression techniques, whereas most other studies have used tax statutes to estimate the rates.
The results suggest that METRs are indeed very high for many heads of household. For instance, 30 percent of all heads of household face estimated rates of 100 percent or greater on earned income and even higher rates on unearned income. The rates for the other filing statuses are considerably lower.
The following section discusses the method used and the data. Section 2.3 describes how I estimate the change in welfare benefits induced by additional income, that is marginal transfer effects. In section 2.4 marginal tax rates are calculated. In section 2.5, I combine the estimates of the marginal transfer effects from section 2.3 with marginal tax rates from section 2.4 to generate estimates of METRs. Section 2.6 concludes the paper and then discusses the implication for welfare policy.
2.2 Method and Data
Effective marginal tax rates, strictly defined, measure how the potential consumption of a household changes when its income increases by a unit. This includes all factors that affect consumption or consumption possibilities and are due to an increase in income, for example fringe benefits, insurance coverage, the cost of day care, and the cost of commuting. Unfortunately, data are not available to support such exact calculations. Consequently, I focus only on fiscal policy factors that affect a household’s incentives to work and earn unearned income (accumulate wealth).
I estimate the METR as the sum of the marginal tax rate, defined as the increase in all tax payments resulting from a unit rise in income, and the marginal transfer effect, defined as the decline in consumption obtained from transfer receipts resulting from a unit rise in income. One minus the METR, thus, equals the net change in consumption or consumption possibility (henceforth referred to as consumption) due to a unit rise in income induced by fiscal policies. Marginal tax rates are calculated for each household from the statutory tax rates. Calculating marginal transfer effects is more difficult so much of this paper will be devoted to this end.
There are two kinds of METRs, one for earned income and one for unearned income. It is important to calculate them separately because the associated tax and welfare incentives differ. For instance, personal income taxes, payroll taxes, and indirect taxes all affect a taxpayer’s net earned income, while only personal income and indirect taxes affect a taxpayer’s incentives to earn unearned income. Similarly, means-tested transfers depend on earned as well as unearned income. This implies that increases in earned and unearned income generally have different effects on transfers.
There are two approaches to calculating marginal transfer effects. The first approach estimates marginal transfer effects from underlying statutes while the second focuses on the relationship between actual transfer payments and different income levels. If policy is exogenous, the first approach is preferable theoretically because statutes codify policy. However, there are three problems with using statutes that argue in favor of the second approach. First, eligibility rules are complicated and the amounts paid by different programs are often interdependent. So the simple sum of benefit-reduction rates for different programs does not equal the overall benefit-reduction rate. Indeed, Fraker et al. [1985] found that such interactions can be substantial. Second, transfer statutes differ across states as well as across local jurisdictions. Using statutes to estimate METRs for typical households therefore requires assumptions about the "typical statute" across jurisdictions. Furthermore, not all eligible households apply for benefits, and those that do may fail to maximize transfer receipts. The amount of transfers a typical household receives, thus, is likely to differ considerably from the statutory maximum.
To avoid these difficulties, I adopt the second approach. Specifically, I estimate marginal transfer effects by regressing net values of actual transfers received by households on earned and unearned income as well as other demographic characteristics. I do this separately for different income groupings, a procedure often referred to as piecewise regression. The coefficient associated with the earned income variable is the marginal transfer effect of labor income, and the coefficient associated with the unearned income variable is the marginal transfer effect of unearned income.
The downside of the regression approach is that it makes it harder to study how policy changes affect METRs, and hence individuals' incentives. The primary goal of this paper is to study the size of METRs and not how policy changes affect METRs, however, so this may not be a crucial problem.
METRs vary across income levels and must consequently be estimated separately for different income ranges. I estimate METRs for each decile for each of the three filing statuses. These deciles are defined by ordering the data according to total gross income, that is earned plus unearned income, for each filing status and dividing it into ten equally sized parts. The first decile contains the lowest income and the tenth decile the highest.
Using deciles achieves a good balance between a number of important interests. For instance, they are small enough to combine only parts of the income distribution that one may reasonably expect to be similar, yet they are big enough to estimate the parameters with some measure of precision. Moreover, there is a preponderance of households with little or no income in the lowest deciles, so many METRs either can not be computed over finer income ranges or can be computed only with substantial measurement error.
The downside to using deciles, however, is the difficulty of comparing METRs across the three filing statuses. Because the deciles are defined across different income ranges, direct comparison can be misleading. To deal with this, I also estimate METRs for the different filing statuses using income ranges that are identical across filing statuses. These intervals are defined as deciles for all household types pooled together. I refer to these groups, henceforth, as pooled status deciles. Because the number of households used in calculating any given METR can vary widely, this approach says nothing about how common particular rates actually are.
To estimate the model, I have compiled a data set that consists primarily of 1990 individual-level extracts derived from the Current Population Survey (CPS) March Annual Demographic files (Moffitt [1995]). The sample consists of 21,330 single households, 29,301 married households, and 4,671 heads of household. The CPS provides information about each household and the individuals in the households. There are data covering monthly labor force participation, work experience, migration, income, Public Assistance Income, and in-kind assistance from Food Stamps, School-Lunch Programs, rent subsidies, public housing, Medicaid, and Energy Assistance. The employment and income data refer to 1989 and demographic data to 1990.
Because one of my goals is to measure incentives to work, I limit my focus to households where the individuals are either in the labor force or at least potentially in the labor force. Excluded, thus, are households in which the head is not between 16 and 64 and households that report receiving social security or other retirement benefits. In addition, I exclude households that report negative capital income or self-employment incomes to avoid the risk of including otherwise wealthy individuals who happened to suffer a "bad year" among the truly poor. Furthermore, I cannot model the tax implications of such losses with only one year's data because they occur across multiple years. Negative capital or self-employment income was reported by 2.7 percent of all households.
Summary household statistics are shown in Tables A, B, C, and D in appendix A for all households pooled together, heads of household, single households, and married households, respectively. The average earned and unearned incomes of the different deciles, as well as the average amounts of transfer payments received and labor supplied, vary widely between the different filing statuses. Heads of household, for example, work the least, earn the least, and receive the most transfers while married households work and earn the most. Single households receive the smallest transfer payments.
2.3 Estimation of Marginal Transfer Effects
I estimate marginal transfer effects for each decile by regressing the net value of transfers to recipients on earned and unearned income, piecewise by decile. Earned income consists of wages, salaries, self-employment income, and farm income, while unearned income consists of income from dividends, rent, and interest. For each decile i = 1, 2, ..., 10, Tij is defined as the net value of transfers received by household j and Lij and Kij as household j’s earned and unearned incomes, respectively. One can then express the regression equation as
, (1)
where Dij is a vector of dummy variables for household j in decile i that consists of state of residence, at least one dependent, at least two dependents, and at least three dependents. D is included because benefit levels often depend on the number of children and state of residence. a i is the intercept for Wyoming residents with no dependents. b i and g i are parameters that measure the marginal transfer effects from earned income and unearned income, respectively, and the elements of d i the effects of the demographic characteristics. e ij is a disturbance term, assumed to be uncorrelated with the regressors.
Because I calculate the overall "transfer" wedge between what a transfer costs and the consumption a welfare recipient derives from the transfer, I define the dependent variable, Tij, as the net value of in-kind and cash transfers to recipients and not just the dollar amount of transfer. The gross value of an in-kind transfer can be less than the dollar amount if the transfer alters the recipient’s consumption of the transferred good, typically by increasing consumption. The differences between the gross and face values of in-kind transfers depend on the physical income and substitution effects (henceforth physical is suppressed). If there is no substitution effect – that is, there is solely an income effect – in-kind transfers are identical to cash transfers. This is the case when the recipient consumes more or the same amount of the transferred good before being given the transfer. If there is a substitution effect, an in-kind transfer is worth less than an equal amount of cash. This is the case when the recipient consumes more of the transferred good after receiving the transfer. The gross value is, thus, defined as the monetary value of the transfer to the recipient as opposed to the face value, which is the dollar amount given.
The value of a transfer can be further reduced if the transfer leads to increased indirect tax payments to the government. Namely, cash transfers are indirectly subject to consumption taxes in the same way that earned and unearned cash incomes are. That is, indirect taxes must be paid when the recipient purchases goods with transferred cash. Even transfers given in-kind can effectively be subjected to indirect taxes if they cause resources freed up to be spent on items subject to indirect taxes. To convert gross transfer values to their net value, I net out indirect taxes by multiplying the gross value of all transfers by (1-ti), where ti is the indirect tax rate. The net value of a transfer is, thus, defined as the consumption value of a transfer. Because not all consumption is subject to indirect taxes, I calculate the indirect tax rate as indirect tax revenues divided by disposable income. The indirect tax rate is discussed in more detail in section 2.4.
Below I discuss how I determine the gross value of each of the included in-kind transfers, starting with Food Stamps.
Food Stamps
The federally funded Food Stamp program is designed to help low-income households maintain a more nutritious diet. The program provides eligible households with coupons that can be used to purchase food. The dollar amount of program benefits is reported in the March CPS for survey participants.
I assume that recipients value Food Stamps at the full dollar amount for several reasons. First, most Food Stamp recipients spend more on food than they receive in Food Stamps and hence the receipt of Food Stamps does not alter their food consumption (Moffitt [1989] and Fraker [1990]). Second, there is consensus in the literature that the value of Food Stamps to recipients is equal to or close to the dollar amount of the transfer. Finally, where Food-Stamp transfers do alter food consumption, the transfer can often be converted into cash by trafficking so the value of Food Stamps is bounded from below by the black market price.
Lunch Subsidies
The National School Lunch Program is designed to help protect the well-being and health of children in the U.S. by subsidizing school meals for children. All students eating lunches prepared at participating schools pay less than the total cost. Students with a demonstrated need, moreover, pay even less of the cost or receive their meals free.
The value of school lunches to recipients can, in principle, be larger or smaller than the dollar amount of the transfer given. The value of school lunches to recipient’s families may exceed the actual dollar amount of the transfer if the cost to the family of providing lunches exceeds that of the government, for instance if the government utilizes economies of scale. On the other hand, some parents may spend considerably less on their children’s school lunches than the dollar amount of the transfer. In such cases the value of the transfer would be less than the dollar amount of the transfer. For simplicity, I assume that the value to the recipient equals the dollar amount of the transfer.
The March CPS survey collects data on the number of households receiving school lunches, but unfortunately gives no information on the dollar amount or value of the transfer. I assume that the gross value of the School Lunch Program to each participating household equals the dollar amount of the annual transfer received by each child as reported by the Department of Agriculture. The annual transfer is broken down by whether the lunch is bought at a reduced price or is free. In addition, the annual transfer varies by income and region.
Housing Subsidies
Several federal, state, and local agencies exist to monitor housing conditions and to remedy acute shortages of decent, safe, and sanitary housing for low-income households. I have included the two most important programs: Low Rent Public Housing and rent supplement plans.
If an in-kind housing transfer distorts the consumption of housing, then the value of the transfer is worth less to the recipient than an equal amount of cash. Estimating the extent to which housing subsidies alter recipient’s behavior is difficult, so I assume that the value of housing subsidies is equal to the rent amount. This simplification has little bearing on the results, however (see appendix B).
The March CPS survey records whether each household receives a housing transfer, though again not the dollar amount or the value. Fortunately, the Bureau of Census has developed an approach to determining the value of housing transfers (see U.S. Bureau of Census [1990] for details). The value of the transfer is the difference between the actual gross rent (rent plus utilities) paid by families in subsidized housing and the rent these families would have been expected to pay had their units not been subsidized. The Bureau of Census does not collect data on housing cost, so the calculations of housing subsidies are based on the 1985 American Housing Survey.
The magnitude and the value of the transfer depend on the region in which the recipient resides, the number of dependents, and income. I assume that the values of rent subsidies and public housing are the same across households living in a common region, having the same number of children, and the same income.
Energy Assistance
The Low Income Home Energy Assistance Program is a federal block grant, distributed through the states, that assists eligible low-income households in meeting their home energy needs. The assistance can be used for heating, cooling, energy crisis interventions, and other related home repairs. Payments may be made directly to eligible households as cash or vouchers or indirectly through home energy suppliers.
It is difficult to know how much Energy Assistance distorts recipients’ energy consumption without detailed examination of recipient behavior. I assume that the value of the transfer is valued at its full dollar amount, and sensitivity analyses suggest that this assumption is innocuous (see appendix B). The dollar amount reported by the March CPS is, thus, treated as the value to the recipient.
Medicaid
The Medicaid program provides medical assistance to needy families with dependent children, and to the aged, blind, or disabled whose incomes and resources are insufficient to cover the cost of necessary medical service. The CPS reports whether individuals are covered by Medicaid at any time during the previous calendar year, but again not the value of coverage. I assume that individuals covered at any time during the year are covered for the entire year. The value of Medicaid coverage to the recipient depends on health level, income level, and household size, though the full mean government outlay per family is likely to overestimate the value to the recipient because most households would consume less medical care without Medicaid. That is, Medicaid has a positive substitution effect. I use the Bureau of Census estimates of the value of Medicaid coverage which is based on the fungible value of the benefit.
The Bureau of Census has developed a model to estimate the fungible value of Medicaid(see U.S. Bureau of Census [1990] for details). This value is assumed to be zero if the household is either unable to meet basic food and housing requirements or has no remaining resources after it meets basic food and housing requirements, since these households are unlikely to consume Medical insurance without the transfer. That is, Medicaid coverage would have no income effect because it does not free resources for other consumption and, hence, offers no value to the recipient. Medicaid benefits have full value, defined as mean government outlay for families in a given risk class, if the resources of the family after deducting amounts required for basic food and housing costs are as great or greater than the mean Medicaid outlays for families in the same risk class. That is, while these households have the means to provide for their own medical insurance, the transfer enables them to finance other consumption. Medicaid benefits have partial value if the resources of the family are higher than the level specified in the first case but lower than the mean Medicaid outlays for families in the same risk class.
Moffitt and Wolfe [1991] estimated the value of Medicaid coverage based on the expected utility of coverage. Expected utility differs substantially across the population, so they included characteristics that may be influential such as health, education, number of children, and race, though their study fails to allow for differences between states. They estimated the value for a single mother with children to be $1,053. Using the Bureau of Census approach, the value is $1,007 for a single mother above the poverty threshold, and $619 for a single mother at or below the poverty threshold. The Bureau of Census values are smaller than Moffitt and Wolfe’s, and substantially smaller than the mean government outlay per recipient of $1,242 for non-disabled individuals between the age of 21 and 64. Thus, using the Bureau of Census numbers will likely underestimate marginal transfer effects. To examine the sensitivity of the results, I estimate marginal transfer effects using mean outlays per recipient as well.
Cash Transfers
The March CPS reports the dollar amount of public assistance received. It includes AFDC as well as general assistance. The value of public assistance is obviously equal to its dollar amount.
Calculating Marginal Transfer Effects
After gross values of in-kind transfers have been evaluated, the net value of total transfers can be determined by netting out indirect taxes. The total net value of transfers received by each household is then calculated by adding up all transfers that the household receives. Marginal transfer effects can then be estimated using equation (1).
Because the first two deciles of heads of household possess no earned or unearned income, and it is thus impossible to regress transfer payments on income, they are grouped together with decile three when estimating marginal transfer effects. The estimated marginal transfer effects for the first three deciles of heads of household are consequently identical.
The approach used to estimate marginal transfer effects requires imputations that may lead to errors. To determine the size of these errors, I conduct sensitivity analyses. The results suggest that any errors are minor (presented in appendix B).
The suitability of the regression specification is also evaluated. For instance, I replace the linear with a quadratic relationship between transfer receipts and income and I allow for an interaction term between state of residence and income. The latter takes into account the fact that different states may have different benefit rules and hence different marginal transfer effects. The results were not particularly sensitive to either of these assumptions.
At the lower end of the income distribution, however, regression may be a better approach for calculating marginal transfer effects on earned income than on unearned. Indeed, there are few observations in this range with both unearned income and transfer receipts.
Marginal Transfer Effect Estimates
I report estimates of the marginal transfer effects on earned and unearned income for the three filing statuses and for all households pooled together in Tables 1 and 2. Table 1 displays marginal transfer effects calculated using deciles and Table 2 displays marginal transfer effects estimated using the pooled status deciles.
The marginal transfer effects differ considerably by source of income and filing status. Those on unearned income are generally larger than those on earned income. That is, the welfare benefit reduction is apparently greater for increases in unearned income than for increases in earned income. In fact, marginal transfer effects on unearned income are exceptionally high, well over 100 percent for many lower income brackets, perhaps because the means-testing of transfers depends on a households’ wealth as well as unearned income. As increases in household wealth lead to reduced transfer payments, marginal transfer effects on unearned income may be inflated because they reflect changes in the return to unearned income rather than changes in actual wealth. Moreover, the marginal transfer effects on unearned income may be higher than on earned income because many transfer programs exempt some earned income or provide deductions from earned income for work expenses, while no exemptions or deductions are made on unearned income. These deductions and exclusions create lower marginal transfer effects on earned income than on unearned income.
Table 1.
Marginal Transfer Effects in percent on Earned and Unearned Income for deciles
|
Decile |
Heads of household |
Single Households |
Married Households |
|||
|
MTEL |
MTEK |
MTEL |
MTEK |
MTEL |
MTEK |
|
|
1 |
132.4* (5.84) |
681.2* (6.90) |
55.7 (2.38) |
209.7 (6.86) |
25.5 (20.45) |
32.6 (6.08) |
|
2 |
2.9 (0.63) |
22.1 (3.56) |
4.0 (4.46) |
5.6 (3.53) |
||
|
3 |
-0.7 (0.33) |
2.6 (1.13) |
1.8 (1.37) |
2.0 (1.17) |
||
|
4 |
45.1 (4.08) |
175.9 (4.35) |
1.2 (0.88) |
4.0 (2.65) |
0.6 (1.95) |
0.6 (1.53) |
|
5 |
24.9 (3.47) |
28.3 (1.94) |
-0.7 (1.12) |
-0.07 (0.10) |
-0.2 (0.31) |
0.04 (0.06) |
|
6 |
22.1 (4.05) |
36.4 (3.73) |
-0.05 (0.12) |
0.1 (0.29) |
-0.1 (0.17) |
0.07 (0.09) |
|
7 |
-1.9 (0.33) |
-9.4 (1.14) |
-0.07 (0.13) |
0.2 (0.34) |
0.06 (0.30) |
-0.3 (1.18) |
|
8 |
1.0 (1.02) |
2.8 (1.11) |
0.3 (0.92) |
0.5 (1.44) |
0.03 (1.10) |
0.04 (1.28) |
|
9 |
1.2 (0.81) |
-0.2 (0.07) |
0.1 (0.68) |
0.1 (0.61) |
-0.003 (0.08) |
-0.01 (0.28) |
|
10 |
-0.1 (1.64) |
0.05 (0.38) |
0.002 (0.39) |
0.006 (0.78) |
-0.01 (1.04) |
0.006 (0.49) |
Notes: Numbers in parentheses are t-statistics.
* The first three deciles of heads of household are grouped together
when estimating the marginal transfer effect.
The first decile of heads of household face the highest marginal transfer effects, followed by single households. Married households face the lowest marginal transfer effects. Indeed, the lowest-income heads of household face extremely high marginal transfer effects, 132 percent on earned income and 681 percent on unearned income. This means that the lowest-income heads of household will lose $1.32 in transfer payments if they increase their earned income by a dollar and $6.81 if they increase their unearned income by a dollar. The corresponding numbers are 56 and 210 percent for the lowest-income single
Table 2.
Marginal Transfer Effects in percent on Earned and Unearned Income
for pooled status deciles
|
Gross |
All Households |
Heads of household |
Single Households |
Married Households |
||||
|
Income |
MTEL |
MTEK |
MTEL |
MTEK |
MTEL |
MTEK |
MTEL |
MTEK |
|
0 - 2,417 |
107.6 (14.01) |
186.1 (13.38) |
122.4 (6.61) |
677.2 (6.87) |
42.4 (7.00) |
82.3 (8.35) |
135.3 (4.17) |
182.0 (3.25) |
|
2,417 – 8,200 |
6.5 (4.41) |
20.2 (8.90) |
29.9 (5.03) |
129.1 (5.11) |
2.1 (2.42) |
6.5 (5.4) |
36.9 (5.23) |
43.9 (3.17) |
|
8,200 - 14,100 |
4.6 (4.91) |
7.2 (5.98) |
18.7 (6.26) |
21.8 (2.83) |
0.5 (1.07) |
1.5 (2.68) |
10.1 (3.66) |
14.9 (3.23) |
|
14,100 – 20,000 |
2.2 (3.48) |
2.8 (3.49) |
7.6 (2.87) |
-0.2 (0.04) |
0.1 (0.37) |
0.2 (0.64) |
4.3 (2.94) |
5.9 (2.68) |
|
20,000 – 25,800 |
-0.7 (0.90) |
-0.3 (0.40) |
-3.1 (1.50) |
-1.1 (0.24) |
0.1 (0.39) |
0.4 (0.96) |
-1.3 (0.83) |
-0.3 (0.13) |
|
25,800 – 32,268 |
0.6 (2.46) |
0.7 (2.25) |
0.2 (0.09) |
-1.3 (0.46) |
0.09 (0.42) |
0.3 (1.15) |
1.1 (3.01) |
0.9 (1.92) |
|
32,268 – 40,000 |
0.03 (0.12) |
-0.002 (0.01) |
-0.2 (1.03) |
0.1 (0.51) |
0.3 (0.82) |
0.3 (0.74) |
-0.1 (0.32) |
0.08 (0.17) |
|
40,000 – 50,001 |
0.2 (1.06) |
0.2 (0.75) |
0.04 (0.05) |
4.8 (3.52) |
0.05 (0.81) |
0.009 (1.31) |
0.3 (1.21) |
0.2 (0.50) |
|
50,001 – 66,857 |
0.06 (1.46) |
0.06 (1.09) |
-1.0 (0.44) |
-0.6 (0.23) |
-0.04 (0.51) |
-0.03 (0.40) |
0.08 (1.89) |
0.07 (1.09) |
|
66,857 – 320,633 |
0.003 (0.48) |
0.007 (0.77) |
0 (0.00) |
0 (0.00) |
0 (0.00) |
0 (0.00) |
-0.003 (0.37) |
0.007 (0.63) |
Notes: Numbers in parentheses are t-statistics.
households, and 26 and 33 percent for the lowest-income married households.
For heads of household, high marginal transfer effects are not faced exclusively by the lower deciles. In fact, marginal transfer effects are substantial for heads of household more than halfway up the income distribution. Specifically, for heads of household, estimated marginal transfer effects are significantly different from zero and large in magnitude all the way up to decile six. Moreover, 30 percent of all heads of household face marginal transfer effects of over 100 percent on earned income and 40 percent face marginal transfer effects of over 100 percent on unearned income. In contrast, only the first decile of single and married households face large and statistically significant marginal transfer effects on earned income.
I investigate the stability of these high marginal transfer effects and whether the differences between filing statuses can be attributed to the use of different income ranges by estimating the model using a common set of income ranges. Specifically, I use income ranges based on the decile increments for all households pooled together.
I find that 10 percent of all households face marginal transfer effects of over one 100 percent on both earned and unearned income, though the first income range (gross income between $0 and $2,417) consists predominately of low-income heads of household. Marginal transfer effects taper off quickly for the succeeding income ranges. Unexpectedly, however, married households in the first pooled status decile (with gross income between $0 and $2,417) actually face higher marginal transfer effects on earned income (135 percent) than heads of household in the same income range (122 percent), though the high rate applies to fewer than 2 percent of all married households in the sample. For heads of household, the opposite is true; more than 30 percent of all heads of household belong to the first pooled status decile. The results are thus similar to those for the three first deciles of heads of household in Table 1; that is, the marginal transfer effects are very high. The high marginal transfer effects for married households and heads of household can be explained by the strong link between transfer payments in the U.S. and the presence of children. The high marginal transfer effects are more persistent for heads of household, however. Heads of household also face the highest and most persistent marginal transfer effects on unearned income.
Only the first pooled status decile of single households face high marginal transfer effects, 42 and 82 percent on earned and unearned income, respectively. The marginal transfer effects for the first pooled status decile are, however, smaller under this grouping than under the decile grouping because of an upward shift in incomes. Included in the first pooled status decile are wealthier individuals who receive relatively few transfer payments.
In short, marginal transfer effects are an important tax on earned and unearned income, especially for heads of household where they are important far up the income distribution. The results are fairly robust across the two income range groupings.
2.4 Estimation of Marginal Tax Rates
METRs also depend on numerous taxes. I calculate marginal tax rates for each of the income ranges and filing statuses, both for earned and unearned income. Taxes can be divided into the direct and indirect varieties. Direct taxes reduce the income that finances consumption and indirect taxes increase the cost of goods and services consumed. All taxes – be it personal income taxes, payroll taxes, or indirect taxes – affect household consumption decisions either by reducing the amount of income available or by making consumption more expensive.
My focus on the overall tax wedge between what an employer pays and a worker consumes dictates that I include all taxes affecting the consumption obtainable from a unit of labor income. The cost for the employer to pay the employee one unit of gross earned income is 1+tp, where tp is the employer-paid payroll tax rate. On that unit of gross earned income the employee pays personal income taxes and employee-paid payroll taxes. Denote the personal income tax rate by tm, the employee-paid payroll tax rate by te, and their sum (tm + te) by tme. The employee is then left with (1- tme) unit of disposable income after personal income taxes and payroll taxes are levied. On disposable income, indirect taxes are paid at a rate of ti, which leaves (1- tme)(1- ti) available for consumption. The total taxes levied on a unit of earned income are, therefore, tp + tme + ti - tme ti, so the marginal tax rate on earned income is
. (2)
The marginal tax rate on unearned income can be constructed in a similar fashion. Personal income taxes are paid on a unit of unearned income, leaving (1- tm). After indirect taxes are paid, (1- tm)(1- ti) is available for consumption. The total tax levied on unearned income is, thus, tm+ ti - tm ti, so the marginal tax rate on unearned income is
(3)
I compute marginal tax rates on both earned and unearned income for each sample household, using the above formulae. Statutory tax rates are used for the personal income tax rate and the employer-paid and employee-paid payroll tax rates. Estimating the indirect tax rate is more complicated and I will describe the process in more detail below.
Personal Income Taxes
Households pay federal taxes on personal income. I calculate the marginal tax rate for each household in the 1990 CPS data set in the same way that households did. I first compute each household’s adjusted gross income, the sum of earned and unearned income. I then subtract exemptions, deductions, and tax credits to get taxable income, to which I apply the 1989 federal tax table.
I use CPS provided information to determine each household’s filing status. Single individuals and unmarried individuals with children under 18 years of age are assumed to file as single households and as heads of household, respectively. Married households are less straightforward; they can file either jointly or separately. I assume they file so as to minimize their tax liability. Specifically, I compute every married household’s tax payments both ways and choose the lower of the two (filing separately is preferred by only 2.4 percent).
Households receive exemptions, standard deductions, and tax credits. The assumption that everybody files for standard deductions is purely for convenience; I have no information with which to calculate itemized deductions. This likely overestimates tax payments by higher-income households, and hence their marginal tax rates, however, since higher-income households commonly gain by filing itemized.
At the lower end of the income distribution, the Earned Income Tax Credit (EITC) is an important component of federal taxes and must be included in calculations of the marginal tax rates. The EITC, a central part of the federal antipoverty policy, is a refundable credit that allows qualified taxpayers to receive a tax refund from the government. A taxpayer with no federal tax liability, for example, receives a tax refund for the full amount. The EITC could also be counted as a transfer as it is a specific type of welfare program, but I choose to count it as a tax. In 1989 married households, living with dependents, filing jointly, and whose earnings and adjusted gross income are greater than zero but less or equal to $19,340 and working heads of household with earnings and adjusted gross income less or equal to $19,340 were eligible for the EITC. The EITC has three ranges. The credit increases with earned income in the first range, is constant at the maximum level ($910) in the second range, and is gradually phased out over the third. I assume that all eligible households receive the credit.
In 44 states, households pay state personal income taxes on top of federal taxes. I calculate state taxes by applying 1989 state tax tables for households in the four most populous states, which together make up 26 percent of the 1990 CPS. Three of these states, California, New York, and New Jersey impose relatively high state personal income taxes, while one, Texas, imposes no state personal income tax. For residents of the remaining states, I assume that state personal income taxes for each decile and filing status are an average across these four states. State marginal tax rates are generally low relative to federal rates, so assumptions about state personal taxes may not be critical. I conduct sensitivity analyses, however, to test how crucial assumptions about state taxes actually are.
Payroll Taxes
The payroll tax can be treated either as a tax or as forced savings for retirement. If payroll and unemployment taxes are viewed as generating future benefits, including these taxes will overstate actual marginal tax rates. Whether taxpayers view payroll and unemployment taxes as future benefits or taxes depends on how much they expect to pay in taxes relative to their expectations of retirement and unemployment benefits. The difference between how much the taxpayer pays and how much he or she receives depends, in turn, on several factors: pension system (pay as you go or fully funded), age, years in retirement (unemployment), likelihood of unemployment, political credibility, and the like. In what follows, I treat payroll and unemployment taxes as taxes and not retirement or insurance premiums, but conduct sensitivity analyses where the social security payroll taxes are treated as future benefits for old age and not as taxes.
In 1989, employers paid a 7.51 percent social security payroll tax on employees’ earnings up to $48,000. On the first $7,000, employers paid an additional unemployment insurance tax of 6.2 percent. Thus, tp equaled 0.1371 for earnings up to $7,000, 0.0751 for earnings between $7,000 and $48,000, and zero above that. The employee paid 7.51 percent on earnings (te) up to $48,000.
Indirect Taxes
I also need an estimate of the indirect tax rate paid on all forms of consumption. Using the statutory indirect tax rates would generally overestimate the indirect tax rates on consumption because statutory indirect tax rates are not levied on all consumption. For example, groceries and clothing are exempted from indirect taxes in many states. Moreover, statutory indirect tax rates are undesirable because they vary widely between states. To get around these problems, I instead use total indirect tax revenues (tax revenues from taxes on goods and services) in the U.S. for 1989 divided by total disposable income for the same year. The resulting tax rate is 6.34 percent. To determine how crucial assumptions about the indirect tax rate are, I conduct sensitivity analyses where the indirect tax rate is set to equal zero.
Because sales and excise taxes vary across both goods and states, the approximation based on national figures used here will overestimate the indirect tax rate in some states and underestimate it in others. Similarly, the rates will be overestimated for households spending a smaller than average fraction on excise- and sales-taxed goods and vice versa. The data, however, suggest that households in the five quintiles spend on average similar fractions of total expenditures on various consumption categories, see Table B1 (appendix B).
Marginal Tax Rate Estimates
I use the same decile approach for estimating marginal tax rates that I used for estimating marginal transfer effects. Specifically, I first compute marginal tax rates for each household by inserting the marginal personal income tax rate, marginal payroll tax rate, and the marginal indirect tax rate into equations (2) and (3). The marginal tax rate for each decile is then estimated as the average marginal tax rate of the relevant households. I report them for both earned and unearned income in Tables 3 and 4. Table 3 reports marginal tax rates for each decile for the three filing statuses, while Table 4 reports them for the three filing statuses and for all households using pooled status deciles. I prefer the figures in Table 3 because they indicate how common a specific rate is, though I use the Table 4 figures for comparing rates between filing statuses because the income ranges are standardized.
Marginal tax rates are lower on unearned than on earned income because payroll taxes are levied only on earned income. Households, thus, retain more for consumption from a unit increase in unearned income than from a unit increase in earned income (given that the marginal transfer effect is zero or the same for the two income sources). Heads of household pay the lowest and single households the highest marginal tax rates. Married households pay a rate somewhere in-between (Table 4). Heads of household pay the least because they earn, on average, less than single and married households (Table B, appendix A). In addition, working heads of household receive the Earned Income Tax Credit which reduces their marginal tax rates on earned income. Finally, the federal tax brackets for heads of household are wider than tax brackets for single and
Table 3.
Marginal Tax Rates in percent on Earned and Unearned Income for deciles
|
Decile |
Heads of household |
Single Households |
Married Households |
|||
|
MTRL |
MTRK |
MTRL |
MTRK |
MTRL |
MTRK |
|
|
1 |
11.9 |
6.3 |
23.8 |
6.3 |
14.0 |
9.8 |
|
2 |
11.9 |
6.3 |
23.8 |
6.3 |
25.4 |
20.9 |
|
3 |
11.3 |
6.3 |
29.0 |
12.2 |
32.1 |
22.5 |
|
4 |
12.4 |
6.3 |
36.3 |
22.7 |
33.6 |
23.3 |
|
5 |
11.8 |
11.1 |
35.6 |
23.1 |
34.4 |
24.1 |
|
6 |
18.1 |
19.3 |
36.5 |
24.4 |
40.1 |
30.7 |
|
7 |
23.7 |
22.3 |
37.4 |
25.4 |
43.6 |
36.8 |
|
8 |
32.0 |
22.5 |
47.3 |
36.2 |
44.0 |
37.5 |
|
9 |
33.9 |
23.8 |
50.2 |
39.1 |
43.4 |
38.2 |
|
10 |
41.7 |
36.3 |
48.3 |
41.8 |
43.0 |
42.0 |
married households, leading to a larger number of households in the lowest tax bracket.
In general, heads of household face very low marginal tax rates on both earned and unearned income (Table 3). The first two deciles, whose earned income is zero, face a marginal tax rate of 12 percent. Consequently, they would be left with $0.88 for consumption after all taxes are paid were they to increase their earned income by a dollar. Households in deciles three and five face even smaller marginal tax rates because the Earned Income Tax Credit increases with income up to a certain income level ($10,250). Households in decile seven (and a few in decile 8) are entitled to the Earned Income Tax Credit and therefore face relatively low marginal tax rates, between 12 and 20 percent.
Some married households are entitled to the Earned Income Tax Credit as well, resulting in low marginal tax rates at the lower end of the income distribution for married households. In fact, the marginal tax rate is only 14
Table 4.
Marginal Tax Rates in percent on Earned and Unearned Income for pooled status deciles
|
Gross |
All Households |
Heads of household |
Single Households |
Married Households |
||||
|
Income |
MTRL |
MTRK |
MTRL |
MTRK |
MTRL |
MTRK |
MTRL |
MTRK |
|
0 - 2,417 |
11.8 |
6.3 |
11.8 |
6.3 |
23.8 |
6.3 |
12.0 |
6.3 |
|
2,417 – 8,200 |
22.8 |
11.4 |
11.8 |
6.3 |
30.3 |
14.1 |
12.2 |
6.3 |
|
8,200 - 14,100 |
25.1 |
18.7 |
16.2 |
17.6 |
35.6 |
23.1 |
15.7 |
13.0 |
|
14,100 –20,000 |
29.0 |
22.4 |
24.9 |
22.3 |
36.7 |
24.6 |
23.3 |
20.4 |
|
20,000 – 25,800 |
36.5 |
25.9 |
33.4 |
22.8 |
41.6 |
30.3 |
32.1 |
22.3 |
|
25,800 – 32,268 |
38.0 |
28.0 |
34.5 |
24.1 |
49.8 |
38.8 |
32.8 |
23.0 |
|
32,268 – 40,000 |
38.8 |
28.5 |
42.6 |
33.2 |
50.2 |
39.2 |
34.4 |
24.0 |
|
40,000 – 50,001 |
43.3 |
34.4 |
46.8 |
38.3 |
49.6 |
38.9 |
41.7 |
33.1 |
|
50,001 – 66,857 |
44.1 |
38.4 |
35.1 |
38.7 |
46.6 |
43.6 |
44.0 |
37.5 |
|
66,857 – 320,633 |
43.2 |
41.1 |
37.9 |
42.4 |
47.7 |
44.8 |
43.0 |
40.7 |
percent for the first decile. Households in deciles three and up do not qualify for the Earned Income Tax Credit and hence face higher marginal tax rates, between 32 and 44 percent. Single households face relatively high marginal tax rates, between 24 and 50 percent. If a single household in decile one increased its earned income by a unit, it would owe 23.8 cents in taxes, mainly in payroll and indirect taxes.
Marginal tax rates for unearned income are low for most heads of household: 40 percent face a marginal tax rate of 6.3 percent on unearned income and only the top 10 percent face marginal tax rates greater than 30 percent. The marginal tax rates on unearned income faced by single households are slightly higher. Only 20 percent face a rate of 6.3 percent and fully 30 percent face rates above 30 percent. Married households face substantially higher marginal tax rates on unearned income. For example, 50 percent of all married households face rates over 30 percent.
2.5 Marginal Effective Tax Rates
Having estimated the marginal transfer effects in section 2.3 and the marginal tax rates in section 2.4, I am now ready to estimate METRs for each decile. This is done by simply adding the MTE and the MTR together. Note, I assign a zero value to marginal transfer effects that are not significantly different from zero at a 95 percent confidence level.
METR Estimates
Estimates of marginal effective tax rates are presented in Tables 5 and 6, and in Figures 1, 2, 3, and 4. Figures 1 and 2 present METR schedules over deciles for the different filing statuses on earned and unearned income, respectively. Figures 3 and 4 present corresponding METR schedules using the same income ranges.
The most remarkable result is that low-income heads of household face exceptionally high METRs. Alarmingly, 30 percent of all heads of household face rates of 100 percent or more on earned income. To put this in perspective, a head of household with no earned income would decrease its consumption by 44 cents if it earned a dollar.
These extremely high METRs may go a long way to explaining the remarkably inelastic labor supply of low-income female heads of household. Considering the high METRs low-income heads of household face, considerable decreases in METRs might still be consistent with substantial work disincentives. Indeed, Moffitt [1992] found that the labor supply of female heads was extraordinarily stable over time despite major changes in benefit levels, benefit-reduction rates, benefit-earnings ratios, and unemployment rates.
Table 5.
Marginal Effective Tax Rates in percent on Earned and Unearned Income for deciles
|
Decile |
Heads of household |
Single Households |
Married Households |
|||
|
METRL |
METRK |
METRL |
METRK |
METRL |
METRK |
|
|
1 |
144.3 |
687.5 |
79.5 |
216.0 |
39.5 |
42.4 |
|
2 |
144.3 |
687.5 |
23.8 |
28.4 |
29.4 |
26.5 |
|
3 |
143.7 |
687.5 |
29.0 |
12.2 |
32.1 |
22.5 |
|
4 |
57.5 |
182.2 |
36.3 |
26.7 |
33.6 |
23.3 |
|
5 |
36.7 |
11.1 |
35.6 |
23.1 |
34.4 |
24.1 |
|
6 |
40.2 |
55.7 |
36.5 |
24.4 |
40.1 |
30.7 |
|
7 |
23.7 |
22.3 |
37.4 |
25.4 |
43.6 |
36.8 |
|
8 |
32.0 |
22.5 |
47.3 |
36.2 |
44.0 |
37.5 |
|
9 |
33.9 |
23.8 |
50.2 |
39.1 |
43.4 |
38.2 |
|
10 |
41.7 |
36.3 |
48.3 |
41.8 |
43.0 |
42.0 |
The METRs on unearned income for low-income heads of household are even higher than the rates on earned income, 688 percent. This figure may seem unreasonably large. As pointed out earlier, the regression approach may be a less appropriate technique for estimating marginal transfer effects on unearned income. The fact that METRs on unearned income are high, however, is consistent with results from Hubbard et al. [1995]. They find that an increase in wealth can lead to a decline in consumption over some ranges, and that social welfare programs with means testing discourage saving by households with low expected lifetime incomes. Low-income heads of household, hence, face even stronger disincentives to save and accumulate physical capital that may further contribute to the welfare dependency trap.
Low-income heads of household face very high METRs. Middle- and high-income heads of household face considerably lower and fairly constant rates (Figure 1 and 2). For those with gross incomes between $6,400 and $32,010, they are generally between 32 and 40 percent on earned income, though they are a little lower for decile seven. On unearned income, rates are generally lower than rates on earned income for the same income interval. The decile of individuals with the highest incomes face slightly higher METRs because of progressively higher marginal tax rates on earned and unearned income.
Figure 1.
Marginal Effective Tax Rates on Earned Income for Heads of Household, Single, and Married Households by deciles

Note: SMETR(L), MMETR(L), HMETR(L) are METRs on earned income for single households, married households and heads of household, respectively.
The METRs for single households follow a similar, though less extreme, pattern to those of heads of household (Figure 1 and 2). Households in the first decile, with a mean gross annual income of $195, face a METR on earned income of 80 percent, providing them with fairly small incentives to work. The corresponding METRs for unearned income are higher, 216 percent, eliminating any incentive to save as well. In fact, single households in the first decile work on average only 6 hours per week and have almost no unearned income (Table C, appendix A). The METRs fall much faster for single households than for heads of households, however. In fact, the second and third deciles face fairly low METRs on both earned and unearned income, somewhere in the upper 20 percents. Again, however, the highest deciles face higher rates because of higher marginal tax rates, around 50 percent for earned income and 35 to 40 percent for unearned income.
The METRs faced by married households differ substantially from those faced by single households and heads of household (Figure 1 and 2). Low-income married households do not face particularly high METRs, primarily because their marginal transfer effects are fairly low. For the most part, the METRs increase linearly with a slightly positive slope.
Households in decile eight (with a mean gross income of $57,629) face the highest METR on earned income, 44 percent, mostly because of high marginal tax rates. Payroll taxes are not levied on the highest income categories, however, so the marginal tax rates and hence METRs decline for deciles nine and ten. For unearned income, the first decile (with a mean gross income of $7,969) faces the highest METR, 42 percent, only slightly more than that faced by the tenth decile. Middle-income households, with gross incomes between $22,000 and $45,701, face METRs on earned income between 30 and 40 percent. The corresponding numbers for unearned income are generally between 20 and 30 percent.
Figure 2.
Marginal Effective Tax Rates on Unearned Income for Heads of Household, Single, and Married Households by deciles

Note: SMETR(K), MMETR(K), HMETR(K) are METRs on earned income for single households, married households and heads of household, respectively.
The differences in the METRs on earned income for different filing statuses are mainly the result of variation in marginal transfer effects, not marginal tax rates. Low-income heads of household with high marginal transfer effects face the highest METRs. Low-income single households, but not the very lowest, have low marginal transfer effects and face the lowest METRs, below 30 percent. METRs on earnings for married households exhibit the least variation because variations in marginal transfer effects and marginal tax rates across the income distribution for married households are small. The highest rate, 44 percent for high-income households, is low compared to the highest rates for heads and singles, and the lowest
Table 6.
Marginal Effective Tax Rates in percent on Earned and Unearned Income for pooled status deciles
|
Gross |
All Households |
Heads of household |
Single Households |
Married Households |
||||
|
Income |
METRL |
METRK |
METRL |
METRK |
METRL |
METRK |
METRL |
METRK |
|
0 - 2,417 |
119.4 |
192.4 |
134.2 |
683.5 |
66.2 |
88.6 |
147.3 |
188.3 |
|
2,417 – 8,200 |
29.3 |
31.6 |
41.7 |
135.4 |
32.4 |
20.6 |
49.1 |
50.2 |
|
8,200 - 14,100 |
29.7 |
25.9 |
34.9 |
39.4 |
35.6 |
24.6 |
25.8 |
27.9 |
|
14,100 –20,000 |
31.2 |
25.2 |
32.5 |
22.3 |
36.7 |
24.6 |
27.6 |
26.3 |
|
20,000 – 25,800 |
36.5 |
25.9 |
33.4 |
22.8 |
41.6 |
30.3 |
32.1 |
22.3 |
|
25,800 – 32,268 |
38.6 |
28.7 |
34.5 |
24.1 |
49.8 |
38.8 |
33.9 |
23.0 |
|
32,268 – 40,000 |
38.8 |
28.5 |
42.6 |
33.2 |
50.2 |
39.2 |
34.4 |
24.0 |
|
40,000 – 50,001 |
43.3 |
34.4 |
46.8 |
43.1 |
49.6 |
38.9 |
41.7 |
33.1 |
|
50,001 – 66,857 |
44.1 |
38.4 |
35.1 |
38.7 |
46.6 |
43.6 |
44.0 |
37.5 |
|
66,857 – 320,633 |
43.2 |
41.1 |
37.9 |
42.4 |
47.7 |
44.8 |
43.0 |
40.7 |
rate for married households, 29 percent, is slightly higher than the lowest rate for single households, 24 percent.
The METR schedules look similar across the different filing statuses whether households are grouped using deciles or pooled status deciles (Figure 3 and 4), with the exception that married households in the income range $0 to $2,417 face an METR on their earned income of 147 percent. While this number is surprisingly high, it must be remembered that few married households, less than 2 percent, actually face this situation. Indeed, the 134 percent faced by low-income heads of household is perhaps more worrisome because it applies to more than 30 percent of heads of household.
Figure 3.
Marginal Effective Tax Rates on Earned Income for pooled status deciles
Note: AMETR(L) is METRs on earned income for all households.
Figure 4 compares METRs on unearned income. The shape of METR schedules for unearned income for the different household types is also similar when the households are grouped into pooled status deciles, though the levels differ. Again, METRs on unearned income are extremely high for heads of household.
My METR estimates are substantially higher than previous estimates. It is hard to compare numbers between this study and earlier studies, however, because of substantial differences in the approaches employed and the benefits considered. My estimates are generally more comprehensive than estimates from earlier studies, though when aggregated across income and filing statuses they are fairly similar to Browning and Johnson’s.
Figure 4.
Marginal Effective Tax Rates on Unearned Income for pooled status deciles
Note: AMETR(K) is METRs on unearned income for all households.
2.6 Conclusions
In this paper, I estimate METRs on earned and unearned income across the income distribution for heads of household, single households, married households, and all households pooled together in the U.S. These METRs measure how fiscal policy affects households’ incentives to work and accumulate capital. The higher they are, the lower are the incentives.
Alarmingly, I find that many low-income heads of household face METRs that are close or even greater than one hundred percent. In fact, 30 percent of all heads of household face METRs of over 100 percent on earned income. For unearned income, 30 percent face a rate over 600 percent.
The reductions in METRs for low-income heads of household required to provide work incentives for the poor, a goal of many current welfare reforms, are enormous and practically unlikely. Indeed, these results may go a long way toward explaining why the labor supply of low-income household heads is so remarkably inelastic and has not changed over time, despite changes to the welfare system and tax laws which have effectively lowered METRs. It seems likely, then, that welfare recipients can perhaps only be compelled to enter the labor force by linking welfare eligibility to work directly. Alternatives should be explored, however, that can provide for the poorest households without creating such extreme disincentives to future self-sufficiency. This is especially urgent because society is becoming increasingly dissatisfied with the functioning of the current welfare system. The Earned Income Tax Credit helps to bring down the high METRs at the very end of the income distribution, but is not enough and its effect tapers off quickly.
References
Barro, R. and C. Sahasakul, 1983, "Measuring the Average Marginal Tax Rates from the Individual Income Tax," Journal of Business, 56(4), 419-452.
¾ , 1986, "Average Marginal Tax Rates from Social Security and the Individual Income Tax," Journal of Business, 59(4), 555-566.
Browning, E. and W. Johnson, 1979, "The Distribution of the Tax Burden," American Enterprise Institute for Public Policy Research, Washington D.C.
Caniglia, A., 1988, "The Economic Evaluation of Food Stamps: An Intertemporal Analysis With Nonlinear Budget Constraints," Public Finance Quarterly, 16(1), 3-29.
Chavas, J. and M. Yeung, 1982, "Effects of the Food Stamp Program on Food Consumption in the Southern United States," Southern Journal of Agricultural Economics, 14(1), 131-139.
Dickert, S., Houser, S. and J. K. Scholz, 1995, "The Earned Income Tax Credit and Transfer Programs: A Study of Labor Market and Program Participation," Tax Policy and the Economy, NBER, 19, 1-50.
Feldstein, M.S., 1973, "On the Optimal Progressivity of the Income Tax," Journal of Public Economics, 2, 357-376.
¾ , 1995(a), "College Scholarship Rules and Private Saving," American Economic Review, 85(3), 552-566.
¾ , 1995(b), "The Effect of Marginal Tax Rates on Taxable Income: A Panel Study of the 1986 Tax Reform Act," Journal of Political Economy, 103(3).
¾ , 1995(c), "Tax Avoidance and the Deadweight-Loss of the Income Tax," Working Paper, Cambridge, Mass., NBER, 5055.
Feldstein, M.S. and A. Samwick, 1992, "Social Security Rules and the Marginal Tax Rates," National Tax Journal, 45(1), 1-22.
Fraker, T., 1990, "The effect of Food Stamps on Food Consumption: A Review of the Literature," U.S. Department of Agriculture, Washington D.C.
¾ , Martini, A., Ohls, J. and M. Ponza, 1995, "The effects of Cashing-Out Food Stamps on Household Food Use and the Cost of Issuing Benefits," Journal of Policy Analysis and Management, 14(3), 372-392.
¾ , T., Moffitt, R. and D. Wolf, 1985, "Effective Tax Rates and Guarantees in the AFDC Programs, 1967-1982, " Journal of Human Resources, 20(2), 251-263.
Hamermesh, D. and J. Johannes, 1985, "Food Stamps as Money: The Macroeconomics of a Transfer Program," Journal of Political Economy, 93(11), 205-213.
Harberger, A., 1962, "The Incidence of the Corporation Income Tax," Journal of Political Economics, 70(3), 215-240.
Hubbard, G., Skinner, J., and S. Zeldes, 1995, "Precautionary Saving and Social Insurance," Journal of Political Economy, 103(2), 360-399.
Hutchens, R., 1978, "Changes in AFDC Tax Rates 1967-1971," Journal of Human Resources, 13(1), 60-74.
Joines, D., 1981, "Estimates of the Effective Marginal Tax Rates on Factor Income," Journal of Business, 54(2), 191-226.
Levedahl, W., 1991, "The Role of Functional Form in Estimating the Effect of a Cash-Only Food Stamp Program," Journal of Agricultural Economic Research, 43(2), 11-19.
Long, J. and J. Gwartney, 1987, "Income Tax Avoidance: Evidence from Individual Tax Returns," National Tax Journal, 40(4), 517-532.
Lurie, I., 1974, "Estimates of Tax Rates in the AFDC Program," National Tax Journal, 27(1), 93-107.
Keane, M. and R. Moffitt, 1998, "A Structural Model of Multiple Welfare Program Participation and Labor Supply" International Economic Review, 39(3), 553-89.
Moffitt, R., 1979, "Cumulative Effective Tax Rates and Guarantees in Low-Income Transfer Programs," Journal of Human Resources, 14(1), 122-129.
¾ , 1989, "Estimating the Value of In-kind Transfer: The Case of Food Stamps," Econometrica, 57(2), 385-409.
¾ , 1992, "Incentive Effects of the U.S. Welfare System: A Review," Journal of Economic Literature, 30(1), 1-61.
¾ , 1995, "Current Population Surveys: March Individual-Level Extracts, 1968 -1992," a computer file. ICPSR version, Madison, WI, University of Wisconsin, Institute for Research on Poverty.
¾ and B. Wolfe, 1991, "A New Index to Value In-kind Benefits," Review of Income and Wealth, 37(4), 387-408.
Scholz, J., K., 1994, "The Earned Income Tax Credit: Participation, Compliance, and Anti-poverty Effectiveness," National Tax Journal, 47(1), 63-87.
Seater, J., 1982, "Marginal Federal Personal and Corporate Income Tax Rates in the U.S., 1909-1975," Journal of Monetary Economics, 10(3), 361-381.
¾ , 1985, "On the Construction of Marginal Federal Personal and Social Security Tax Rates in the U.S.," Journal of Monetary Economics, 15(1), 121-135.
Slemrod, J., 1992, "Do Taxes Matter? Lessons from the 1980’s," American Economic Review, 82(2), 250-256.
Smeeding, T., 1982," Alternative Methods of Valuing Selected In-Kind Transfer Benefits and Measuring Their Effect on Poverty," U.S. Department of Commerce, Bureau of Census. Technical Paper no. 50, Washington D.C.
U.S. Bureau of Census, 1990, Current Population reports, Series P-60, "Measuring the Effect of Benefits and Taxes on Income and Poverty: 1989," U.S. Government Printing Office, Washington D.C.
¾ , 1991, "Government Finance," U.S. Government of Printing Office, Washington D.C.
¾ , 1994, "Statistical Abstract of the United States 1994", 114th edition, Washington D.C.
U.S. Department of Commerce, 1985, "American Housing Survey," Washington D.C.
U.S. Department of Labor, Bureau of Labor Statistics, 1992, "Consumer Expenditures in 1991," Report 835, Table 1, Washington D.C.
U.S. Department of the Treasury, IRS, 1990, "Your Federal Income Tax For Individuals 1989 Returns," Washington D.C.
¾ , 1991a, "Annual Report," Washington D.C.
¾ , 1991b, "The Final Monthly Treasury Statement," Washington D.C.
¾ , 1992, "Statistics of Income: Individual Income Tax Returns 1989", Washington D.C.
U.S. Department of General Accounting Office, 1990, "Low-Income Home Energy Assistance: A Program Overview," GAO, Washington D.C.
U.S. General Service Administration, 1990, "1990 Catalog of Federal Domestic Assistance," Superintendent of Documents, Government Printing Office, Washington D.C.
Appendix A
Table A.
Household Statistics for All Households
|
Decile |
Income |
Transfers |
Labor |
|||
|
Gross |
Earned |
Unearned |
||||
|
Lowest |
Highest |
|||||
|
1 |
0 |
2,417 |
435 |
143 |
2,883 |
10 |
|
2 |
2,417 |
8,200 |
4,567 |
693 |
629 |
31 |
|
3 |
8,200 |
14,100 |
10,531 |
774 |
321 |
38 |
|
4 |
14,100 |
20,000 |
16,253 |
740 |
189 |
41 |
|
5 |
20,000 |
25,800 |
21,798 |
927 |
107 |
42 |
|
6 |
25,800 |
32,268 |
28,179 |
907 |
49 |
43 |
|
7 |
32,268 |
40,000 |
34,846 |
1,233 |
22 |
44 |
|
8 |
40,000 |
50,001 |
43,200 |
1,425 |
18 |
44 |
|
9 |
50,001 |
66,857 |
54,684 |
2,544 |
6 |
44 |
|
10 |
66,857 |
320,633 |
85,729 |
8,849 |
6 |
47 |
Note: All values reflect decile means. Earned, unearned income and transfers refer
to yearly values. Hours worked, Labor, are per week. For married households, hours worked is for the head.
Table B.
Household Statistics for Heads of Household
|
Decile |
Income |
Transfers |
Labor |
|||
|
Gross |
Earned |
Unearned |
||||
|
Lowest |
Highest |
|||||
|
1 |
0 |
0 |
0 |
0 |
7,318 |
0 |
|
2 |
0 |
0 |
0 |
0 |
7,355 |
0 |
|
3 |
0 |
2,000 |
570 |
41 |
5,247 |
19 |
|
4 |
2,000 |
6,400 |
4,187 |
59 |
2,599 |
34 |
|
5 |
6,400 |
10,515 |
8,423 |
121 |
1,230 |
38 |
|
6 |
10,515 |
14,000 |
12,222 |
109 |
555 |
40 |
|
7 |
14,000 |
18,002 |
15,756 |
171 |
500 |
41 |
|
8 |
18,002 |
23,554 |
20,201 |
208 |
104 |
43 |
|
9 |
23,554 |
32,010 |
26,894 |
482 |
100 |
43 |
|
10 |
32,010 |
144,194 |
43,405 |
2,412 |
24 |
44 |
Table C.
Household Statistics for Single Households
|
Decile |
Income |
Transfers |
Labor |
|||
|
Gross |
Earned |
Unearned |
||||
|
Lowest |
Highest |
|||||
|
1 |
0 |
1,000 |
119 |
77 |
1151 |
6 |
|
2 |
1,000 |
3,400 |
1,725 |
461 |
307 |
24 |
|
3 |
3,400 |
6,406 |
3,986 |
894 |
136 |
29 |
|
4 |
6,406 |
10,000 |
7,071 |
1,174 |
98 |
34 |
|
5 |
10,000 |
14,000 |
10,710 |
1,210 |
43 |
38 |
|
6 |
14,000 |
18,052 |
14,915 |
1,052 |
37 |
40 |
|
7 |
18,052 |
23,000 |
18,922 |
1,456 |
28 |
42 |
|
8 |
23,000 |
29,008 |
23,985 |
1,632 |
23 |
42 |
|
9 |
29,008 |
39,000 |
31,395 |
1,996 |
13 |
43 |
|
10 |
39,000 |
310,044 |
50,839 |
8,418 |
4 |
46 |
Table D.
Household Statistics for Married Households
|
Decile |
Income |
Transfers |
Labor |
|||
|
Gross |
Earned |
Unearned |
||||
|
Lowest |
Highest |
|||||
|
1 |
0 |
14,400 |
7,706 |
263 |
1,547 |
32 |
|
2 |
14,400 |
22,000 |
18,063 |
358 |
275 |
41 |
|
3 |
22,000 |
28,500 |
24,823 |
555 |
150 |
43 |
|
4 |
28,500 |
34,000 |
30,577 |
680 |
39 |
43 |
|
5 |
34,000 |
39,756 |
35,928 |
860 |
28 |
44 |
|
6 |
39,756 |
45,701 |
41,529 |
994 |
27 |
44 |
|
7 |
45,701 |
52,700 |
47,964 |
1,199 |
11 |
44 |
|
8 |
52,700 |
63,000 |
55,657 |
1,965 |
3 |
45 |
|
9 |
63,000 |
80,900 |
67,863 |
3,052 |
6 |
45 |
|
10 |
80,900 |
320,633 |
98,698 |
10,911 |
6 |
47 |
Appendix B
Table B1.
Fraction of Average Annual Expenditures Spent on Various
Consumption Goods of Pre-tax Income, by Quintiles
|
Item |
All Households |
Lowest20 % |
Second20 % |
Third 20 % |
Fourth20 % |
Highest20 % |
|
Housing |
30.6 |
36.4 |
32.5 |
30.5 |
29.4 |
29.3 |
|
Food |
14.3 |
17.4 |
17.4 |
15.1 |
14.4 |
12.2 |
|
Health care |
5.1 |
7.7 |
7.2 |
6.0 |
4.6 |
3.7 |
|
Entertainment |
5.0 |
4.4 |
3.9 |
4.6 |
5.2 |
5.6 |
|
Gas |
3.3 |
3.5 |
3.8 |
3.9 |
3.5 |
2.6 |
|
Personal care |
1.4 |
1.5 |
1.6 |
1.4 |
1.4 |
1.3 |
|
Alcohol |
1.0 |
0.9 |
1.0 |
1.2 |
1.0 |
0.9 |
|
Tobacco |
0.9 |
1.3 |
1.4 |
1.2 |
0.9 |
0.5 |
Source: U.S. Department of Labor, Bureau of Labor Statistics [1992).
Sensitivity analysis
1. I conduct sensitivity analyses by re-estimating the model using different gross values for the recipient. The different values are chosen by reducing, one at a time, the various component transfers by 50 percent, an arbitrary though rather extreme value.
I begin with public housing. When the gross value of public housing is reduced by 50 percent, the marginal transfer effect declines are the largest for the first decile of heads of households, 21 percentage points (3 percent) on unearned income and 9 percentage points (6 percent) on earned income. A 50 percent reduction in rent subsidy leads to a decline of at most 13 percentage points (2 percent) on unearned income and at most 4 percentage points (3 percent) on earned income. A 50 percent reduction in Energy Assistance lowers the marginal transfer effect by at most 3 percentage points (0.5 percent) on unearned income and 0.2 percentage points (0.1 percent) on earned income.
I conduct sensitivity analyses on Medicaid transfers using two different values. First, a 50 percent reduction leads to declines in marginal transfer effects of at most 21 percentage points (3 percent) on unearned income and 2 percentage points (1.6 percent) on earned income for the same first decile of heads of household. Second, the effect of using the higher mean Medicaid outlay per beneficiary ($1,242), i.e., under the hypothesis that all recipients value Medicaid benefits equally, the marginal transfer effects increased most for the first decile of heads of household, 48 percentage points (7 percent) on unearned income, and most for first decile of single households, 32 percentage points (22 percent) on earned income.
Finally, I examine the effect of reducing the value of public housing, rent subsidy, Energy Assistance, and Medicaid together. This reduces the marginal transfer effects for the first decile of heads of household by 62 percentage points (9 percent) on unearned income and by 7 percentage points (5 percent) on earned income. This suggests that these transfer programs play a less important role in the determination of the marginal transfer effects than, for example, AFDC and Food Stamps. My assumptions about the value of public housing, rent subsidies, Energy Assistance, and Medicaid transfers, thus, have little bearing on the results.
2. Re-estimating the model with quadratic earned and unearned income terms does not change the results significantly. There is no statistical difference between this regression and the base regression model. For unearned income, however, the fit may be slightly better with the inclusion of the quadratic term for decile one. Including interaction terms for state of residence and income does not improve the model. A simple F-test suggests that the interaction terms do not belong in the model.
3. Because I estimate state personal income taxes based only on the four most populous states in the sample, I may under- or over-estimate the tax obligations for individuals in other States. Excluding state personal income taxes affects neither marginal transfer effects nor METRs for the first deciles of all filing statutes and decreases METRs by at most 7 percentage points (15 percent) for the highest deciles. My results, thus, are unaffected at the lower end of the income distribution by assumptions about state taxes, and moderately sensitive at the higher end.
4. Social security payroll taxes can alternatively be treated as a form of forced saving for retirement. Excluding social security taxes altogether causes the marginal tax rate to decline by 6 percentage points (17.2 percent) for middle-income households, but by a lot less at the lower and higher ends of the income distribution.
5. Indirect tax rates affect METRs on earned income in two ways. A decline in the indirect tax rate first lowers the marginal tax rate on earned income directly, and then increases the net value of transfers and hence the marginal transfer effect. METRs on unearned income are unaffected by indirect taxes.
If ti is zero the marginal tax rate declines by nearly 3 percentage points for the first two deciles of heads of household (21.3 percent), and the marginal transfer effect increases by 9 percentage points (7 percent) on earned income for the lowest decile of heads of household. Taken together, this means that a change in the indirect tax rate can lower or increase the overall METRs. For the first deciles of heads of household the marginal transfer effect dominates and overall METRs on earned income increase by at most 3 percentage points (2 percent). For remaining deciles and filing statuses, overall METRs are reduced by at most 5 percentage points (about 12 percent).
6. I censor negative capital and self-employment incomes at zero. If negative capital and self-employment incomes are included, the results change notably at the lower end of the income distribution. For instance, the first decile of heads of household is no longer the decile receiving most transfers.
Chapter 3
The Optimal Benefit-Reduction Rate
3.1 Introduction
High welfare costs have made welfare reform a hot topic in many developed countries in recent years. In the U.S., for instance, welfare reform became a top priority on the political agenda in the 1990s (Keane[1995]), and it remains a major campaign issue in the 2000 presidential election. The wide range of countries that have recently discussed the need of welfare reforms includes countries from Northern Europe, for instance Sweden (Lindbeck et al. [1994], Socialdepartementet [1999]) and Denmark (Torfing [1999]); Central and Southern Europe, for instance the U.K. (Jessop [1995]) and Italy (Morley-Fletcher [1998]); and even the former Eastern Block (Kornai [1999]).
One of the key problems with many welfare systems is that welfare recipients face high "effective" marginal tax rates, thus making their economic returns to working very low. Even though the statutory tax rates paid in this income range are often low, means-testing of benefits can boost the marginal effective tax rate to 100 percent or more. In Sweden, for example, a welfare recipient who earns income loses welfare benefits on a one-for-one basis. The result of these high marginal effective tax rates is that many safety nets, rather than providing temporary relief, effectively trap participants into welfare dependency.
A common thread of recent welfare programs is the goal of increasing the economic incentives for recipients to join the labor force. The earned-income tax credit in the U.S., for instance, is a negative income tax that directly lowers the marginal effective tax rates faced by the working poor. Nevertheless, for a variety of reasons, the results of welfare reform have generally been mixed (see e.g., Haveman and Scholz [1994], Gueron [1996], Blank [1997], Torfing [1999]).
One potential drawback of lowering the marginal effective tax rates faced by welfare recipients is that doing so could generally be costly for the government. In order to reduce the marginal effective tax rates, either or both of the components that make up the marginal effective tax rates – that is, the marginal tax rate and the benefit-reduction rate – must be reduced. Reducing the marginal tax rate – defined as the increase in tax payments resulting from a unit rise in gross earned income – can be costly because it reduces the taxes paid by all income earners, not just welfare recipients. Similarly, reducing the benefit-reduction rate – defined as the reduction in transfers received resulting from a unit rise in gross earned income – is also costly. Making benefits independent of income, for example, would dramatically increase government outlays.
Decreases in the marginal effective tax rate paid by welfare recipients must be offset by increases in the marginal tax rate paid by higher-income individuals if the budget is to balance. From a general equilibrium perspective, it is not obvious that the distortions of an increase in tax rates faced by higher-income individuals are smaller than the distortions high marginal effective tax rates cause faced by welfare recipients. Indeed, little is known empirically about how they actually compare.
In this paper, we develop a model for deriving the optimal benefit-reduction rate based on Sheshinski’s [1972] model of optimal income taxation. Unfortunately, our framework does not permit an analytic solution without assumptions about functional form. Rather than making these assumptions, however, we derive mathematical bounds around the optimal benefit-reduction rate based on a number of labor supply elasticities.
We find that the optimal benefit-reduction rate depends negatively on the optimal marginal tax rate on labor income, and either negatively on the average labor supply elasticity with respect to changes in net income of welfare recipients or positively on welfare non-recipients’ labor supply with respect to changes in the benefit-reduction rate. The optimal benefit-reduction rate can even be negative if welfare recipients are very elastic in their labor supply and the optimal marginal tax rate on labor income is high. A negative optimal benefit-reduction rate implies that welfare recipients should be given a subsidy for working and an example is the Earned Income Tax Credit in the U.S.
By inserting estimates from the literature of labor supply elasticities into the analytical results, we compute numerical values of the upper bounds of the optimal benefit-reduction rate. These values can then be compared to actual rates. We find that actual rates in some countries are in excess of bounds derived in the optimal income taxation model.
3.2 General Approach
The theory of optimal income taxation, first developed by Mirrlees [1971], provides tools that are well-suited to exploring the welfare dependency problem. The essential elements in Mirrlees’ model are: the social welfare function, an ability distribution, households’ utility functions (labor supply elasticities), and a fiscal side where the government levies taxes to fund government expenditures. In particular, the social welfare function is a maximand and can be used as a yardstick for comparing tax-related distortions for one group against distortions for other groups. The ability distribution allows individuals to differ in their ability to earn wages. Individual utility functions capture individual preferences, for instance labor supply elasticities, which are key inputs in measuring the magnitude of the distortions that marginal tax rates create. Finally, the existence of a fiscal side allows for redistribution. The government chooses a tax function and thereby the level of redistribution.
Without assuming a specific utility function or ability distribution, Mirrlees was limited to a few analytical conclusions. Subsequent work showed that the assumptions made about the functional form are critical (e.g., Atkinson [1973], Stern [1976], and Tuomala [1984])).
Sheshinski [1972] placed linearity restrictions on Mirrlees’ model, which enabled him to draw more analytical conclusions. He found that when labor supply is a non-decreasing function of net wage, the optimal linear tax is characterized by a negative lump-sum tax (a subsidy) at zero income. Moreover, the marginal tax rate is less than one and bounded from above by a function of labor supply elasticities that decreases with the minimum elasticity of the labor supply.
We extend Sheshinski’s [1972] work by introducing the benefit-reduction rate to the model. That is, we generalize Sheshinski’s model from a linear tax to a tax schedule that is kinked with two linear segments. One segment is for welfare recipients who face both marginal tax and marginal benefit-reduction rates. The other segment is for welfare non-recipients who face only a marginal tax rate.
3.3 The Model
The optimal marginal tax rate on labor income and the optimal benefit-reduction rate are chosen so that social welfare is maximized subject to two constraints. The first constraint guarantees that individuals maximize their utility by choosing the optimal bundle of consumption and leisure given the tax rates. The second constraint ensures that the tax rates lead to an outcome that meets the government’s revenue requirements. The government sets the tax rates – the marginal tax rate on labor income and the benefit-reduction rate – thereby fixing the guarantee level. Individuals, in turn, take marginal tax rates as given and maximize utility by choosing optimal amounts of leisure and consumption.
3.3.1 Individual Utility Maximization
Individuals are assumed to have identical preferences that depend on consumption, c, and leisure, l,
(1)
where u is a continuously differentiable and strictly concave function with positive marginal utility of consumption and leisure. Leisure is defined as l = d -L, where d is hours available and L is hours worked. We assume that individuals differ in their ability to supply labor, their marginal product of labor, and hence have different wage rates, n. Gross earning, nL, thus, depends on ability.
Whether individuals receive welfare or not depends on their earned income. Individuals with zero earned income receive a guarantee level. This guarantee level is then diminished at the benefit-reduction rate, assumed to be linear, when individuals’ income increases. The income level at which individuals cease to receive welfare payments is referred to as the cut-off income, B, and is assumed to be set by the government. In addition, individuals who earn income pay income taxes. The income tax rate is assumed to be linear. We refer to individuals with income below the cut-off income as welfare recipients, denoted by the superscript wr, and individuals with income equal to or greater than this level as welfare non-recipients, denoted by wnr.
Disposable income for welfare recipients, wr, and welfare non-recipients, wnr, is given by
(2)
and
(3)
respectively. c is disposable income, T0 is the guarantee level given to individuals with zero earned income, ti is the marginal tax rate on labor income, tb is the benefit-reduction rate, n is wage rate, and L is labor supplied. Figure 1 shows how disposable income depends on gross earned income.
Figure 1.
Disposable Income as a function of Gross Earned Income
![]()
![]()

Individuals choose amounts of consumption and leisure to maximize their utility subject to their budget constraint. The first-order conditions for welfare recipients and welfare non-recipients are, respectively,
when nL < B, and (4)
![]()
where b = (1 - ti – tb) and a = (1 - ti) are the net returns to a unit increase in earned income. Solving the first-order conditions, the optimal amounts of leisure and consumption are
(5)
for welfare recipients and
(6)
for welfare non-recipients.
3.3.2 Government Utility Maximization
Wages in the population are distributed according to the density function f(n). The corresponding cumulative distribution function is F(n), where dF(n)/dn = f(n). The wage rate distribution is assumed to be continuous, where f(n) > 0 for 0 £ n < ¥ .
The social welfare function, W, integrates the utilities across individuals.
(7)
The maximization problem consists of finding the constants T0, b , and a that maximize W subject to satisfying the government budget constraint. The guarantee level, T0, is treated as a negative lump-sum tax. Net tax payments, TR, for welfare recipients and welfare non-recipients are then
and
(8)
![]()
respectively.
Total net tax receipts are calculated as the integral of individual tax payments across all of the individuals. The government budget constraint is thus
(9)
where b is the ability level associated with the cut-off income level B. Rewriting the equality in equation (9) yields
(10)
where

To solve the government’s optimization problem, we set up the Lagrangian
(11)
where l is the shadow price of the government’s budget constraint. The first-order conditions of £ with respect to T0, b , and a are

(12)
(13)
and
(14)
To obtain explicit solutions of the optimal benefit-reduction rate and the optimal marginal tax rate on labor income, specific functional forms of individual and social utility functions as well as assumptions about the wage distribution are required. It is possible to say something about the bounds of the optimal benefit-reduction rate and the optimal marginal tax rate without assuming specific functional forms, however, which is the topic of section 4.
3.4 Analytical Interpretation of the Tax Rates
We can interpret the tax rates analytically if we are willing to make assumptions about how the amounts of labor supplied by welfare and welfare non-recipients are affected by changes in T0, b , and a . This can be done by differentiating the first-order conditions from the individuals’ utility maximization problem with respect to T0, b , and a . (The calculations are presented in the appendix).
The optimal income taxation literature has demonstrated that the optimal tax rate should be less than unity (Mirrlees [1971]), otherwise L = 0 and c = 0. In our framework this corresponds to (ti + tb) < 1, which implies b > 0, for welfare recipients and a > 0 for welfare non-recipients.
If we substitute (1 - ti – tb) for b and (1 - ti) for a , and rewrite equations (13) and (14) using labor supply elasticities with respect to net wage, equations (12) to (14) become
(12')
(13')
and
![]()
(14')
where
![]()
![]()
and
![]()
in accordance with equation (A6) to (A8) and because a and b are greater than zero.
The first term in equations (12’) to (14’), the integral of the marginal utility of consumption, is positive because the marginal utility of consumption should always be positive, as are n and L(n).
It is relatively easy to show that l must be positive, that is, the marginal utility of relaxing the government budget is positive. First, it is clear that l cannot be zero. Similarly, a negative value of l can be ruled out because it cannot simultaneously satisfy all the equations. For instance, from equation (12’) a negative l requires that (ti + tb) < 0. However, equations (A7) and (13’) cannot hold when both l and (ti + tb) are negative.
It is clear from equation (13’) that at least one of the last two terms has to be negative for the whole expression to equal zero. If the absolute value of the third term is greater than the absolute value of the last term, that is
(15)
then the third term must be negative. There are several instances under which (15) may hold. Inequality of (15) may hold, for example, if there are many individuals with an income substantially larger than the cut-off income, that is B/nL is small. It may also hold if e bwnr and e bwr are very small, or if tb is negative and e bwr is large. If equation (15) is satisfied, equation (13’) implies that
(16)
where
is the weighted conditional average labor supply elasticity for welfare recipients,
(17)
The optimal benefit-reduction rate, tb, is then bounded from above by the average labor supply elasticity of welfare recipients with respect to net income and the optimal marginal tax rate on labor income. That is, the optimal benefit-reduction rate depends negatively on both the labor supply elasticity of welfare recipients and the optimal marginal tax rate on labor income. Thus, the more elastic the labor supply elasticity of welfare recipients the lower is the upper bound of the optimal benefit-reduction rate, and the higher the optimal marginal tax rate on labor income the lower is the optimal benefit-reduction rate.
If (15) is not satisfied, the optimal benefit-reduction rate is bounded by
(18)
where
(19)
This allows the benefit-reduction rate to be larger since
is negative. The optimal benefit-reduction rate can be negative if equation (15) is satisfied and
. This may happen if welfare recipients are elastic in their labor supply and the optimal marginal tax rate on labor income is high. A negative benefit-reduction rate implies that welfare recipients receive a subsidy when working.
From equation (14'), it follows
(20)
where
is the weighted conditional average labor supply elasticity for welfare non-recipients,
(21)
if
(22)
This condition holds when tb > 0 and
and
If tb < 0 and
is not sufficiently larger than
so that the above condition is validated, then,
(23)
where
(24)
The optimal marginal tax rate on labor income, ti, depends on the sign of the optimal benefit-reduction rate and on the magnitude of the labor supply elasticities with respect to net labor income for welfare and welfare non-recipients, respectively.
If tb is positive and welfare recipients are more elastic in their labor supply than welfare non-recipients, then the optimal marginal tax rate on labor income is bounded from above by the average labor supply elasticity of welfare non-recipients. The more elastic the labor supply of welfare non-recipients, the lower is the optimal marginal tax rate on labor income.
If tb is negative and
, however, the upper bound of the optimal marginal tax rate is higher. When the optimal benefit-reduction rate is negative, the upper bounds of the optimal marginal tax rate on labor income depend on the elasticities of welfare recipients as well as the magnitude of the optimal benefit-reduction rate. The bounds of the optimal marginal tax rate on labor income are allowed to be higher when the optimal benefit-reduction rate is negative and/or welfare non-recipients are more elastic than welfare recipients. When the optimal benefit-reduction rate is negative, more tax revenues are required to finance welfare recipients' subsidy for working. The optimal marginal tax rate on labor income must be higher, thus, for the budget to balance.
The optimal guarantee level is positive when the optimal benefit-reduction rate is positive or as long as
. The optimal guarantee level depends on the optimal levels of the tax rates and the labor supply elasticities. The guarantee level depends positively on the optimal benefit-reduction rate as long as the labor supply elasticity of welfare recipients with respect to a change in the benefit-reduction rate is low enough that an increase in the benefit-reduction rate is not completely offset by a decrease in the amount of labor supplied by welfare recipients. The guarantee level depends positively on the optimal marginal tax rate on labor income if welfare non-recipients' labor supply elasticity is low enough that an increase in the tax rate is not offset by a decrease in labor supplied. In addition, the optimal guarantee level depends on the cut-off income. An increase in the cut-off income will affect the amount of labor supplied by welfare non-recipients, so the total effect depends on how sensitive welfare non-recipients' labor supply is to changes in the cut-off income. If welfare non-recipients are relatively insensitive, an increase in the cut-off income when tb is positive will increase the guarantee level, while an increase in the cut-off income when tb is negative will lower the guarantee level.
3.5 Discussion
To put the analytical results in perspective, we insert a range of estimates of labor supply elasticities into the analytical results and compute numerical values of the upper bounds of the optimal marginal tax rate on labor income and the optimal benefit-reduction rate. We then insert country-specific estimates of the labor supply elasticities to get an indication of how close actual rates are to rates predicted in the optimal income taxation theory.
3.5.1 General Discussion
The various labor supply elasticities are the key factor in determining the upper bounds of the optimal benefit-reduction rate and the optimal marginal tax rate on labor income. Labor supply elasticities vary across both countries and individuals. Typically, female labor supply is more elastic than male labor supply. In addition, welfare recipient and welfare non-recipient labor supply elasticities likely differ. Whether the elasticities of welfare recipients are more or less elastic than those of welfare non-recipients is an empirical question that depends on numerous circumstances.
Many studies have estimated labor supply elasticities, but no real consensus on their magnitude has emerged (Killingsworth [1983]). Typically, labor supply elasticities have been estimated for the entire labor force, or for male and female labor forces separately, but seldom for welfare and welfare non-recipients separately or for different income levels. In addition, elasticities typically measure how labor supplied is affected by changes in net wages; few studies have examined the effect of changes in alternative sources of incomes. It is very hard to find elasticities that measure, for example, how sensitive welfare recipient and welfare non-recipient labor supply decisions are to changes in benefit levels.
An estimate based on uncompensated labor supply elasticity for males and females, estimated by taking the median elasticities across a large number of studies from the 1970s and early 1980s and weighing together median male and female elasticities using relative male and female income shares as weights, is 0.10 (Hansson and Stuart [1985]). An estimate derived in the same way but based on relatively more recent and sophisticated estimates is 0.44 (Hansson and Stuart [1993]). If
, ti should be no greater than 0.69 given that condition (22) holds. If ti is equal to 0.69, then tb can not exceed 0.31.
Other studies of labor supply elasticities have found different results, however. In fact, Theeuwes [1988], in a survey of eight empirical studies, found that uncompensated elasticities in the Netherlands range from -0.25 to 0.27 for males (mean =0.07) and from 0.20 to 3.23 for females (mean =1.39). Blomquist and Hansson-Brusewitz [1990] found uncompensated elasticities for males in Sweden between 0.08 and 0.13, and elasticities for females between 0.1 and 0.79.
Since labor supply elasticities vary across both countries and individuals, as well as across different studies, we plot the upper bounds of the optimal benefit-reduction rate and marginal tax rate on labor income as functions of labor supply elasticities.
Figure A plots the upper bounds of the optimal benefit-reduction rate for welfare recipients as a function of labor supply elasticities of welfare recipients at
various optimal marginal tax rates on labor income when equation (15) is satisfied. The higher the optimal marginal tax rate on labor income, the lower is the upper
Figure A.
The Upper Bounds of the Optimal Benefit-Reduction Rate as a function of Labor Supply Elasticities for various Optimal Marginal Income Tax Rates on Labor Income when condition (15) holds

bound of the optimal benefit-reduction rate, and the more responsive
welfare recipients are to changes in net income, the lower is the upper bound of the optimal benefit-reduction rate. If, for example, the optimal marginal tax rate on labor income is 0.3 and the labor supply elasticity of welfare recipients is 0.4, then the upper bound of the optimal benefit-reduction rate is 0.41. If the labor supply elasticity of welfare recipients is 0.8 instead and the optimal marginal tax rate on labor income is 0.3, as above, then the upper bound of the optimal benefit-reduction rate is lowered to 0.26. The magnitude of the labor supply elasticity is, thus, crucial for the upper bound of the optimal benefit-reduction rate.From Figure A, it is apparent that the optimal benefit-reduction rate can be negative. At what point the upper bound of the optimal benefit-reduction rate is negative depends on the labor supply elasticity as well as the optimal marginal tax rate on labor income.
Figure B.
The Upper Bounds of the Optimal Benefit-Reduction Rate as a function of Labor Supply Elasticities for various Optimal Marginal Income Tax Rates on Labor Income when condition (15) does not hold

In Figure B, the upper bounds of the optimal benefit-reduction rate are plotted when equation (15) does not hold. Under this scenario, the upper bound of the optimal benefit-reduction rate depends on the labor supply elasticity with respect to changes in the benefit-reduction rate of welfare non-recipients as well as the optimal marginal tax rate on labor income. The more sensitive welfare non-recipients are in their labor supply to changes in the benefit-reduction rate, the higher is the upper bound of the optimal benefit-reduction rate, and the higher the optimal marginal tax rate on labor income, the lower is the upper bound of the optimal benefit-reduction rate. The reason that the upper bounds of the optimal benefit-reduction rate depend positively on the magnitude of the labor supply elasticity with respect to a change in the benefit-reduction rate is that the amount of labor a welfare non-recipient supplies depends positively on the benefit-reduction rate.
Figure C.
The Upper Bound of the Optimal Marginal Tax Rate on Labor Income as a function of Labor Supply Elasticities when condition (22) holds

Figure C plots the upper bounds of the optimal marginal tax rate on labor income as a function of labor supply elasticities of welfare non-recipients when equation (22) holds. The upper bound of the optimal marginal tax rate on labor income depends solely on welfare non-recipients' labor supply elasticity. The higher the elasticity, the lower is the optimal marginal tax rate on labor income. For example, if the labor supply elasticity of welfare non-recipients is 0.4, then the upper bound of the optimal marginal tax rate on labor income is 0.71.
Figure D.
The Upper Bounds of the Optimal Marginal Tax Rate on Labor Income as a function of Labor Supply Elasticities when condition (22) does not hold

Finally, the upper bounds of the optimal marginal tax rate on labor income are plotted in Figure D for the case when equation (22) does not hold. Note that the
upper bounds of the optimal marginal tax rate depend on labor supply elasticities with respect to changes in labor income of welfare recipients. The upper bounds are higher when the optimal benefit-reduction rate is negative. For example, if we assume that the labor supply elasticities of welfare recipients and welfare non-recipients with respect to net labor income are the same, 0.4, then the upper bound of the marginal tax rate on labor income when tb is positive is 0.71. The upper bound is 0.72 when tb = -0.01 and larger when tb is more negative.
3.5.2 Application
At this point, it is useful to ask how close actual prevailing tax rates are to rates supported by the optimal income taxation theory? We know that marginal effective tax rates (the sum of the marginal tax rate on labor income and the benefit-reduction rate) sometimes exceed 100 percent (e.g., Keane [1995], and Hansson [1998]. Actual benefit-reduction rates are hard to measure; though studies generally conclude that they are high (Fraker et al.[1985], Dickert et al.[1995], Hansson [1998]). Actual marginal tax rates on labor income are easier to estimate. Table 1 reports average marginal tax rates on labor income for the "average production worker'' in 22 OECD
countries over the period 1972-1992. These marginal tax rates are inclusive in the sense that they measure how much consumption a taxpayer obtains from a unit increase in labor income.Marginal tax rates vary widely between countries. If actual marginal tax rates reflect optimal conditions, the large variation in marginal tax rates on labor income
across countries may suggest a wide range in labor supply elasticities across the various countries. For instance, it may be the case that taxpayers in Sweden, a high tax country, are less elastic in their labor supply than taxpayers in the U.S., a lower tax country.
Since the optimal benefit-reduction rate also depends on the optimal marginal tax rate on labor income, restrictions on the bounds of the optimal benefit-reduction rate can be obtained if the optimal marginal tax rates on labor income are known. In Sweden, the actual marginal tax rate on labor income for the average production
Table 1.
Inclusive Marginal Tax Rates on Labor Income

worker over 1972-1992 was 0.73 (Hansson and Stuart [1998]). (Again, remember that actual tax rates may be far from optimal tax rates). An optimal marginal tax rate of 0.73 corresponds relatively closely to the lowest of the bounds in Figure A and Figure B, which implies that the upper bound of the optimal benefit-reduction rate is fairly low regardless of welfare recipients' and welfare non-recipients' labor supply elasticities with respect to changes in the benefit-reduction rate. Clearly, the actual benefit-reduction rate of 100 percent in Sweden (Socialdepartementet [1999]) is not supported in this optimal income-tax model.
The U.S., on the other hand, is a low-tax country where the average production worker faces a marginal tax rate of 0.37 (Hansson and Stuart [1998]). An optimal marginal tax rate of 0.37 implies that the upper bound of the benefit-reduction rate is higher than in Sweden, ceteris paribus.
The average production worker in the U.K. faces a marginal tax rate of 0.52 (Hansson and Stuart [1998]), a tax rate that lies between what an average production worker faces in Sweden and the U.S. An optimal marginal tax rate of 0.52 corresponds to the second lowest bound in Figure A and Figure B. Thus, the upper bound of the optimal benefit-reduction rate would lie between the optimal benefit-reduction rates in Sweden and the U.S. if the labor supply elasticity for welfare and welfare non-recipients were the same in the three countries.
3.5.3 Limitations
Several limitations to the approach used in this paper should be acknowledged. First, a linear income tax does not allow for different marginal income tax rates. In many countries, actual income tax schedules are characterized by increasing marginal tax rates. Often, there are two or more income brackets. Similarly, benefit-reduction rates may also be non-linear. The benefit-reduction rate likely depends on income, and allowing for non-linearity in the benefit-reduction rate may allow us to develop a more realistic structure.
A further limitation of this paper is that we do not account for the dependence of the benefit-reduction rate on wealth and unearned income. In reality, the benefit-reduction rate and the amount of welfare payments an individual receives is influenced by wealth and unearned income as well as earned
income. The actual number of individuals that receive welfare payments yet possess wealth and receive unearned income is probably small, so this shortcoming may not be serious.
3.6 Conclusions
We derive upper bounds of the optimal benefit-reduction rate using a modified version of Sheshinski's [1972] model of optimal income taxation. Specifically, we add a term for the rate at which welfare recipients lose welfare when their gross earned income increases by a unit. We find that the optimal income-tax problem is characterized by a guarantee level at zero income that is reduced at the benefit-reduction rate when welfare recipients earn income. The upper bound of the benefit-reduction rate depends negatively on the magnitude of the optimal marginal tax rate on labor income, and either negatively on the average labor supply elasticity with respect to changes in net income of welfare recipients or positively on the average labor supply elasticity with respect to changes in the benefit-reduction rate of welfare non-recipients. The optimal benefit-reduction rate can be negative, in which case welfare recipients should be given a subsidy for working. The optimal marginal tax rate on labor income is positive and depends on the sign of the optimal benefit-reduction rate and on the average elasticities of labor supply with respect to changes in labor income of welfare and welfare non-recipients.
Our results suggest that rates in some countries exceed the bounds derived in the optimal income taxation theory. With a benefit-reduction rate of 100 percent and an inclusive marginal tax rate on labor income around 70 percent welfare recipients in Sweden lack economic incentives to break their welfare dependency. These rates are clearly in excess of optimal bounds.
We consider estimation of bounds on the optimal benefit-reduction rate to be a positive step at a critical moment in policy formation globally. At the same time, we hope it will spawn future modeling efforts to improve correspondence with reality. For example, the model used in this paper is admittedly crude and would likely benefit from relaxing the linearity restriction and by including capital.
References
Atkinson, A.B., 1973, "Maxmin and the Optimal Income Taxation," Discussion Paper no. 47, Department of Economics, University of Essex.
Blank, R., 1997, "Policy Watch: The Welfare Reform," Journal of Economic Perspective, 11(1), 169-177.
Blomquist, S. and U. Hansson-Brusewitz, 1990, "The Effect of Taxes on Male and Female Labor Supply in Sweden," Journal of Human Resources, 25, 317-57.
Dickert, S., Houser, S. and K. Scholz, 1995, "The Earned Income Tax Credit and Transfer Programs: A Study of Labor Market and Program Participation," Tax Policy and the Economy, NBER, 19, 1-50.
Edgeworth, F.,1897, "The Pure Theory of Taxation," Economic Journal, 7, 46-70, and 226-238.
Fraker, T., Moffitt, R and D. Wolf, 1985, "Effective Tax Rates and Guarantees in the AFDC Programs, 1967-1982," Journal of Human Resources, 20(2), 251-263.
Greenberg, D., Michalopoulos, C., Robins, P., and R. Wood,1995, "Making Work Pay for Welfare Recipients," Contemporary Economic Policy, 13(3), 39-52.
Gueron, J., 1996, "A Research Context for Welfare Reform," Journal of Policy Analysis and Management, 15(4), 547-561.
Hansson, I. and C. Stuart, 1985, "Tax Revenue and the Marginal Cost of Public Funds in Sweden," Journal of Public Economics, 27, 331-53.
—, 1993, "The Effects of Taxes on Aggregate Labor: A Cross Country Genreral Equilibrium Study," Scandinavian Journal of Economics, 95, 311-326.
Hansson, AA. and C. Stuart, 1998, "Measurement of Transfers and Peaking of Fiscal Sizes of Government," Manuscript, University of California, Santa Barbara.
Hansson, AA., 1998, "How High are Marginal Effective Tax Rates in the U.S.?," Manuscript, Lund University.
Haveman, R. and K. Scholz, 1994, "Transfers, Taxes, and Welfare Reform," National Tax Journal, 47(2), 417-434.
Kanbur, R., Keen, M., and M. Tuomala, 1994, "Optimal Non-Linear Income Taxation for the Alleviation of Income-Poverty," European Economic Review, 38(8), 1613-1632.
Keane, M.,1995, "A New Idea for Welfare Reform," Federal Reserve Bank of Minneapolis Quarterly Review, 19(2), 2-28.
Killingsworth, M., 1983, Labor Supply, Cambridge: Cambridge University Press.
Lurie, I., 1996, "A Lesson from the JOBS Program: Reforming Welfare Must be Both Dazzling and Dull," Journal of Policy and Management, 15(4), 572-586.
Mirrlees, J.,1971, "An Exploration in the Theory of Optimal Income Taxation," Review of Economic Studies, 38(1), 175-208, 1971.
Moffitt, R., 1991, "Incentives Effects of the US Welfare System: A Review," Journal of Economic Literature, 30(1), 1-61.
Pigou, C.A., 1947, "A Study in Public Finance," MacMillan, 3rd edition, London.
Sheshiniski, E., 1972, "The Optimal Linear Income-Tax," Review of Economic Studies, 39(3), 297-302.
Socialdepartementet, 1999, "Samverkan f"or f"{a}rre bidrag och fler i jobb," Ds1999:54.
Stern, N. H.,1976, "On the Specification of Models of Optimal Income Taxation," Public Economics, 6, 123-162.
Theeuwes, J., 1988, "Arbeid en Belastingen," Belastingheffing en belastinghervorming, Leiden: Stenfert Kroese, 111-43.
Tuomala, M., 1984, "On the Optimal Income Taxation: Some Further Numerical Results," Journal of Public Economy, 23, 351-366.
—, 1987, "Studies in the Theory of Optimal Income Taxation, Helsinki
Appendix
To obtain the derivatives of the supply function with respect to T0, a , and b as an expression of derivatives of the utility function, we differentiate the first-order conditions. Namely,
. (A1)
Leisure is assumed to be a normal good; that is,
(A2)
The higher the guarantee level an individual receives, the more leisure the individual will consume. The assumption that leisure is a normal good requires that
(A3)
From this it follows that the labor supplied depends negatively on the guarantee level, that is
(A4)
We also assume that leisure (labor supply) is a decreasing (increasing) function of the net wage rate; that is
(A5)
or
(A6)
Accordingly,
when nL < B, (A7)
and
when nL ³
B. (A8)
Intuitively, when welfare recipients are allowed to keep more of their earned income, welfare non-recipients have fewer incentives to work.
Chapter 4
Government Size and Growth: An Empirical Study of 21 OECD Countries
4.1 Introduction
Government grew dramatically in size during the 20th century in nearly all developed countries. In Sweden, the U.K., and the U.S., for example, general government expenditures grew from 10, 13, and 8 percent of GDP, respectively, in 1913 to 66, 43, and 33 percent in 1995 (Tanzi and Schuknecht [1997(a)]). This growth in government expenditures was nearly uninterrupted for a century, with the exception of the post- World War II period in Germany and Japan and a period in the 1980s in several OECD countries. The growth in government size was remarkably rapid during the 1960s, 1970s, and 1980s.
As the large share of developed economies currently commanded by the public sector may have important consequences for economic performance, this buildup in government activity has alarmed many observers (Lindbeck [1986], Tanzi and Schuknecht [1997(b)]). For instance, a large government sector may induce inefficiencies in the economic system due to various disincentive effects, crowd out the private sector, and impoverish the economy in the long run.
Alternatively, a large government sector could be a catalyst for economic prosperity, particularly if most activity occurs in areas thought especially likely to promote economic activity. For instance, the provision of a legal framework and well-defined property rights is critical to economic development, as are law and order and an efficient infrastructure (North [1990], Hall and Jones [1997], Tornell and Lane [1999]).
Not surprisingly, the relationship between large government and economic performance (commonly measured as the growth rate) has been widely studied (Levine and Renelt [1992], Durlauf and Quah [1998], Temple [1999]). Despite widespread interest, however, researchers have to date been generally unsuccessful in attempts to settle the issue (Slemrod [1995], Temple [1999]). Some studies have found evidence of a negative relationship between the size of government and the growth rate (e.g., Landau [1983], Barro [1989], Barro[1991]) and others evidence of a positive relationship (e.g., Ram [1986] Devarajan et al. [1996]), though many studies have been unable to demonstrate a statistically significant correlation between the two variables (e.g., Kormendi and Meguire [1985], Levine and Renelt [1992], Easterly and Rebelo[1993]).
Differences in the data and statistical approaches used can explain much of the heterogeneity of results in this branch of the economic growth literature. For instance, even the studied countries vary widely among previous studies, ranging from just developed nations (e.g., Barth and Bradley [1987], Fölster and Henrekson [1999]) to both developed and developing nations (e.g., Kormendi and Meguire [1985], Barro [1991], Easterly and Rebelo [1993]) to only developing nations (e.g., Devarajan et al. [1996] Dowrick [1996]). Interestingly, there is more often evidence of a negative relationship between the size of government and the growth rate in wealthy countries and more often evidence of a positive relationship in poor countries (Landau [1983], Ram [1986], Grier and Tullock [1987]).
Similarly, there is little agreement among the measures of government size used in previous studies. Many studies have used government consumption expenditures as a share of gross domestic product (GDP) to measure government size (e.g., Landau [1983], Ram [1986] Karras [1997]), though a number of them have adjusted the values in various ways to improve the fit. For instance, Barro [1991] and Easterly and Rebello [1993] net out expenditures on defense and education as these relate more to public investment than public consumption. Kormendi and Meguire [1985] and Grier and Tullock [1987], in contrast, use the growth in total government expenditures, though they exclude fixed capital formation and transfer payments. Nevertheless, many of these studies suggest that the total government impact (including regulations) would be a better measure (Landau [1983], Kormendi and Meguire [1985], Grier and Tullock [1987]), though they use less inclusive measures for lack of data.
There is also substantial heterogeneity in the independent control variables included alongside government size, which is not surprising given that theory offers no clear guidelines as to what factors influence economic growth (Agell et al. [1997]). Levine and Renelt [1992] suspected that these differences in the conditioning variables explained much of the conflict seen among results in the economic growth literature, and conducted extensive sensitivity testing using extreme bounds analysis (EBA). Specifically, by systematically altering the set of conditioning variables and observing changes in the resulting coefficient estimates, they found that many fiscal policy indications - either individually or in groups - were correlated with growth but that the relationship for any particular indicator or group of indicators was sensitive to the set of included variables. Sala-i-Martin [1997] criticized this approach as too extreme and re-examined the data by looking at the entire distribution of coefficient estimates rather than just the two extreme bounds. Nevertheless, he also failed to find a measure of government spending that was consistently statistically significant.
While statistical modeling is undeniably important, there is too little heterogeneity in the approaches used in this literature to explain the diversity of results. Indeed, most empirical studies of this relationship have used cross-sectional estimators to estimate the relationship, applying them either to cross-section data sets (one observation per country) or to repeated cross-section (panel) data sets (multiple observations per country over time), with mixed results. A smaller number of studies have estimated the relationship using time series methods for individual countries (Grossman [1988], Aschauer [1989], and Peden [1991]) with likewise mixed results. Though these studies have the advantage that more than one observation per country is used, time series estimation is known to be especially sensitive to model specification issues (Leamer [1974], Leamer [1978]).
Several recent papers cast doubt on the suitability of cross-sectional estimators for estimating this relationship, however. Dowrick [1996], for instance, found that the positive relationship between government and GDP growth found in such studies as Ram [1986] could be attributed to endogeneity. Controlling for time-invariant factors with fixed effects and reverse causality by using lagged endogenous variables as instruments and two-stage least square estimation, he found that the significantly positive relationship, found by Ram [1986], became insignificant. Devarajan et al. [1996] similarly found that government expenditures changed from being a positive to a negative determinant of economic growth when country-specific factors were taken into account, though the coefficient did not attain statistical significance in either specification. Finally, Fölster and Henrekson [1999] find that the relationship between growth and the public sector is more robustly negative when using fixed effect estimation than cross-sectional estimation.
In this paper, we attempt to shed light on the relationship in industrialized countries between government size and economic performance as measured by the growth rate. Specifically, we estimate the relationship using an econometric approach that controls for country- and time-specific fixed factors and a unique measure of government size that more closely captures the true impact of government. Moreover, we additionally estimate the relationship using individual government components, for instance transfers and public goods, both separately and together to examine whether particular government activities have more favorable or less favorable consequences for economic growth.
We also conduct sensitivity testing of the relationship to determine the robustness of model results. For instance, we estimate the model using two cross-section estimators in order to evaluate the importance of the estimators that have been used extensively in previous analyses. Moreover, we also estimate the model using conventional government consumption expenditures as a share of GDP as a measure of government size.
We find that government size is a negative and statistically significant determinant of the economic growth rate. Moreover, the magnitude implies that a 1 percentage point increase in government size leads to a 0.23 percentage point decline in the growth rate, which translates to an elasticity of -2.4 at the means. In contrast, both cross-section estimates are statistically insignificant and considerably smaller. We also find substantial differences in the contributions of different types of government expenditures. For instance, the coefficients for pecuniary transfers - that is spending on education, health, housing and community amenities - and interest payments are negative and statistically significant. Cash transfers are also negatively correlated with the growth rate, but the coefficient is statistically insignificant. In contrast, public goods are positively related to growth. Finally, replacing our measure of government size with the more conventional government consumption measure resulted in virtually unchanged coefficient estimates, but substantially reduced statistical significance.
The paper is organized as follows. Section 4.2 describes the data, focusing especially on our measure of government size. Section 4.3 presents our estimation strategy. Section 4.4 contains the main results and section 4.5 follows up with results from the sensitivity analysis. Finally, section 4.6 concludes the paper.
4.2 Data
We have compiled data for 21 industrialized countries for the period 1973 to 1992 based in large part on data from the OECD and Summers and Heston's Penn World Table (PWT 5.6). See Table A in the appendix. Specifically, we include each of the OECD member countries that reported information over this time period, except for Greece and Iceland which were excluded due to frequently missing data. We divide the data into four five-year periods (cross-sections), so there are 84 observations in total. Relative to much of the existing literature, our data offers two important advantages. First, by limiting the data to the industrialized OECD countries, we can reduce the problems of trying to fit a model to data from unduly dissimilar nations (Grier and Tullock [1987], Devarajan et al. [1996]). Moreover, limiting the data to OECD members increases the comparability and reliability of the data as, while the data are far from perfect, the OECD has gone through great pains to standardize definitions and encourage accuracy in collection. Second, the panel nature of the data enables us to model unobservable time- and country-specific fixed factors, for instance national culture. National culture has been found to be an important determinant for the growth rate (Clark [1987], Abrams and Lewis [1997]. Culture is, however, often omitted in growth regressions because it is hard to observe and measure.
All financial variables are measured in constant U.S. dollars. We define economic growth as the average of the annual per capita GDP growth rates over each five-year interval. While one can measure economic growth in numerous ways, this specification is one of the most widely used and thus offers better comparability with the results of other studies. To assess the sensitivity to this measure, however, we also estimate the model using economic growth defined as the growth in per capita GDP over the five-year interval.
In contrast to previous studies, most of which have used government consumption to proxy for government size, our measure includes all government expenditures plus interest on pre-existing debt. Interest payments are a fairly large component of government spending. This more inclusive measure, thus, better reflects the scope of government activity, and hence may better model the potential overall influence on economic growth (Landau [1983] and Kormendi and Meguire [1985]). Interest payments may crowd out other public consumption, not to mention private investment through changes in the interest rate.
We define government size (S) as government spending on transfers (T), spending on public goods (G), and the interest on pre-existing debt (r b). Specifically,
(1)
We consider the effect of individual spending categories separately in the second part of the analysis, so it is necessary to point out that we depart from convention in defining these categories. In much of the literature, spending categories have been defined based on the anticipated effect of the expenditures on economic growth, i.e., negative or positive. In contrast, we define spending categories based on the attributes of the expenditures themselves. Specifically, we treat general public services, defense, public order and safety, and economic services as public goods (G) because the marginal cost of another consumer is zero (or nearly zero) and it is costly to exclude consumers. We include spending on education, health, and housing and community amenities under pecuniary transfers (T#) because they can easily be provided privately, and in many countries they are. Finally, we consider spending on social security and welfare expenditures to be cash transfers (T$), so that total transfers can be written as:
(2)
Two areas fit poorly into the above categories: recreational, cultural, and religious affairs and the residual "other". Both contain a mixture of public goods and pecuniary transfers. We divide these categories equally between G and T#.
The data on public good expenditures and transfers comes predominantly from OECD data (OECD [1991]), though it reflects International Monetary Fund (IMF) and United Nations (UN) data in some cases. Government interest payments (r b) equal the nominal interest paid minus interest received for consolidated government for each country except Japan and New Zealand, for which net-interest reflects only the budget of the central government. The interest payments come primarily from U.N. data, though it was necessary to use IMF data in a couple of cases.
The other regressors are initial income (i.e., income at the start of each 5-year period), population growth over the interval, average investment as a share of GDP over the five years, and average number of total years of schooling of the total population. Some comments are in order. First, data on real GDP per capita and investment for Germany, Portugal, and New Zealand were missing for the last two years, so data from OECD [1992] were used instead. In addition, data on population for the Netherlands for the whole period, and for the last two years for Germany, Portugal, and New Zealand are from OECD [1991]. Second, the education data are based on quinquennial educational attainment from 1960 to 1985 as reported by Barro and Lee [1993]. We extrapolated figures for 1990 using linear regression on the data from 1960 to 1985. Education data was absent for Luxembourg, so we estimated it as a linear combination of the values seen for Belgium, Germany, and the Netherlands. While our estimation may seem crude, the education data did not influence model results noticeably and, hence, the crudeness of these estimations is most likely negligible.
4.3 Estimation
We estimate the effect of government size on economic growth using fixed effects regression, an approach capable of accounting for many of the unobservable factors that may be confounded with the functioning of government. By controlling explicitly for country- and time-specific fixed factors, this approach may mitigate omitted variable bias due to such unobservable and poorly measurable quantities as national culture (Clark [1987], Abrams and Lewis [1995]), legal-political institutional infrastructure (North [1990]), government efficiency, and the degree of urbanization. To the extent that these factors are correlated with government size, the OLS and OLS-type estimators used so widely in this literature will mistakenly allocate the effect of these omitted factors to government size (as well as perhaps to some of the other included regressors).
To illustrate, consider the example of urbanization. Urbanization
is likely to be a positive determinant of the growth rate, but it is difficult to measure and thus routinely left out of economic growth regressions. Likewise, urbanization may also be positively related to government size. A regression that fails to control for urbanization will allocate the positive effects to government size because the two factors are coincidentally correlated, thus overstating the potential benefits (or understating the potential disadvantages) of big government. To make the illustration concrete, consider a comparison of government size and economic growth in Belgium and Spain. On the surface, large government and fast growth in Belgium and smaller government and slower growth in Spain suggest a positive relationship between government size and economic growth. However, such a conclusion could be confounded by the degree of urbanization, which is also higher in Belgium. That is, government size may only appear to be a positive determinant in this simple example because of its correlation with urbanization. More generally, other examples of important potentially confounding factors include various cultural, geographical, and political factors. Specifically, the legal-political institutional infrastructure is an important factor behind the growth rate (North [1990], Hall and Jones [1997], and Tornell and Lane [1999]). Unfortunately institutional infrastructure is often unobservable and therefore hard to include in regressions.The fixed effects estimator may remedy the problem of omitted variable bias. It exploits the repetition across observational units in panel data to separate the effect of persistent unmeasurable factors from the relationship of interest. Specifically, a separate dummy variable is included for each observational unit so, in consequence, only within movements in the dependent variable and the regressors are used to estimate the parameters. If the unobservables that are correlated with government size do not vary over time, the fixed effects estimator solves the omitted variable problem. For instance, if factors reflecting institutional infrastructure are omitted, the fixed effects estimates will be unbiased as long as these institutions are constant over time. Since it takes a long time for institutions to change, this may not be a totally unrealistic assumption.
Our regression model can be written as:
(3)
The dependent variable, git, denotes the average 5-year per capita growth rate for country i at time period t. Xit is a vector of measures of government size that in various regressions includes such variables as total government size, transfers, public goods spending, and interest payments on debt. Zit is a vector of explanatory variables including the variables investment, initial income, annual average population growth, and human capital.
The m i terms are fixed country effects (i.e., unmeasured shocks). These terms account for time-invariant determinants of economic growth that vary among the countries in our sample. If m i were correlated with Xit in equation (3), then estimators that failed to include the country-specific fixed effects would yield inconsistent estimates of the effect of government size on economic growth.
The d t terms are sample-wide period effects. These terms account for trends that affect the economic growth in each of the countries similarly, such as business cycles and the oil shocks in the 1970s. Because government size increased during this period, a model that failed to account for such trends would confound those trends with the effects of increased government size.
The terms b and g are parameters to be estimated. The e it terms are idiosyncratic disturbance terms that vary by country and time period, and are assumed to be independently and identically distributed with mean zero and variance s e 2. t indexes each 5-year time period.
To illustrate the effect that the choice of econometric approach has on the results, we also report estimates based on two cross-sectional estimators chosen to correspond generally with the results of the majority of studies in the literature. Specifically, we estimate the model using the full panel data but no country-specific fixed effects (time dummies were included, however). We refer to this as the pooled cross-section estimator. We also estimate the model with OLS using a modified data set consisting of country-specific averages of the data over the four time periods. We refer to this as the between estimator. Neither the pooled cross-section estimator nor the between estimator accounts for country-specific unobserved factors that are correlated with both economic growth and government size, and hence both are susceptible to omitted variable bias.
We also estimate the basic model using random effects estimation. In contrast to the fixed effects estimator described above, the random effects model does not address the issue of omitted variable bias (i.e., correlation between the disturbance term and the included regressors). It is a generalized least squares (GLS) estimator, however, and hence is efficient in its class. If country- and time-specific fixed factors are statistically unwarranted, the random effects specification will provide a better opportunity to identify the true relationship. We use Hausman’s test (Hausman [1978]) to discriminate between the two econometric estimators.
We include a final set of regressions to assess the goodness of fit of our fixed effects estimates. Specifically, there is no reason a priori to expect a linear relationship between government size and economic growth, so we consider a quadratic specification for government size, one that would allow for a U-shaped or inverted U-shaped relationship, as well as a cubic and logarithmic specification. Neither do we have a strong reason to believe that the relationship is additive; i.e., that the growth rate should be measured in levels. We, thus, estimate the model first using a log-linear specification and subsequently based on a Box-Cox transformation of the dependent variable (see Greene [1993], chapter 11). Unfortunately, since growth rates are not restricted to being positive, we lose 8 observations because the Box-Cox transformation is not defined for negative values. We then re-estimated the fixed effects model using the restricted sample, but the results changed little.
Studies on government size and growth may suffer from several other statistical problems as well. One of them is the endogeneity problem. Several of the explanatory variables, like government size and investment, may both influence the growth rate and be influenced by it. To deal with this one wants to find instruments that are correlated with the endogenous variable but not with the disturbance term. The studies in the literature that have accounted for this have typically used lagged variables of the endogenous variables as instruments. As already mentioned, Dowrick [1996] showed – using 2SLS and lagged variables on government size, investment, and prices as instruments – that the correlation between government size and growth is caused by reverse causality. Before one draws too many conclusions from this result, however, one needs to evaluate the quality of the instruments. We find in our data that lagged investment and lagged government size are poor instruments for investment and government size, and hence 2SLS estimates using lagged investment and lagged government size are fruitless.
Table 1 reports the results of three different regression models of the relationship between economic growth and government size. The first column reports the results of fixed effects regression. The second and third columns report the results of pooled cross-section and cross-section regression, respectively. Statistical significance refers to the 5 percent significance level unless otherwise stated.
All variables included in the fixed effects model, besides government size, have the expected signs. Initial income is highly significant and negatively correlated with the growth rate, which gives support for the convergence hypothesis. Investment is positively correlated with economic growth, though statistical significance is not attained even at the 10 percent level. Population growth is negatively and significantly correlated with the growth rate and
Table 1.
Fixed Effects, pooled cross-section, and between regression
results of annual average growth rate on total size of government
|
Fixed Effects |
Pooled Cross Section |
Between |
|
|
Constant |
14.6 (4.28) |
3.79 (2.10) |
3.02 (1.57) |
|
Initial Income |
-0.0011 (-4.45) |
-0.00023 (-2.03) |
-0.00015 (-1.14) |
|
Investment |
0.087 (1.56) |
0.033 (0.81) |
0.033 (0.79) |
|
Education |
0.013 (0.066) |
0.054 (0.40) |
-0.033 (-0.21) |
|
Population |
-0.539 (-3.98) |
-0.374 (-2.91) |
-0.171 (-0.47) |
|
Total Size of Government |
-0.228 (-2.76) |
-0.028 (-0.75) |
-0.0021 (-0.041) |
|
Number of observations |
84 |
84 |
21 |
|
R2 |
62.0 |
29.6 |
36.0 |
|
FCountry Dummies |
4.68 |
||
|
FTime Dummies |
13.0 |
7.54 |
Note: Numbers in parenthesis are t-statistics. All standard errors are
based on White’s heteroskedasticity-consistent covariance matrix.
explains a substantial part of the variation in growth rates. Education, however, is positively though insignificantly correlated with the growth rate. In addition, total size of government is negatively correlated with the growth rate and strongly significant. A one-percentage point increase in the size of government is associated with a decline of 0.23 percentage points in the annual average growth rate.
The pooled cross-section estimator yields quite different estimates. All variables have the same sign as in the fixed effects model but generally explain less of the variation in the growth rate and are less statistically significant. The only variables that are significant at a 5 percent significance level are initial income and population growth. Total government size is negatively though insignificantly correlated with the growth rate. The magnitude of the government size coefficient is only about one-tenth of that estimated with fixed effects.
The between estimates explain even less of the variation in the growth rate and are even less significant. In fact, none of the variables are statistically significant at even the 10 percent significance level. The government size coefficient remains negative but falls in absolute magnitude to only one-hundredth of that estimated with fixed effects, and is insignificant.
The results of the three estimation models differ importantly in not only the significance level but also in the magnitude of the coefficients, which suggests that omitted variables are present and the pooled cross-section and between estimators are biased. The coefficient for government size is more negative in the fixed effects model than in the other two estimation models, which suggests that important omitted factors are correlated with both the disturbance term and the regressors. These omitted factors either have positive coefficients and positive correlation with included variables or have negative coefficients and negative correlation. Suppose, for instance, that factors determining the efficiency of government activities are omitted. Then, if these factors are positively correlated with the size of government and have a positive impact on the growth rate, the government size coefficients in the between and pooled cross-section models will have the less negative magnitude observed. That the coefficient for government size in the between model is smaller in magnitude than in the pooled cross-section model suggests that time-specific effects may matter as well. Innovations over time that have a positive impact on growth and are positively correlated with government size can explain this.
Consistently, the F tests support the inclusion of the country-specific fixed effects, m i, in the regressions and hence support the conjecture that the growth rate varies in important unmeasured ways across countries. With a critical value of about 1.97 at 95 percent confidence, the test statistic for including country fixed effects (given that time-specific fixed effects are included) is 4.68. Likewise, the F tests support the inclusion of time-specific fixed effects, d t, in the regression (given that country fixed effects are included), which supports the conjecture that the growth rate varies in important unmeasured ways across time. With a critical value of about 2.79 at 95 percent confidence, the test statistic is 13.0 in the fixed effects model. In the pooled cross-section regression the test statistic is 7.54 and the critical value 3.10 at a 95 percent confidence level.
We also estimate the same model using a random effects estimation technique. Using Hausman’s specification test, however, we find evidence to strongly reject the random effects specification both for country- and time- specific effects. The test statistic has a value of 18.58 and a p-value of 0.0023.
In addition, we test for the existence of other non-linear relationships between government size and its various components and the growth rate, using the fixed effects specification. We do not, however, find much support for non-linear relationships between the total size of government and the growth rate, nor between the various components of government spending and the growth rate. We test for a quadratic, cubic and logarithmic relationship between both total government size and its components and the growth rate. None of these relationships are found to be significant, however.
Since we have no strong prior indication regarding the correct functional form of the model, we re-estimated the model using first a log-linear specification and then a Box-Cox specification. As mentioned above, the growth rate is not constrained to be positive so we lose 8 observations. To provide a benchmark, we re-estimated the fixed effects model using the restricted sample. Table 2 shows these results for the linear, log-linear, and Box-Cox specification models.
Table 2.
Regression results from linear, log-linear and Box-Cox specification
|
Linear |
Log-linear |
Box-Cox |
|
|
Constant |
14.7 (3.46) |
6.22 (2.69) |
10.58 (3.12) |
|
Initial Income |
-0.0012 (-4.27) |
-0.00066 (-3.78) |
-0.0009 (-4.00) |
|
Investment |
0.103 (1.83) |
0.063 (1.98) |
0.085 (1.87) |
|
Education |
0.031 (0.17) |
0.11 (0.69) |
0.060 (0.301) |
|
Population |
-0.19 (-0.31) |
-0.268 (-0.70) |
-0.167 (-2.37) |
|
Total Size of Government |
-0.235 (-2.74) |
-0.096 (-2.04) |
-0.170 (-2.37) |
|
Number of observations |
76 |
76 |
76 |
|
R2 |
59.3 |
54.9 |
59.4 |
|
l l =0, c 2=36.11 (0.000) l =1, c 2=5.27 (0.0216 |
Note that the number of observation is 76 because observations with negative
growth rates were excluded.
The results are generally similar in the three models and government size is negatively and significantly correlated with the growth rate in each specification. l is the dependent variable in the Box-Cox specification estimated by grid search. l = 1 corresponds to a linear specification, whereas l = 0 corresponds to the log-linear specification. The test for log-linear specification is soundly rejected, and the linear specification is also rejected at a 5 percent significance level. Nevertheless, the Box-Cox estimates do not support generally different conclusions from those of linear specification.
Finally, estimating the model with 2SLS and lagged government size as instrument alters the coefficient from –0.259 to –0.322 and the t-statistic from –2.36 to –1.58. The result is still negative though less significant. The validity of this estimation, however, depends crucially on how good an instrument lagged government size is. We abstain from drawing any further conclusions from this result, however, due to the dubious validity of the instrument.
Spending Components
It seems reasonable to expect that the various components of government spending may affect the growth rate to different degrees and in different directions. We test for this by dividing government size into its components. These components are then included in the base line fixed effects regression. The components are included first all together and then one at a time. Again, we use Hausman's test to discriminate between the fixed- and random effects estimates. Table 3 reports results from these regressions.
Including all the components together or one at a time changes the results only slightly. The magnitude of the coefficients and the significance level differ little and the R2 values are slightly lower when components are included one at a time. All variables have the expected sign, though not all are statistically significant. The only component that is positively correlated with
the growth rate is public good spending (regressions 1, 2, and 4). Because it is
insignificant, it is hard to draw any conclusions from this result, but it may
Table 3.
Regression results from fixed effects regressions of annual average growth rate on different components of government size
|
1 |
2 |
3 |
4 |
5 |
6 |
7 |
|
|
Constant |
11.6 (2.51) |
12.4 (2.70) |
10.43 (2.89) |
7.51 (1.53) |
10.57 (3.45) |
11.37 (3.27) |
9.08 (2.56) |
|
Initial Income |
-0.001 (-4.78) |
-0.001 (-5.21) |
-0.001 (-3.59) |
-0.001 (-3.46) |
-0.001 (-4.33) |
-0.001 (-3.95) |
-0.0009 (-3.46) |
|
Investment |
0.094 (1.57) |
0.091 (1.54) |
0.079 (1.33) |
0.088 (1.47) |
0.101 (1.72) |
0.077 (1.35) |
0.080 (1.34) |
|
Education |
0.151 (0.79) |
0.179 (0.93) |
0.107 (0.51) |
0.139 (0.64) |
0.071 (0.36) |
0.146 (0.73) |
0.0893 (0.41) |
|
Population |
-0.518 (-3.72) |
-0.540 (-4.10) |
-0.413 (-2.95) |
-0.366 (-2.19) |
-0.490 (-3.85) |
-0.445 (-3.47) |
-0.379 (-2.62) |
|
Transfer |
-0.259 (-1.91) |
-0.176 (-1.35) |
|||||
|
Public Good |
0.192 (0.72) |
0.165 (0.63) |
0.131 (0.41) |
||||
|
Interest Payments |
-0.319 (-3.28) |
-0.308 (-3.26) |
-0.280 (-2.71) |
||||
|
Pecuniary Transfers |
-0.335 (-2.13) |
-0.301 (-1.97) |
|||||
|
Social Security and Welfare |
-0.132 (-0.25) |
-0.236 (-0.51) |
|||||
|
Number of obs. |
84 |
84 |
84 |
84 |
84 |
84 |
84 |
|
R2 |
64.2 |
65.5 |
58.5 |
57.5 |
61.6 |
60.1 |
57.4 |
|
Hausman (p-value) |
16.89 (0.018) |
17.85 (0.022) |
11.69 (0.039) |
10.13 (0.072) |
18.41 (0.002) |
13.0 (0.023) |
10.98 (0.052) |
Note: Numbers in parenthesis are t-statistics. All standard errors are based on White’s heteroskedasticity-consistent covariance matrix.
suggest that the provision of "true" public goods has a positive effect on the growth rate, at least up to some level.
Total transfers (regressions 1 and 3) are negatively correlated with the growth rate. In regression 1, total transfers are significant at 90 percent confidence, while in regression 3 total transfers are significant only at 80 percent confidence. When total transfers are broken up into pecuniary and cash transfers (regressions 2, 6, and 7), we find that pecuniary transfers seem to be significantly correlated with the growth rate
both statistically and economically. A one-percentage point increase in pecuniary transfers leads to a roughly 0.3 percentage point reduction in the growth rate. This suggests that there may be inefficiencies in having the government provide these private goods. Cash transfers also have a negative economic significance on the growth rate but are not statistically significant.Interest payments on pre-existing debt seem to discourage growth (regressions 1, 2, and 5). A one-percentage point increase in interest payments is associated with about a 0.3 percentage point decline in the growth rate. This could be explained by the crowding out effect that interest payments have on other more productive public spending, and by the negative impact large public deficits may have on private investment. This is consistent with the findings of Aschauer (1989). He finds that public-sector deficits may be an important determinant for the level of real interest rates, private investment decisions, and the economy’s dynamic performance.
In two cases, regressions 4 and 7, the Hausman test statistic of whether fixed or random effects estimation is preferable is just below the critical value at the 5 percent significance level, weakly suggesting that random effects specification may provide better estimates. The random effects estimation of regression 4 results in a larger and statistically significant estimate for public goods. The random effects coefficient is 0.225 and the t-statistic is 2.35. The random effects coefficient (-0.138) for cash transfers in regression 7 is less negative than the fixed effects coefficient (-0.236), though it is statistically more significant. The t-statistic is -1.15 instead of -0.51.
4.5 Multicollinearity and Other Sensitivity Analyses
Another issue in studies of government size and growth is multicollinearity. That is, some of the right-hand side variables may be overly correlated, for example initial income and the size of government. This is less serious than the question of endogeneity, however, since it is not a statistical problem that biases the estimates but rather an issue with the data. The problem arises because it is hard to discern where the effect is actually coming from (initial income or government size), and may result in statistically insignificant estimates even if the estimator provides unbiased estimates of the true effect. A remedy for this is to drop variables. The only variables that are not significant in our fixed effects model are investment and education. Dropping these two variables has no significant impact on the results.
Table 4.
Sensitivity analysis showing how the coefficient for government size and t-statistic change in response to alterations in estimation technique, conditional set of included variables, and data set
|
Regression |
Coefficient for government size (t-statistic) |
|
FE with inclusion of trade variable FE without trade variable |
-0.229 (-1.99) -0.256 (-2.18) |
|
FE excluding education variable FE including education variable |
-0.228 (-2.86) -0.288 (-2.76) |
|
FE excluding investment variable FE including investment variable |
-0.288 (-2.70) -0.228 (-2.76) |
|
FE excluding population growth variable FE including population growth variable |
-0.16 (-1.82) -0.228 (-2.76) |
|
FE excluding initial income variable FE including initial income variable |
-0.14 (-1.4) -0.228 (-2.76) |
|
FE with Summers and Heston’s government size variable FE with our government size variable |
-0.275 (-1.81) -0.258 (-3.15) |
To further test the robustness of our results, we alter the conditioning set of included variables. The result of this is presented in Table 4. Levine and Renelt [1992] show that only by selecting a very particular conditioning set of variables can a significant correlation between government consumption expenditures and the growth rate be identified in a linear regression model. Alterations to that conditioning set lead to an insignificant correlation. They find, for instance, that including a trade variable makes the correlation insignificant.
We alter the conditioning set of variables both by including new ones and by excluding some previously included. Contrary to Levine and Renelt [1992], we find that including a variable for trade (openness) does not change the results excessively. The coefficient for government size only changes from -0.256 to –0.229 and the t-statistic changes from –2.18 to –1.99 when a variable capturing trade volume is included. Excluding education and investment has no noticeable effect on the coefficients or on the t-statistics; the coefficients do
not change at all and the t-statistics change to –2.86 and –2.70, respectively. Excluding population growth and initial income alters the results slightly, however. When population growth is excluded the coefficient for government size is –0.16 with a t-statistic of –1.82, and when initial income is excluded the coefficient for government size is –0.14 and the t-statistic –1.4. These sets of numbers should be compared with a coefficient of –0.228 and a t-statistic of –2.76.Changes in the data set do not alter the result noticeably. For example, including the available years for Turkey (1975 to 1992) and excluding the last period for Germany (because of the unification) does not change the results much.
Using Summers and Heston’s data on the government consumption share instead of our measure of government size does change the results modestly. The coefficient changes from –0.258 to –0.275, and the t-statistic decreases in absolute magnitude from –3.15 to –1.81.
Changes in the growth rate measure have small impacts on the overall results. For instance, changing the growth rate variable to growth rate over 5 years instead of the average annual growth rate over a 5-year period does not change the results noticeably.
Using the growth in government size instead of the level results in a negative, while insignificant, correlation. A one-percentage point increase in the growth rate of the government size is associated with a 7.6 percent decrease in the growth rate (t- statistic of 1.50).
We have estimated the relationship between government size and economic growth in a number of OECD countries using an estimator that controls for omitted country- and time-specific factors. Consistent with Fölster and Henrekson [1999], we find a strong negative correlation between the size of government and the growth rate. When compared to a pair of cross-section estimates based on the same data set, the fixed effects results are statistically significant and substantially larger in absolute magnitude, suggesting that omitted variable bias may be an important problem. Indeed, this could help explain why the results of earlier studies are so diverse.
We have also estimated the model using data on individual government activities. We find that interest payments on pre-existing debt and the provision of pecuniary transfers are strongly negatively correlated with the growth rate. Social security and welfare payments are negatively correlated with the growth rate, though the coefficient is statistically insignificant. The provision of public goods, in contrast, has a positive, though insignificant in the fixed effects specification, impact on the growth rate.
The results of this study have practical implications for public policy; for example, they can provide some guidance on how public resources can best be spent. With increased unification in Europe, public policy may be one of the few remaining determinants of the growth rate that individual governments can modify.
Finally, we wish to point out that we make no claims that our results accurately reflect the true relationship between government size and growth. There are a number of serious econometric issues involved. Our results suggest we have overcome some of the problems common in earlier literature. However, many still remain. For example, studies have been generally unsuccessful in properly accounting for endogeneity. The instruments used have often been inappropriate, and the search to find more suitable instruments should continue. Moreover, extending the time dimension to pick up long-run effects may also be fruitful.
References
Abrams, B., and K. Lewis, 1995, "Cultural and Institutional Determinants of Economic Growth: A Cross-Section Analysis," Public Choice, 83, 273-289.
Agell, J., Lindh, T. and H. Ohlsson, 1997, "Growth and the Public Sector: A Critical Review Essay," European Journal of Political Economy, 13, 33-52.
Aschauer, D., 1989, "Is Public Expenditures Productive?," Journal of Monetary Economics, 23, 177-200.
Aschauer, D., 1990, "Finite Horizons, Intertemporal Substitution, and Fiscal Policy," Public Finance Quarterly, 18(1), 77-91.
Barro, R., 1991, "Economic Growth in a Cross- Section of Countries," Quarterly Journal of Economics, 106(2), 407-443.
Barro, R., and J. Lee, 1993, "International Comparisons of Educational Attainment," NBER Working Paper, No 4349.
Barth, J. and M. Bradley, 1987, The Impact of Government Spending on Economic Activity, Manuscript, George Washington University, Washington.
Blanchard, D., 1984, "Current and Anticipated Budget Deficits, Interest Rates, and Economic Activity," NBER Working Paper, No 1265.
Clark, G., 1987, "Why Isn’t the Whole World Developed? Lessons from the Cotton Mills," Journal of Economic History, 47, 141-173.
Devarajan, S., Swaroop, V. and H. Zou, 1996, "The Composition of Public Expenditure and Economic Growth," Journal of Monetary Economics, 37, 313-344.
Dowrick, S., 1996, "Estimating the Impact of Government Consumption on Growth: Growth accounting and Endogenous Growth Models, Empirical Economics, 21, 163-186.
Durlauf, S. and D. Quah, 1998, "The Empirics of Economic Growth," NBER Working Paper, No 6422.
Easterly, W. and S. Rebelo, 1993, "Fiscal Policy and Growth," Journal of Monetary Economics, 32, 417-458.
Feldstein, M., 1984, "Can an Increased Budget Deficit Be Contractionary?," NBER Working Paper, No 1434.
Fölster, S. and M. Henrekson, 1999, "Growth and the Public Sector: A Critique of the Critics," European Journal of Political Economy, 15, 337-358.
Greene, W., 1993, Econometric Analysis, Macmillan Publishing Company, New York.
Grier, K. and G. Tullock, 1987, "An Empirical Analysis of Cross-National Economic Growth," Journal of Monetary Economics, 24, 259-276.
Grossman, P., 1988, "Government and Economic Growth: A non-linear relationship," Public Choice, 56, 193-200.
Hall R. and C. Jones, 1997, "Levels of Economic Activity across Countries," American Economic Review, 87(2), 173-177.
Hausman, J., 1978, "Specification Tests in Econometrics," Econometrica, 46, 1251-1272.
International Monetary Fund (IMF), 1981, 1985, and 1993, Government Finance Statistics Yearbook, Washington.
Karras, G., 1997, "On the Optimal Government Size in Europe: Theory and Empirical Evidence," The Manchester School, 65(3), 280-294.
Kormendi, R. and P. Meguire, 1985, "Macro Economic Determinants of Growth: Cross-Country Evidence," Journal of Monetary Economics, 16, 141-163.
Landau, D., 1983, "Government Expenditures and Economic Growth: A Cross Country Study," Southern Economic Journal, 49, 783-792.
Leamer, E., 1974, "False Models and Post-Data Model Construction," Journal of the American Statistical Association, 69, 122-31.
Leamer, E., 1978, Specification Searchers: Ad-Hoc Inference with Non-experimental Data, Wiley, New York.
Levine, R. and D. Renelt, 1992, "A Sensitivity Analysis of Cross- Country Growth Regressions," American Economic Review, 84(4), 942-963.
Lindbeck, A., 1986, "Limits to the Welfare State," Challenge, 28(6), 31-36.
Mankiw, N. and L. Summers, 1986, "Money Demand and the Effects of Fiscal Policies," Journal of Money, Credit, and Banking, 18, 415-429.
North, D., 1990, Institutions, Institutional Change and Economic Performance, Cambridge, Cambridge University Press.
Organisation of Economic Cooperation and Development (OECD) (a), 1991, National Accounts 1977-1989, Paris. Also used version 1960-1989, Paris, and microfiche versions 1960-1991 and 1960-1992.
Organisation of Economic Cooperation and Development (OECD) (b), 1992, Economic Outlook, 62, Paris.
Peden, E., 1991, "Productivity in the U.S. and its Relationship to Government Activity: An Analysis of 57 years, 1926 –86," Public Choice, 69, 153-173.
Ram, R., 1986, "Government Spending and Economic Growth: A New Framework and Some Evidence From Cross-Section and Time-Series Data," American Economic Review, 76(1), 191-203.
Sachs J. and A. Warner, 1997, "Fundamental Sources of Long-Run Growth," American Economic Review, 87(2), 184-188.
Sala-i-Martin, X., 1997, "I Just Ran Four Million Regressions," NBER Working Paper, No 6552.
Slemrod J., 1995, "What Do Cross-Country Studies Teach about Government Involvement, Prosperity, and Economic Growth?," Brookings Paper on Economic Activity, 2, 373-431.
Summers, R. and A. Heston, 1984, "Improved International Estimates of Real Product and its Composition: 1950-1980," Review of Income and Wealth, 30, 207-262.
Tanzi, V. and L. Schukneckt, 1997(a), "Reforming Government: An Overview of Recent Experiences," European Journal of Political Economy, 13, 395-417.
¾ , 1997(b), "Reconsidering the Fiscal Role of Government: The International Perspective," American Economic Review, 87(2), 164-168.
Temple, J., 1999, "The New Growth Evidence," Journal of Economic Journal, 37, 112-156.
Tornell, A and P. Lane, 1999, "The Voracity Effect," American Economic Review, 89(1), 22-46.
United Nations (UN), 1985, 1992, 1994, National Account Statistics: Main Aggregates and Detailed Tables, 1982, New York. Also used version 1990, New York, and version 1992, New York.
![]()
APPENDIX
Table A.
Variable statistics as a fraction of GDP across countries over the period 1973-1992
|
Variable |
Obs |
Mean |
Std. Dev |
Min. |
Max. |
|
Growth rate |
84 |
1.9 |
1.3 |
-0.97 |
5.4 |
|
Initial income |
84 |
11178 |
2727 |
4479 |
17710 |
|
Investment |
84 |
27.4 |
5.0 |
18.2 |
40.0 |
|
Education |
84 |
8.0 |
2.2 |
1.2 |
12.1 |
|
Population |
84 |
0.63 |
0.72 |
-0.18 |
5.6 |
|
Total Size of Government |
84 |
19.9 |
4.8 |
9.3 |
30.1 |
|
Transfer |
84 |
10.5 |
3.2 |
4.9 |
19.3 |
|
Public Good |
84 |
7.1 |
1.8 |
3.6 |
11.4 |
|
Cash Transfer |
84 |
1.7 |
1.3 |
0.38 |
6.0 |
|
Pecuniary Transfer |
84 |
8.8 |
2.4 |
4.3 |
14.7 |
|
Interest Payments |
84 |
2.3 |
2.8 |
-3.5 |
10.2 |
Chapter 5
Measurement of Transfers and Peaking of Fiscal Sizes of Government
5.1 Introduction
Taxes and transfers have risen markedly in developed countries over the last century (Tanzi and Schuknecht [1997]. Economic theory suggests that this trend cannot continue indefinitely, however, because the disincentives posed by taxation in theoretically plausible models will eventually limit tax revenues, and hence government spending and transfers. It appears that limits of some sort may have been reached in recent decades. Table 1 reports peak government spending (S0), transfers (T0), and tax revenue (R0) from official budgets of national plus subnational governments as percentages of gross national product (GNP) for 22 OECD countries over the period 1972-1992 as well as the values assumed in 1992. Many of the
countries experienced their greatest spending and transfers during the mid-1980s or
Table 1.
Peaking of On-Budget Transfers, Spending, and Tax Revenue as Percentages of GNP (1972-1992)
|
Maximum S0 |
S0 in 1992 |
Maximum T0 |
T0 in 1992 |
Maximum R0 |
R0 in 1992 |
|
|
Australia |
38% (1976) |
36% |
30% (1976) |
24% |
32% (1990) |
30% |
|
Austria |
52% (1986) |
50% |
41% (1981) |
41% |
44% (1992) |
44% |
|
Belgium |
63% (1984) |
54% |
44% (1984) |
34% |
48% (1985) |
46% |
|
Canada |
48% (1981) |
45% |
36% (1982) |
32% |
39% (1991) |
38% |
|
Denmark |
57% (1982) |
55% |
46% (1982) |
44% |
54% (1988) |
51% |
|
Finland |
66% (1992) |
66% |
60% (1992) |
60% |
49% (1992) |
49% |
|
France |
49% (1992) |
49% |
39% (1986) |
39% |
45% (1984) |
44% |
|
Germany |
48% (1992) |
48% |
40% (1992) |
40% |
43% (1992) |
43% |
|
Ireland |
62% (1985) |
45% |
47% (1982) |
33% |
44% (1988) |
41% |
|
Italy |
55% (1992) |
55% |
37% (1987) |
36% |
43% (1992) |
43% |
|
Japan |
34% (1983) |
31% |
29% (1989) |
26% |
32% (1990) |
29% |
|
Luxembourg |
47% (1984) |
47% |
45% (1991) |
45% |
39% (1978) |
36% |
|
Netherlands |
60% (1983) |
53% |
48% (1983) |
42% |
48% (1988) |
47% |
|
New Zealand |
47% (1984) |
44% |
37% (1981) |
36% |
41% (1989) |
37% |
|
Norway |
58% (1978) |
48% |
49% (1978) |
43% |
51% (1986) |
48% |
|
Portugal |
50% (1985) |
41% |
33% (1985) |
27% |
38% (1992) |
38% |
|
Spain |
42% (1991) |
42% |
33% (1991) |
32% |
36% (1992) |
36% |
|
Sweden |
66% (1985) |
56% |
55% (1985) |
42% |
57% (1990) |
52% |
|
Switzerland |
32% (1976) |
32% |
28% (1992) |
28% |
31% (1986) |
31% |
|
Turkey |
31% (1979) |
30% |
24% (1979) |
23% |
23% (1981) |
23% |
|
U.K. |
47% (1975) |
40% |
35% (1975) |
28% |
39% (1982) |
35% |
|
U.S. |
34% (1991) |
34% |
20% (1991) |
19% |
29% (1987) |
29% |
Note. On-budget transfers equal on-budget spending minus spending on public goods and interest on debt- see text below for details on how spending on public goods and interest were calculated.
earlier. Moreover, the countries with the greatest peak spending and transfers tended to have the greatest declines from the peaks to 1992.
We ask whether some of this apparent leveling or peaking of sizes of government may be a result of limits to taxation. The "limit to taxation" might be defined in several ways. For instance, a well-informed electorate that desires an extensive safety net may rationally keep transfers and spending below maximum feasible levels because efficiency losses per marginal unit of tax revenue could be infinite at maximum revenue (Atkinson and Stern [1974], Browning [1976], Meltzer and Richard [1981], and Svensson [1990]). On the other hand, the political-economy analysis of Brennan and Buchanan [1977] posits that government maximizes revenue in long-run equilibrium, and Buchanan and Lee [1982] model a simple political process that leads to an equilibrium tax rate greater than the rate that maximizes long-run revenue.
We compute theoretical spending and transfer maxima calculated in a model of the long-run Laffer curve and compare these maxima to observed levels of spending and transfers. The model differs primarily from those in the literature in that the focus is on the size of government - for example, spending, transfers, and revenue - rather than on the tax rates that maximize revenue. Although the model is simplified by treating capital as constant, the tax rates that maximize revenue in the model are comparable to the rates that maximize revenue in other models of the long-run Laffer curve (e.g., Fullerton [1982]).
In the model, government provides public goods and private goods, and transfers are defined as the net claims to private goods that result directly from government fiscal decisions about budgeted spending and its funding. This definition differs from the accounting definition underlying Table 1, in which transfers are simply amounts recorded on public budgets when governments make payments to provide private goods. To compare transfers calculated in our theoretical model with actual data, the first step is to measure transfers as we define them in our theoretical model. Because accounting practices vary from country to country, using a measure of transfers that is consistent with our theoretical model can also make comparisons across countries and time more informative than comparisons based on traditional measures from public budgets.
We find that tendencies toward flat or falling spending and transfer percentages are somewhat more marked with measures corresponding to our definition of transfers than the accounting-based relationship observed in Table 1. The pattern that countries with the greatest peak fiscal sizes usually experienced the greatest declines in fiscal sizes is also more marked with our measures. We find, moreover, that calculated long-run limits under reasonable assumptions are less than observed peak ratios of spending, transfers, and revenue to GNP in several OECD countries.
Because the greatest observed fiscal sizes exceed calculated long-run maximum fiscal sizes, and because the countries with the greatest peak fiscal sizes also had the greatest declines in fiscal sizes, we cannot reject the idea that fiscal sizes in some developed countries have bumped up against limits to taxation. Put differently, political processes in a number of developed countries may for a period of time have led to spending and transfers greater than the maximum levels that can be supported in a theoretical long-run equilibrium.
In section 5.2 we calculate maximum spending, transfers, and tax revenue that can be supported in a theoretical long-run general equilibrium. Section 5.3 begins with a description of the approach used to derive internally consistent measures of fiscal size of government, then presents measures of fiscal size established with actual data, and finally compares these numbers to numbers derived in section 5.2. Section 5.4 concludes this paper.
5.2 A Theoretical Limit to the Fiscal Size of Government
The well-known Laffer hypothesis (Laffer [1979]) posits a maximum sustainable size of government. It is not generally known, however, how close actual spending and transfer patterns are to these theoretical maxima. We address this issue by comparing observed levels of spending, transfers, and tax revenue to limits derived in a theoretical model.
We categorize government spending as either transfers (T), spending on public goods (G), or interest on preexisting debt (r b). Note that G includes only public provision of public goods, which is usually a small part of total government spending. We categorize the sources of government revenue as either tax revenues (R), non-tax payments from households to government (N) (e.g., fines, fees, forfeitures, penalties, and sales), non-tax revenue in the form of financial returns from government enterprises (F), or the government's budget deficits (D b), which equals the increase in the household's holding of government debt. The government budget is written as
(1)
According to the Laffer hypothesis one tax rate, or a vector of tax rates, maximizes equilibrium tax revenue. Similarly, there are associated maximum levels of spending and transfers for given levels of public goods (G), interest on the debt (r b), non-tax revenue (N + F), and the deficit (D b). Such maximum levels of spending, transfers, and tax revenue may be thought of as theoretical limits to the fiscal size of government.
We compute rough technical limits by embedding the government and household budgets (described below) into a general-equilibrium model in which the aggregate household receives utility from net income (y), leisure, and public spending. We simplify the model by not allowing capital to vary. While including capital distortions would permit a more accurate representation of the economy, it introduces a number of important complications such as whether to model the economy as open or closed (Feldstein and Horioka [1989], Mendoza and Tesar [1995]), whether to use a naive or optimization model of investment (Bernanke, Bohn, and Reiss [1988]), and what values to assume for the elasticity of intertemporal substitution (Judd [1987], Hansen and Singleton [1983], Mankiw and Zeldes [1991], and Engen and Gale [1997]). In the end, we limit our analysis to labor distortions because, averaged across OECD countries over 1972-1992, labor income was 63 percent of total income.
The aggregate household receives income from labor, capital, and holdings of government bonds (debt), and pays taxes to government on each type of income. The household also receives transfers from government and makes payments to government to cover nontax liabilities and to augment holdings of government bonds.
The aggregate household’s total income, net of all payments to government,
is written as
(2)
where tl, tk, tb, are the marginal tax rates faced by the aggregate household on labor income, capital income, interest income from government bonds; w and r are gross returns to labor and capital; r is the interest rate on government debt; l is labor supplied by the household; k is capital owned by the household including capital owned indirectly through firms; b is the net level of (preexisting) government debt; and T is transfers. The corresponding tax revenues are thus R º tlwl + tkrk + tbr b. All variables are defined over a given accounting period, taken below to be a year, and time subscripts are suppressed.
The model is parameterized to fit a stylized "average OECD economy" in a calibration allocation described in appendix C, with labor supply elasticities in empirically reasonable ranges. Because most differences in spending reflect differences in transfers (see Table 1), we compute equilibria at different values of the tax rate on labor income (tl) when T adjusts to keep the government budget (1) in balance. In these calculations, G, tk, tb, N, and F are held constant, government meets interest obligations on preexisting debt at a given interest rate, and primary deficits (deficits net of interest payments) equal zero, which approximates actual average primary deficits over our sample. By comparing equilibria, we find the value of tl at which spending, transfers, and tax revenue attain maxima, and divide by the associated level of GNP to compare with observed fiscal sizes reported above.,
We assume initially that GNP is a Cobb-Douglas function, a0Kal(1-a), where a is a parameter equal to capital’s share of GNP and K is the sum of private capital (k) plus government capital (k’). Without loss of generality, we choose units so a0 = 1, K=1, and, in the calibration allocation, l = 1. Under competition, gross returns to labor and capital are then
(3)
and
(4)
Factor payments exhaust GNP (wl + rk + F = Kal(1-a)), so (3) and (4) imply that returns on private and government capital are equal (F = rk’).
An equilibrium is an allocation such that (1), (2), (3), and (4) hold, and the household maximizes utility by choice of l subject to the budget (2), taking tl, w, tk, r, k, tb, r , b, T, N, D b, and G as given. Utility is assumed to be a Stone-Geary-modified CES function:
(5)
where a , w , and d parameters and V is a function that captures utility effects of G and utility effects of T not operating through consumption of net income (y). The form of V does not affect the results.
The parameters a , w , and d are chosen such that: (i) the uncompensated labor supply elasticity equals a given value in the calibration allocation; (ii) the compensated labor supply
elasticity equals a given value in the calibration allocation; and (iii) the calibration allocation is an equilibrium. To reflect the uncertainty that exists about labor supply elasticities, we consider two pairs of uncompensated and compensated elasticities: 0.1 and 0.25; and 0.44 and 0.52.
When the uncompensated and compensated elasticities are assumed to be small (0.1 and 0.25, respectively), maximum spending, transfers, and tax revenue occur when the marginal labor tax rate is 81 percent, at which point spending is 72 percent of GNP, transfers are 58 percent of GNP, and tax revenue is 68 percent of GNP. When large uncompensated and compensated elasticities are assumed (0.44 and 0.52, respectively), maximum fiscal size occurs when the labor tax rate is 64 percent, at which point spending is 61 percent of GNP, transfers are 48 percent of GNP, and tax revenue is 57 percent of GNP. The revenue-maximizing tax rates in these two cases are quantitatively similar to those in Fullerton [1982], who used a more detailed model with variable capital.
Sensitivity analysis in appendix A indicates that computed limits to the fiscal size of government are not particularly sensitive to reasonable alternative assumptions about the initial level of government debt, labor’s share, the elasticity of substitution in the aggregate production function, the interest rate on government debt, or the level of taxes on labor at which the model is calibrated. Results are somewhat sensitive, however, to the values of tk and tb, primarily because the assumptions of constant capital and given debt make taxes on capital and debt lump-sum taxes in the model. Computed limits are obviously sensitive to assumed labor supply elasticities, and presumably would also be sensitive to assumptions that would be made if the analysis were broadened to treat capital and debt as endogenous (see appendix A for a brief analysis).
5.3 Comparing Observed Fiscal Sizes of Government in OECD Countries to Theoretical Limits
Unfortunately, we can not compare our theoretical limits to actual numbers of fiscal sizes as reported in official budgets because our theoretical limits are based on the definition of transfers stated above while numbers reported in official budgets are not. Direct comparisons may be very misleading because different countries use different accounting practices to transfer resources to households, and thus appear to have different sizes of government. To make comparisons between theory and actual data more consistent, a first
step is therefore to measure actual transfers as we define them in our theoretical model.
5.3.1 Alternative Measure of Transfers and Fiscal Size of Government
We make two adjustments to the accounting definition of transfers used by the OECD in order to increase the comparability of our theoretical peak levels with the observed data. First, we add tax expenditures, that is subsidies given through tax exclusions, deductions, and credits. Whereas a cash transfer is a payment from government, a tax expenditure is a reduction in payments to government. Second, we subtract indirect taxes paid by recipients of cash or equivalent transfers. Taken together, these two adjustments greatly improve the consistency of the raw data with our definition of transfers, namely that transfers are net claims to private goods that result directly from government fiscal decisions about budgeted spending and its funding.
To put the first adjustment into perspective, consider two countries that are identical except that the government in country A provides cash payments to a group or activity and records this as an expenditure on its budget, while the government in country A¢ provides the same size of subsidy to the same group or activity in the form of tax expenditure which shows up on budget as a reduction in tax revenue. The two governments use different accounting practices but provide households with the same net claims to private goods or transfers. One way to account for this discrepancy is to count tax expenditures as transfers.
The traditional approach to measuring tax expenditures is to list a set of exclusions, deductions, and credits; calculate the values of each of these for each taxpayer at the taxpayer’s own marginal rate; and sum to obtain total tax expenditures (see e.g., Surrey and McDaniel [1985]). Forming the list of exclusions, deductions, and credits in turn requires a large number of decisions about how to treat each aspect of the tax code. The approach we develop avoids this difficulty. We recover inclusive measures of tax expenditures as well as government spending, transfers, and tax revenue from data on marginal tax rates and other variables using a form of the government budget constraint written in terms of marginal tax rates that are taken to be constants, as in linear-tax models and as is implicit in traditional calculations of tax expenditures.
The second adjustment is useful because transfers paid in cash or the equivalent are typically subject to indirect taxes such as sales and value-added taxes, so transfers recorded on official budgets - that is the gross amounts paid to governments - overstate the net claims to private goods provided by government. Netting out indirect taxes from calculated transfers better reflects the actual benefits conferred.
5.3.1.1 Detailed Description of Approach
Implementing these adjustments empirically is a complicated matter. We put them into practice and describe some important features in what follows.
We begin by assuming that each national economy can be characterized to contain an aggregate or representative household. This assumption greatly reduces data requirements but is not required generally for the approach. In appendix B, we show how the approach can be generalized to calculate tax expenditures, transfers, spending, and revenue from data on a large number of households in an economy, each of which faces its own marginal tax rates. The appendix also contains a numerical example showing that calculations based on disaggregate data could yield results essentially similar to results based on aggregate data.
The government budget (1) constraint can be rewritten as,
(1')
Because tax revenue R º tlwl + tkrk + tbr b, and because we wish to distinguish R from tax revenue as measured on official budgets (R0), we refer to R as full tax revenue. We establish below that S and T are also inclusive measures, including tax expenditures as well as on-budget spending, so these might similarly be termed full government spending and full transfers. For brevity, however, we suppress "full" in referring to S and T.
Because it is useful to distinguish between transfers that are not subject to indirect taxes (ti), in-kind transfers (T#), and transfers that are subject to indirect taxes, cash transfers or the equivalent (T$), we break transfer into two components, where T º T# + (1-ti) T$. We refer to transfers that are subject to indirect taxes as pecuniary transfers.
We use (1') to measure S, T, and R over time in OECD countries. Specifically, we use data on tl, wl, tk, rk, tb, and r b for a given year and country to find the value of R, and then add N, F, and D b to find the value of S. Thereafter we subtract G and r b to find the value of T. We also break transfers into in-kind and pecuniary components by solving T º T# + (1-ti) T$ for T$ and inserting data on T# and ti to find the value of T$.
A central property of this approach is that transfers include tax expenditures; specifically, pecuniary transfers equal on-budget cash transfers plus tax expenditures. To see this, let xl, xk, and xb denote income excluded from labor-income, capital-income and debt-interest taxation, let dl, dk, and db denote deductions permitted against labor income, capital income, and debt-interest income, and let c denote total tax credits. Then on-budget tax revenue is
(6)
Similarly, on-budget transfers consist of in-kind transfers plus cash transfers (denoted T$$), and equal total on-budget sources of funds (R0 + N + F + D b) less on-budget spending on public goods and interest (G + r b), or
(7)
Finally, tax expenditures (E) are the loss in revenue from all exclusions, deductions, and credits. The exclusion of one currency unit of income that otherwise would be taxed at rate t results in a loss of revenue of t currency units. A deduction of one currency unit similarly results in a revenue loss of t currency units. A tax credit of one currency unit, on the other hand, costs the government one currency unit in revenue. Thus tax expenditures are
(8)
An immediate consequence of the definition of full tax revenue, (6) and (8) is that tax expenditures are the difference between full tax revenue and on-budget tax revenue, with an adjustment because on-budget tax revenue includes revenue from taxation of pecuniary transfers but full tax revenue does not:
(9)
Note that tiT$ is netted from gross transfers of T# + T$ on the spending side of the government budget (1) and hence is not included in full tax revenue on the sources-of-funds side. Below, we
report values of tax expenditures calculated using (9). Instead of beginning with an explicit list of all tax preferences, these calculations start with full tax revenue and recover tax expenditures indirectly as a revenue difference.
From (1), (7), and (9):
(10)
This "transfer identity" says that gross transfers, T# + T$ = T + tiT$, are identically equal to on-budget transfers plus tax expenditures. The transfer identity, (10), and the definition T = T# +
(1- ti)T$ then imply that
(11)
which establishes that pecuniary transfers equal cash transfers plus tax expenditures.
It should be clear that E is a broad notion that captures the values of all reasons why on-budget tax revenue is less than full tax revenue plus revenue from indirect taxation of pecuniary transfers, including all explicit and implicit exclusions, deductions, and tax credits. The inclusive
treatments of tax expenditures and transfers parallel the inclusive notion of full tax revenue.
A feature of tax expenditures calculated using (9) is that our tax expenditures have a progressive nature. Because the approach here treats the marginal tax rates tl, tk, and tb as constants, any progressivity in the rate structures of actual tax systems shows up not in tl, tk, and tb but rather as exclusions, deductions, and credits. Specifically, a progressive rate structure means that some infra-marginal labor income is taxed at rates below tl. A household is better off if infra-marginal income is taxed at rates below tl than if that income is taxed at rate tl. Under the approach here, the household is viewed as taxed at tl on each unit of labor income, and government is then viewed as providing a lump-sum transfer to the household equal to the benefit of lower rates on infra-marginal income. The resulting progressivity tax expenditure reflects the net claims to private goods that government provides by applying lower tax rates at lower income levels.
Because we treat each economy as containing an aggregate household, progressivity tax expenditures reported below are the sums of both positive and negative components. Thus, the aggregate household in a country represents the household sector in the country, so the marginal tax rates tl, tk, and tb are measured below as the rates that apply to an "average" household in the country. In actual economies with progressive rate structures, this means not only that some income is taxed at rates below tl, tk, and tb, but also that some higher-income households face marginal rates above tl, tk, and tb. Income taxed at rates below tl, tk, and tb results in positive progressivity tax expenditures, whereas income taxed at rates above tl, tk, and tb results in negative progressivity tax expenditures. That is, households with income taxed at rates greater than tl, tk, and tb are viewed as paying lump-sum transfers to government, and such payments reduce the calculated value of E. An implication is that, under the aggregate-household assumption, the progressivity tax expenditure could in principle be negative; this could occur in a tax system with marginal tax rates that are relatively constant up to the income at which rates tl, tk, and tb apply, and strongly progressive at higher incomes.
Although aggregate tax expenditures calculated below using (9) can be larger or smaller than tax expenditures calculated using the traditional "list-and-sum" approach, there are two reasons why (6) might give larger numbers. First, traditional data generally exist for national governments only, and miss tax expenditures by subnational governments. Second, the list of exclusions, deductions, and credits underlying the traditional approach may not be inclusive. For example, traditional calculations typically do not include a progressive tax expenditure. Another example is that the approach here counts as tax expenditures the tax savings to households that arise because personal income taxes are eliminated or postponed when corporations retain capital income.
A potential downside of including tax expenditures as transfers, however, is that it can distort cross-country comparisons of government size. For instance, a country that has high marginal tax rates but generous tax credits will appear to have a larger government than an otherwise identical country that has lower marginal tax rates but tax credits just low enough that the households in the two countries are equally well off. Nevertheless, this is not necessarily a weakness. There is, indeed, clearly greater government involvement in the first country despite an equal net result.
A description of data used to implement the approach is given in appendix C.
5.3.2 Fiscal Sizes Averaged over 1972-1992
Table 2 reports summary statistics for fiscal sizes as percentages of GNP for the 22 countries in the sample averaged across the years 1972-1992. Spending, transfers, and full tax revenue varied by a factor of about two, where the U.S. had the smallest and Sweden the largest values during the period. A clear geographical pattern is that countries with relatively greater fiscal sizes of government lie in northern and, to a lesser extent, central Europe, home to, for instance, all countries with spending above 56 percent of GNP, transfers above 47 percent of GNP, or full tax revenue above 54 percent of GNP except Italy and Canada.
The difference between our measure and measures reported in official budgets is substantial. Formally, we measure government spending as S = T + G + r b, whereas government spending measured on official budgets is S0 º T0 + G +r b. From the definition T = T# + (1- ti)T$, (7), and (11), the difference is
(12)
Table 2.
Components of Spending and Funding
|
S |
T |
T# |
(1-ti)T$$ |
(1-ti)E |
G |
r b |
R |
D b |
N+F |
||
|
Australia |
53.6 |
43.3 |
7.7 |
14.2 |
21.4 |
8.9 |
1.4 |
49.0 |
2.6 |
2.0 |
|
|
Austria |
63.6 |
54.9 |
8.2 |
24.3 |
22.4 |
6.5 |
2.2 |
56.9 |
4.6 |
2.1 |
|
|
Belgium |
75.4 |
59.8 |
7.0 |
25.6 |
27.2 |
8.4 |
7.1 |
66.7 |
8.3 |
.4 |
|
|
Canada |
60.8 |
49.0 |
10.1 |
17.9 |
21.1 |
8.4 |
3.4 |
52.8 |
5.8 |
2.3 |
|
|
Denmark |
74.1 |
64.4 |
12.2 |
19.7 |
32.5 |
8.2 |
1.6 |
71.4 |
1.1 |
1.6 |
|
|
Finland |
64.8 |
58.1 |
10.3 |
18.8 |
29.1 |
6.6 |
.1 |
61.9 |
1.8 |
1.0 |
|
|
France |
57.8 |
48.9 |
8.1 |
22.4 |
18.4 |
7.5 |
1.4 |
54.8 |
2.4 |
.6 |
|
|
Germany |
57.8 |
49.5 |
10.1 |
18.9 |
20.5 |
7.4 |
.9 |
54.4 |
2.9 |
.5 |
|
|
Ireland |
73.4 |
60.9 |
10.4 |
21.9 |
28.6 |
6.6 |
5.9 |
59.7 |
11.4 |
2.3 |
|
|
Italy |
65.9 |
52.9 |
8.3 |
21.2 |
23.4 |
7.0 |
6.0 |
53.8 |
11.8 |
.3 |
|
|
Japan |
44.2 |
38.7 |
4.5 |
20.8 |
13.4 |
4.6 |
.9 |
38.8 |
5.3 |
.2 |
|
|
Luxembourg |
55.2 |
52.3 |
6.3 |
23.1 |
22.9 |
4.6 |
-1.7 |
54.3 |
-.8 |
1.6 |
|
|
Netherlands |
67.3 |
57.2 |
9.3 |
28.6 |
19.3 |
6.6 |
3.6 |
58.7 |
4.9 |
3.7 |
|
|
New Zealand |
55.9 |
46.8 |
11.0 |
19.4 |
16.5 |
4.4 |
4.7 |
47.7 |
4.1 |
4.2 |
|
|
Norway |
69.7 |
62.7 |
10.1 |
22.6 |
30.0 |
8.4 |
-1.3 |
66.5 |
2.1 |
1.1 |
|
|
Portugal |
50.6 |
38.4 |
7.1 |
17.5 |
13.7 |
7.6 |
4.6 |
41.3 |
8.4 |
.9 |
|
|
Spain |
46.5 |
40.2 |
6.6 |
16.7 |
17.0 |
4.7 |
1.6 |
41.8 |
4.0 |
.7 |
|
|
Sweden |
77.4 |
67.8 |
13.6 |
25.8 |
28.3 |
9.0 |
.7 |
70.9 |
4.6 |
1.9 |
|
|
Switzerland |
42.6 |
38.7 |
7.1 |
17.6 |
14.1 |
4.1 |
-.2 |
41.8 |
.3 |
.5 |
|
|
Turkey |
49.0 |
42.9 |
5.6 |
12.5 |
24.7 |
4.6 |
1.5 |
42.8 |
3.9 |
2.2 |
|
|
U.K. |
56.0 |
44.0 |
10.0 |
16.0 |
18.0 |
9.3 |
2.6 |
50.9 |
4.1 |
1.0 |
|
|
U.S. |
42.0 |
28.4 |
6.3 |
10.3 |
11.8 |
10.9 |
2.6 |
39.3 |
2.4 |
.2 |
|
|
Average |
59.3 |
50.0 |
8.6 |
19.8 |
21.6 |
7.0 |
2.2 |
53.5 |
4.4 |
1.4 |
The two terms on the right-hand side of (12) reflect the two differences in the measurement of transfers. Averaged across all countries and years, the approach here adds 21.5 percentage points to calculated spending and transfer levels by including net tax expenditures ((1-ti)E) and subtracts
4.4 percentage points by netting out indirect tax revenues on gross cash transfers (tiT$$).Transfers dominate other spending in Table 2. Averaged across all countries and the entire time period, transfers made up 84 percent of total spending, spending on public goods made up 12 percent of total spending, and net interest payments made up 3.8 percent of total spending.
Differences in government spending across countries primarily reflect differences in transfers. For instance, the correlation coefficient across countries between the transfer percentage and the spending percentage, both averaged over 1972-1992, is 0.96. Similarly, the range of transfers across countries reported in Table 2 substantially exceeds ranges of spending on public goods or net interest payments.
Our estimates of gross tax expenditures (E) are larger than estimates calculated using the traditional approach of listing a set of specific tax breaks and summing values of these tax breaks over all taxpayers. To provide a rough comparison, we took traditional tax expenditure calculations from three collection volumes to obtain a comparison set of one or more years of estimates for nine countries (Australia, Austria, Canada, France, Germany, Ireland, Spain, the U.K., and the U.S.). We then computed the average traditional tax expenditure across available years for each country, and also found the average for that country across the same years from our calculations of E. That left nine pairs of tax expenditure estimates. Across the nine, average gross tax expenditures were 22.4 percent of GNP using our approach versus 8.4 percent of GNP using the traditional approach.
5.3.3 Fiscal Sizes in Individual Countries over Time
Table 3 summarizes the peaking of inclusive spending (S), transfers (T), and full tax revenue (R) in the same format as Table 1, which summarized peaking of on-budget spending (S0), transfers (T0), and tax revenue (R0). Graphs of S, T, and R are
Table 3.
Peaking of Transfers, Spending, and Full Tax Revenue as Percentages of GNP over 1972-1992
|
Maximum S |
S in 1992 |
Maximum T |
T in 1992 |
Maximum R |
R in 1992 |
|
|
Australia |
64% (1985) |
58% |
51% (1985) |
46% |
58% (1985) |
52% |
|
Austria |
71% (1986) |
68% |
61% (1983) |
59% |
63% (1984) |
61% |
|
Belgium |
92% (1986) |
82% |
73% (1984) |
63% |
78% (1985) |
74% |
|
Canada |
71% (1991) |
67% |
54% (1982) |
53% |
61% (1991) |
60% |
|
Denmark |
82% (1982) |
74% |
71% (1982) |
63% |
77% (1986) |
71% |
|
Finland |
76% (1992) |
76% |
70% (1992) |
70% |
66% (1988) |
59% |
|
France |
68% (1986) |
66% |
58% (1986) |
56% |
63% (1986) |
61% |
|
Germany |
64% (1982) |
62% |
56% (1982) |
54% |
60% (1985) |
57% |
|
Ireland |
86% (1985) |
65% |
71% (1983) |
52% |
67% (1985) |
61% |
|
Italy |
73% (1992) |
73% |
58% (1987) |
54% |
62% (1992) |
62% |
|
Japan |
50% (1986) |
45% |
43% (1986) |
40% |
45% (1986) |
43% |
|
Luxembourg |
71% (1992) |
71% |
69% (1992) |
69% |
59% (1984) |
59% |
|
Netherlands |
79% (1983) |
62% |
67% (1983) |
51% |
65% (1983) |
56% |
|
New Zealand |
67% (1975) |
49% |
60% (1975) |
41% |
55% (1976) |
43% |
|
Norway |
75% (1979) |
58% |
68% (1988) |
53% |
72% (1986) |
58% |
|
Portugal |
64% (1985) |
50% |
47% (1985) |
35% |
51% (1988) |
47% |
|
Spain |
58% (1990) |
56% |
50% (1990) |
47% |
53% (1989) |
51% |
|
Sweden |
84% (1982) |
63% |
75% (1980) |
49% |
74% (1976) |
59% |
|
Switzerland |
48% (1984) |
46% |
44% (1984) |
42% |
48% (1986) |
45% |
|
Turkey |
55% (1985) |
53% |
50% (1985) |
46% |
47% (1985) |
46% |
|
U.K. |
67% (1975) |
52% |
55% (1975) |
40% |
56% (1975) |
47% |
|
U.S. |
49% (1978) |
42% |
37% (1978) |
28% |
47% (1978) |
38% |
appended after the appendix. In reading the graphs, note that non-tax revenue (N + F) is generally small and varies little over time relative to GNP, so the budget deficit
in a year can be read roughly as the difference between the S and R graphs, less a few percent of GNP, for that year. (The main exception is for New Zealand, where N + F rose from 2.3 percent of GNP in the mid-1970s to 8.2 percent of GNP in 1987, before falling to 4 percent of GNP in 1992.)
We first address whether or not the countries have clear upward trends in spending. To provide an indication, we say somewhat arbitrarily that a series is on a downward trend if the 1992 value is five or more percentage points below the peak value, and we say that a percentage is essentially level after a peak if the 1992 value is not five or more points below the peak value. (Note, the horizontal scale of the graphs causes even a five percentage-point drop to appear relatively level.) By this criteria, Finland, Luxembourg, Canada, Turkey, and possibly Italy exhibit apparent upward trends in spending. Eleven of the remaining 17 countries in the sample are on downward trends and six have spending percentages that are essentially level. In contrast, only seven
countries exhibited downward spending trends in the measures reported in Table 1.In the five countries that attained the greatest peak spending percentages (Belgium, Ireland, Sweden, Denmark, and the Netherlands), the average decline in spending from peaks until 1992 was 15.4 percent of GNP. In the other 17 countries, the average decline was 5.9 percent of GNP.
The peaking of government spending reflects the peaking of transfers. This relationship is easily seen in the country graphs, where changes in government spending closely track changes in transfers. More formally, the average value across countries of the correlation coefficient between annual spending and annual transfers is 0.96. This correlation between annual transfers and spending indicates that differences in government spending across time mainly reflect differences in transfers.
Transfers appear to be on upward trends (as defined above) as percentages of GNP only in Finland, Luxembourg, Turkey, and possibly Canada. For the other 18 countries, thus, it is natural to ask whether the transfer percentage has been on a downward trend since reaching its peak over the period or has been essentially level since its peak. By the criterion proposed above, 11 of the countries have downward-trending transfers and seven have transfers that are essentially level. In contrast, only eight countries had downward-trending transfers by the measure used in Table 1.
Reductions in peak transfers as a percentage of GNP until 1992 were generally greatest in the countries with the greatest peak transfer percentages, i.e., Sweden, Belgium, Ireland, Denmark, and Finland. The average decline in transfers from peaks until 1992 was 12.6 percent of GNP in these five countries but only 6.7 percent of GNP in the other 17 countries.
Full tax revenue is less sensitive to short-term business-cycle influences than spending and transfers, so it may be a better indicator of longer-term trends. We see from the country graphs and Table 3 that full tax revenue appears to be trending upwards as a percentage of GNP only in Canada, Italy, and Turkey. Of the other 19 countries, 10 have full-tax-revenue percentages on downward trends (as defined above) and nine have full-tax-revenue percentages that are essentially level. Thus while Table 1 provided only weak evidence of breaks in upward trends in on-budget tax revenue (R0) over 1972-1992, Table 3 presents a reasonably clear picture of breaks in upward trends in full tax revenue (R) over the period.
For the five countries with the greatest peak full-tax-revenue percentages (Belgium, Denmark, Sweden, Norway, and Ireland), the average decline in full tax revenue from peaks until 1992 was 9.0 percent of GNP. The average decline in the other 17 countries was 4.1 percent of GNP. So again our measures make peaking more evident.
A general pattern emerged in many of the countries. Spending rose more than full tax revenue over a relatively long period of time, so deficits grew. Full tax revenue reached a plateau at about the same time that spending peaked, though high deficits persisted for several years after the spending peak before spending and deficits fell. Although many of the countries fit some aspects of this pattern, the pattern is particularly prominent in four of the countries that reached the highest levels of government spending. Belgium, for instance, had the greatest peak spending (92 percent of GNP in 1984) and roughly 10 years of large deficits; averaged over 1977-1988, deficits were 10.4 percent of GNP. By comparison, the average deficit across all countries and years was 4.4 percent of GNP. Similarly, Ireland (the second largest with peak spending at 86 percent in 1985) had average deficits of 15.1 percent of GNP over 1974-1987, Sweden (third with a peak at 84 percent in 1982) had average deficits of 7.3 percent of GNP over 1976-1986, and the Netherlands (fifth with a peak at 79 percent in 1983) had average deficits of 6.5 percent of GNP over 1979-1989.
5.3.4 Comparing the Actual Peak Levels to Computed Theoretical Limits
The countries that attained the greatest peak fiscal sizes tended to have greater declines in spending, transfers, and full tax revenue than other countries. Moreover, peak spending in Belgium, Ireland, Sweden, and the Netherlands coincided with peak transfers and with large deficits that lasted for periods of 10 years or more. A possible explanation for these patterns is that political forces may have pushed transfer spending to, or beyond, levels that can be sustained in equilibrium by taxation. This may have contributed to periods of deficits and weak economic performance, and ultimately to reductions in spending.
To see whether this conjecture seems plausible, we compare numbers calculated in the long-run Laffer curve model to the actual peak levels observed in our sample of OECD countries. Note that tax revenue in the model is equivalent to full tax revenue and may be compared directly with revenues reported in Tables 2 and 3, and that computed limits for S, and T may be compared with spending and transfer experiences of OECD countries.
Fourteen of the countries studied (all but Japan, New Zealand, Portugal, Spain, Switzerland, Turkey, the U.K., and the U.S.) had peak spending, transfers, and full tax revenue in excess of theoretical limits calculated under relatively large labor supply elasticities (see Table 3). That is peak spending, transfers, and full tax revenue were in excess of 64, 48, and 57 percent of GNP, respectively. Belgium, Denmark, Norway, and Sweden had peak spending, transfers, and full tax revenue in excess of theoretical limits calculated even under the small labor supply elasticities. That is peak spending, transfers, and full tax revenue were in excess of 72, 58, and 68 percent of GNP, respectively. This may help explain why fiscal sizes have fallen in some OECD countries. By 1992, for instance, only 12 of the countries had spending, transfers, and full tax revenue in excess of theoretical limits calculated under large labor supply elasticities, and only Belgium and Denmark had spending, transfers, and full tax revenue in excess of theoretical limits calculated under small labor supply elasticities. The theoretical limit calculations here are admittedly crude, but they suggest that fiscal sizes of government in many OECD countries may be or have been close to theoretical limits.
5.4 Conclusions and Discussion
We ask whether the apparent leveling or peaking of fiscal sizes that many developed countries have experienced after a century of nearly uninterrupted growth in government size may be a result of limits to taxation. We address the issue by comparing numbers computed in a long-run Laffer curve model with actual observed data from OECD countries. To improve the consistency of the comparisons we use an alternative to the conventional accounting definition of transfers that more closely captures the actual size of government. A drawback of conventional measures is that countries use different accounting practices to transfer resources to households and, thus, appear to have different sizes of government. Another problem is that, in many instances, numbers reported in official budgets are for national government and miss expenditures made by subnational governments. This can also be misleading if countries vary in governmental organization. We define transfers, instead, as the net claims to private goods that result directly from government fiscal decisions about budgeted spending and its funding, including tax policy. Our approach to calculating transfers so defined, as well as inclusive measures of government spending, tax revenue, and tax expenditures, uses data on marginal tax rates and a form of government budget constraint. We apply the method to study peaking of fiscal sizes of government in OECD countries over 1972-1992.
Based on defendable assumptions about labor supply elasticities, our findings suggest that fiscal size may reach a maximum when spending equals roughly 75 percent of GNP, transfers equal roughly 60 percent of GNP, and tax revenue equals roughly 70 percent of GNP. The experiences of countries with the greatest peak fiscal sizes suggest that some combination of political and economic forces may restrict sustainable spending above levels remarkably similar to the theoretical maxima. Fiscal sizes exceeded these levels only for short periods around times of peak fiscal sizes in these countries.
It may also be worthwhile to note the obvious; fiscal sizes peak at different levels in different countries. The determinants of these observed differences are not well known, and a full empirical assessment is beyond the scope of this paper. Several observations from this study may be useful for future research, however.
Our model suggests that labor supply elasticities are key determinants of how big government can become. High income tax rates, of course, discourage working more where the labor supply is elastic than where it is inelastic. Large differences in labor supply elasticities across countries, particularly for women, and the factors that explain them may be an important part of the explanation. For example, consider the uncompensated female labor supply elasticities reported for Canada, Germany, the U.K., and the U.S. in Killingsworth and Heckman [1986]. They aligned exactly with peak government size in these countries. Specifically, the elasticity was greatest and peak government size lowest in the U.S. In Canada, the elasticity was lowest but peak government size the highest. Germany and the U.K. lay in between for both measures.
An alternative explanation
centers on political structure. Historical differences across countries are also likely to contribute to the observed differences in government size. For instance, the preferences of the median voter may be more influential in some political structures than others. Moreover, it may also be easier in some political structures for bureaucrats to maximize their budgets.Numerous country- and time- specific factors such as business cycles, political structure, history likely influence the size of government and deserve attention. Moreover, more research should be devoted to further improving the measurement of government size to make comparisons across countries and time more informative. A first step may be to use a more disaggregated measure than that used here.
References
Atkinson, Anthony and Nicholas Stern, 1974, "Pigou, Taxation, and Public Goods," Review of Economic Studies, 41, 119-28.
Bernanke, Ben, Henning Bohn, and Peter Reiss, 1988, "Alternative Non-Nested Specification Tests of Time-Series Investment Models," Journal of Econometrics, 37, 293-326.
Blanchard, Olivier, Jean-Claude Chouraqui, Robert Hagemann, and Nicola Sartor, 1990, "The Sustainability of Fiscal Policy: New Answers to an Old Question," OECD Economic Studies, 15, 7-36.
Blomquist, Sören and Urban Hansson-Brusewitz, 1990, "The Effect of Taxes on Male and Female Labor Supply in Sweden," Journal of Human Resources, 25, 317-57.
Blundell, Richard and François Laisney, 1988, "A Labour Supply Model for Married Women in France," Annales d'Economie et de Statistique, 11, 41-72.
Borcherding, Thomas, 1985, "The Causes of Government Expenditure Growth: A Survey of the U.S. Evidence," Journal of Public Economics, 28, 359-382.
Bourguignon, François and Thierry Magnac, 1990, "Labor Supply and Taxation in France, " Journal of Human Resources, 25, 358-89.
Brennan, Geoffrey and James Buchanan, 1980, The Power to Tax: Analytical Foundations of a Fiscal Constitution, Cambridge: Cambridge University Press.
Brosio, Giorgio and Carla Marchese, 1988, "The Growth of Government under Different Redistributive Rules: A Long Term Study of the Italian Case," Public Finance Quarterly, 16, 439-63.
Browning, Edgar, 1976, "The Marginal Cost of Public Funds," Journal of Political Economy, 84, 283-98.
Buchanan, James and Dwight Lee, 1982, "Politics, Time, and the Laffer Curve," Journal of Political Economy, 90, 816-19.
Engen, Eric and William Gale, 1997, "Consumption Taxes and Saving: The Role of Uncertainty in Tax Reform," American Economic Review, 87, 114-19.
Feldstein, Martin and Charles Horioka, 1980, "Domestic Savings and International Capital Flows," Economic Journal, 90, 314-29.
Fullerton, Don, 1982, "On the Possibility of an Inverse Relationship Between Tax Rates and Government Revenues," Journal of Public Economics, 19, 3-22.
Hansen, Lars and Kenneth Singleton, 1983, "Stochastic Consumption, Risk Aversion, and the Temporal Behavior of Asset Returns," Journal of Political Economy, 91, 331-53.
Hansson, Ingemar and Charles Stuart, 1985, "Tax Revenue and the Marginal Cost of Public Funds in Sweden," Journal of Public Economics, 27, 331-53.
Husted, Thomas and Lawrence Kenny, 1987, "The Effect of the Expansion of Voting Franchise on the Size of Government," Journal of Political Economy, 105, 54-82.
International Labour Organisation (ILO), 1993, Yearbook of Labour Statistics, 1993, Geneva.
International Monetary Fund (IMF) Washington, 1981, 1985, 1993, Government Finance Statistics Yearbook.
Judd, Kenneth, 1987, "The Welfare Cost of Factor Taxation in a Perfect-Foresight Model," Journal of Political Economy, 95, 675-709.
Killingsworth, Mark, 1983, Labor Supply, Cambridge: Cambridge University Press.
¾ and James Heckman, 1986, "Female Labor Supply: A Survey," Handbook of Labor Economics vol. 1, Elsevier Science Publishers BV.
Mankiw, Gregory and Stephen Zeldes, 1991, "The Consumption of Stockholders and Nonstockholders," Journal of Financial Economics, 29, 97-112.
Meltzer, Allan and Scott Richard, 1981, "A Rational Theory of the Size of Government," Journal of Political Economy, 89, 914-27.
Mendoza, Enrique and Linda Tesar, 1996, "International Ramifications of U.S. Tax Reforms," UCSB Working Paper.
Mueller, Dennis and Peter Murrell, 1986, "Interest Groups and the Size of Government," Public Choice, 48, 125-45.
Peter Murrell, 1985, "The Size of Public Employment: An Empirical Study," Journal of Comparative Economics, 9, 424-37.
Organisation for Economic Cooperation and Development (OECD) (a), 1994, Economic Outlook, 55 June 1994, Paris. Also used version December 1995 (Paris, 1995).
¾ (b), 1995, OECD Economic Surveys 1994-1995 Finland, Paris.
¾ (c), 1995, OECD Economic Surveys 1994-1995 Belgium/Luxembourg, Paris.
¾ (d), 1991, Labour Force Statistics 1969-1989, Paris.
¾ (e), 1991, National Accounts 1977-1989, Paris. Also used version 1960-1989 (Paris, 1991) and microfiche versions 1960-1991 and 1960-1992.
¾ (f), 1989, Revenue Statistics of OECD Member Countries 1965-1988, Paris. Also used versions 1965-1983 (Paris, 1984) and 1965-1993 (Paris, 1994).
¾ (g), 1983, The 1982 Tax/Benefit Position of A Typical Worker in OECD Member Countries, Paris.
¾ (h), 1984, The Tax/Benefit Position of Production Workers, 1979-1983, Paris. Also used versions: 1979-1984 (Paris, 1986), 1981-1985 (Paris, 1986), 1983-1986 (Paris, 1987), 1984-1987 (Paris, 1988), 1985-1988 (Paris, 1989), 1986-1989 (Paris, 1990), 1987-1990 (Paris, 1991), and Annual Report Covering 1982-1992 (Paris, 1993).
¾ (i), 1978, The Tax/Benefit Position of Selected Income Groups, 1972-1976, Paris. Also used version 1974-1978 (Paris, 1980).
¾ (j), 1984, Tax Expenditures: A Review of the Issues and Country Practices, Paris.
Peltzman, Sam, 1980, "The Growth of Government," Journal of Law and Economics, 23, 209-87.
Surrey, Stanley and Paul McDaniel, 1985, Tax Expenditures, Cambridge: Harvard University Press.
Svensson, Lars-Gunnar, 1990, "Inequality Aversion and Marginal Tax Rates," Scandinavian Journal of Economics 92, 31-45.
Tanzi, Vito and Ludger Schuknecht, 1997, "Reconsidering the Fiscal Role of Government: The International Perspective," American Economic Review, 87, 164-68.
Theeuwes, Jules, 1988, "Arbeid en Belastingen," in Belastingheffing en belastinghervorming, Leiden: Stenfert Kroese, 111-43.
United Nations (UN), 1985, National Account Statistics: Main Aggregates and Detailed Tables, 1982, New York. Also used version 1990 (New York, 1992).
Willis, J. and P. Hardwick, 1978, Tax Expenditures in the United Kingdom, Institute for Fiscal Studies. London: Heinemann.
Appendix A: Simulation Details
1. Description of Calibration Allocation. It is necessary to choose parametric values for a, tk, tb, b, D b, N, F, G, and r as well as a value of tl in the calibration allocation. For a, tk, tb, r , and tl, we set these values equal to weighted average values over 1988-1992 across all countries using the data described in appendix C. For b, D b, N, F, and G, we use weighted averages of ratios to GNP. Weights are purchasing-power-parity-based GDP shares reported in OECD (a, 1995, p. A2).
Resulting values are a = 0.391, tk = 0.441, tb = 0.319, b = 0.576, N = 0.003, F = 0.003, G = 0.085, r = r b / b = 0.032 / 0.576 = 0.056, and tl = 0.480. To adjust assumed deficits so that the primary deficit (D b - r b) equals zero, we set D b = r b = 0.032. Note that the actual average value of D b across our sample was 0.029, so actual primary deficits averaged -0.3 percent of GNP.
2. Sensitivity Analysis. The following sensitivity analyses assume that labor supply elasticities are small (0.1 and 0.25) and each makes a single change in parameterizing assumptions:
(a) Several countries have run up large national debts in recent decades. Number one in this regard is Belgium, which had a national debt that was 1.41 times its GNP in 1992. The assumption that b = 1.41 instead of b = 0.576 lowers transfers at maximum fiscal size ("limit" transfers) from 58 percent of GNP to 55 percent of GNP, raises limit spending from 72 to 74 percent of GNP, and raises limit tax revenue from 68 to 69 percent of GNP.
(b) In the model, taxes on capital and interest are lump-sum taxes. Increasing the assumed values of each by 0.1 (so tk = 0.541 and tb = 0.419) raises limit transfers from 58 to 62 percent of GNP, raises limit spending from 72 to 76 percent of GNP, and raises limit tax revenue from 68 to 71 percent of GNP.
(c) An increase in the assumed value of capital's share from a = 0.391 to a = 0.441 lowers limit transfers from 58 to 57 percent of GNP, lowers limit spending from 72 to 70 percent of GNP, and lowers limit tax revenue from 68 to 66 percent of GNP. Replacement of the Cobb-Douglas production function with CES functions with elasticities of substitution of 0.5 and 1.5 (instead of 1.0 in the Cobb-Douglas case) affects limit transfers, spending and tax revenue as percentages of GNP only at the third (or in one case the second) significant figure.
(d) An increase in the assumed value of r by 0.01 affects limit transfers, spending, and tax revenue only at the third significant figure. A potentially more important change is to assume that arbitrage in long-run equilibrium leads the return on government debt (r ) to equal the return on private capital less a constant premium on government debt over private capital (p ), or
![]()
where q is a scaling factor needed because r is a traditionally-measured interest rate but the unit choice K = 1 implies that r is capital’s share of GNP. We find the appropriate value of q by noting that, in the calibration allocation, r takes a known value of 0.056 and from (11) r = a, so (13) implies q = (0.056 + p ) / a. We assume somewhat arbitrarily that p = 0.01. (Because changes in r affect results only at the third significant figure, this assumption is innocuous.) This change causes maximum transfers not to coincide with maximum spending and tax revenue. At maximum transfers, transfers are 60 percent of GNP, spending is 73 percent of GNP, and tax revenue is 67 percent of GNP. At maximum spending and revenue, transfers are 59 percent of GNP, spending is 72 percent of GNP, and tax revenue is 67 percent of GNP.
(e) It might be argued that labor supply behavior in the economies under study has not adjusted fully to 1988-1992 levels of taxation of labor income. We therefore assume as an alternative that the labor tax rate in the calibration allocation equals 0.430 instead of 0.480. This reduces limit transfers from 58 to 57 percent of GNP, reduces limit government spending from 72 to 71 percent of GNP, and reduces limit tax revenue from 68 to 67 percent of GNP.
Appendix B: Generalizing the Approach
To generalize to a framework with n ³ 2 households or individuals in an economy, each with specific values of the variables in (1') and (2), the government budget would be written
,
where j is an index of households or individuals. All national-income accounting relations in section 5.3 would continue to hold, mutatis mutandis.
To illustrate that disaggregate calculations would provide essentially similar but not necessarily identical results, consider an economy with three households, with w1l1 = 1, w2l2 = 1.5, w3l3 = 2, and k = b = N = F = D b = G = 0. Suppose a graduated labor income tax is the only tax and total tax payments are
0 if I < 1
.4(I - 1) if 1 £ I< 1.5
.5(I - 1.5) + .2 if 1.5 £ I <2
.6(I - 2 ) + .45 if 2 £ I
where I is household labor income. The three households face marginal tax rates of .4, .5, and .6, and make actual tax payments of R0 = 0 + .2 + .45 = .65. Full tax revenue is R = .4*1 + .5*1.5 + .6*2 = 2.35. Tax expenditures in this example are due solely to progressivity, with E = (.4*1) + (.5*1.5 - .2) + (.6*2 - .45) = 1.7, which equals R - R0.
Under an aggregate household view, note that aggregate labor income is 4.5, and take the aggregate marginal tax rate to be the median rate .5. Then full tax revenue would be R = .5*4.5 = 2.25, and tax expenditures would be R - R0 = 2.25 - .65 = 1.6. This example makes clear that choices of marginal rates matter, just as they matter in traditional calculations of tax expenditures.
Appendix C: Data
We study OECD economies over 1972-1992. Greece and Iceland are excluded due to lack of data. Data for Turkey begin in 1975. Calculation of transfers, spending, and full tax revenue in a given economy and year requires taking values of tl, wl, tk, rk, tb, r b, N, F, D b, and G for that economy and year. Calculation of tax expenditures requires, in addition, values of R0, ti, and T#. We study consolidated (general) government, which includes national (central) and subnational (state, local, provincial) budgetary government plus social security funds.
Tax-rate data are crucial. Consistent tax-rate data are available in OECD publications starting from 1972.
An important issue is that personal income taxes typically have progressive rate structures, so it is necessary to specify a rule for choosing the income level at which to evaluate each country’s marginal income tax rate. In the OECD data, this choice has been made: the income level is "the income of an average production worker". Our use of the OECD data thus interprets the aggregate household in a country as reflecting the experience of the average production worker in the country and compares countries based on the experiences of average production workers. It should be stressed that if income levels different from those of average production workers were chosen, marginal taxes would sometimes differ and hence so would calculated sizes of government. In this way, calculated sizes of government are conditional on the rule for choosing the incomes at which marginal income tax rates are evaluated.
We take the tax rate on labor income to include marginal effects from all taxes affecting the amount of potential consumption that can be obtained per marginal unit of labor. We do not include marginal effects due to means-testing of transfers because most means-tested transfers go to households with incomes below those of average production workers. Payroll taxes paid by the employer are typically levied at a given rate, denoted tp, per unit of gross wages paid to the employee. Denote the employee’s rate of marginal personal income taxes by tm, the rate of employee-paid payroll taxes by te, and let tm’ = tm + te. Then (1 + tp) currency units of employee labor costs result in 1 currency unit of gross wages. After personal income taxes and employee-paid payroll taxes
We take the marginal rates tm and tm’ to be .75 times the appropriate marginal rate in the OECD data for a married average production worker with two children plus .25 times the marginal rate for a single average production worker. Data on tm and tm’, and tp are from OECD (g), (h), and (i). OECD source material does not report values of tm and tm‘ for 1979-1981. Values are interpolated for these years. Because ti is the rate of indirect taxation of disposable income, we calculate ti as total indirect tax revenue divided by disposable income using data in OECD (e) and (f).
Total labor income (wl) is the sum of compensation of employees plus an estimate of the labor income of employers and own-account workers. Data on compensation of employees are directly available in national income accounts. To estimate the labor income of employers and own-account workers, we assume that compensation of employees divided by the number of employees equals labor income of employers and own-account workers divided by the number of employers and own-account workers. With this assumption, total labor income equals compensation of employees (source: OECD (e)) times one plus the ratio of the number of employers and own-account workers to the number of employees (sources: ILO and OECD (d)).
The tax rate on private capital income includes corporation and related business taxes (at rate tc), personal income taxes, indirect taxes, and property and wealth taxes (at rate tw). Starting from nominal private capital income of rk, we reason that (1 - tc)rk remains after corporation and related business taxes are drawn, (1 - tc)(1 - tm)rk remains after personal income taxes are drawn, and [(1- tc tc)(1 - tm)- tw]rk remains after property and wealth taxes are drawn, leaving [(1 - tc)(1- tm)- tw](1- ti)rk to be consumed. We thus calculate tk as tc + tm - tc tm + tw + ti – ti [tc + tm - tctm + tw]. We measure tc as the ratio of corporation income taxes to private capital income and tw as the ratio of total wealth and property taxes to private capital income (sources: OECD (e) and (f)), which assumes that the two ratios approximate the additional corporation taxes and property and wealth taxes that would be paid if private capital income in a country (rk) rose marginally. Given the relatively small magnitudes of tc and tw in our data, this approximation should affect resulting calculations of T, S, R, and E little (and in uncertain direction).
We take private capital income (rk) to be GNP (source: OECD (e)) minus total labor income and minus public capital income (F). The source for F for most countries is UN, Table 3.12, from which F is operating surplus plus property income net of interest. For other countries, the source is
IMF, from which F is nontax revenue minus other property income (largely interest) and minus fines, fees, forfeitures, penalties, and sales.
The tax rate on interest on government debt (tb) includes personal income taxes and indirect taxes. We calculate tb as tm + ti – tmti.
Government interest payments (r b) equal nominal interest paid minus interest received for consolidated government except in the cases of Japan and New Zealand, where net-interest data are for budgetary central government only. The source for r b for most countries is UN; for a few countries, the source is IMF. The increase in private holdings of government debt (D b) is also measured for consolidated government except in the cases of Japan and New Zealand, where available data are for budgetary central government (source: IMF).
Data on fines, fees, forfeitures, penalties, and sales (N) for most countries are from UN. For a few countries, data on N are from IMF.
Data used to form series of expenditures on public goods (G) and transfers (T# and T$$) are from OECD (e), UN, and IMF, which contain coarse breakdowns of public spending by categories. We count spending categorized as general public services, defense, public order and safety,
and economic services as G; count spending categorized as education, health, and housing and community amenities as T#; and count spending categorized as social security and welfare as T$$. The only other non-debt spending categories are recreational, cultural, and religious affairs, and other. These generally contain mixtures of public goods and transfers not subject to indirect taxes, so we count one-half of these expenditures as G and one-half as T#.
Data on on-budget tax revenue (R0) are from OECD (f).