The Advertising Spillover Model: Koyck Distributed Lags, Annual Data, and Compustat Evidence

May 5, 2026 by Ednaldo Silva

1. The Geometric Distributed-Lag Model

Like research and development, advertising is a risk-taking investment. The economic rationale is straightforward: a campaign launched in period t builds brand awareness and consumer loyalty that generate revenue and operating profit in the current and subsequent periods, with diminishing intensity. The canonical theoretical foundation is Nerlove and Arrow’s (1962) goodwill-stock model, in which advertising accumulates into an intangible asset that depreciates (amortizes) at a constant rate. The resulting revenue or profit process is a geometric distributed lag in current and past advertising expenditures.

Let R(t), X(t), and P(t) denote net revenue (Compustat REVT), advertising expenditure (XAD), and operating income [profit] before depreciation and amortization (OIBDP) in year t. In fact, X(t) can be any investment-like recurring expense with discernible spillover effects, such as research and development, that is treated for tax purpose (or under GAAP or IFRS) as deductible expenses to calculate profits before or after depreciation, depletion, or amortization. The structural model for revenue is:

\begin{aligned}
R(t) &= \alpha + \gamma \sum_{k=0}^{\infty} \lambda^k X(t-k) + u(t) \\
     &= \alpha + \gamma \Big[ X(t) + \lambda X(t-1) + \lambda^2 X(t-2) \\
     &\quad + \cdots \Big] + u(t)
\end{aligned}

The parameter γ is the impact multiplier: dollars of revenue generated per dollar of current-period advertising. The parameter λ (0 < λ < 1) is the carryover rate: the fraction of advertising’s productive effect surviving to the next period. The two parameters are structurally distinct and must be estimated separately. The same model applies when P(t) replaces R(t) as the dependent variable, and to any investment flow—research and development (XRD), advertising (XAD), net property, plant, and equipment (PPENT), capitalized software development costs, or any other expenditure that generates multi-period productive capacity.

SymbolDefinition
γImpact multiplier: revenue (or profit) per current-period advertising
λCarryover rate (AR(1)): fraction of advertising effect surviving one period (0 < λ < 1)
αIntercept of the structural distributed-lag model
u(t)Structural disturbance; assumed serially uncorrelated

2. The Koyck Transformation

The structural equation contains infinitely many regressors and is not directly estimable. Koyck (1954) showed that the infinite sum can be eliminated by a single algebraic operation. Write the structural equation at t and at t−1, multiply the latter by λ, and subtract:

\begin{aligned}
R(t) = \hat\delta_0 + \lambda R(t-1) + \hat\gamma X(t) + \varepsilon(t), \\ \quad \hat\delta_0 \equiv \hat\alpha(1-\lambda), \quad \varepsilon(t) \equiv u(t) - \hat\lambda u(t-1)
\end{aligned}

This three-parameter regression equation is the Koyck estimating form. The original structural parameters are recovered as α = δ₀ / (1−λ), and the following derived statistics carry economic content:

Derived QuantityFormulaInterpretation
Long-run multiplierγ̂ / (1 − λ̂)Revenue per one unit increase in advertising
Mean lagλ̂ / (1 − λ̂)Expected years for advertising effect to materialize
Median lag− ln 2 / ln(λ̂)Years at which 50% of total effect has been realized

3. Generalization to Any Investment Flow

The Koyck transformation is not specific to advertising. It applies to any expenditure that generates a depreciating (or amortization) stock of operating capacity. The identifying condition is that the investment flow, denoted Flow(t), accumulates into a stock according to the formula:

\text{Stock}(t) = \hat\lambda \cdot \text{Stock}(t-1) + \hat\gamma \cdot \text{Flow}(t)

This perpetual-inventory recurrence—familiar from the capital-stock literature; see Griliches (1979)—connects the transformation to several Compustat investment flows, as shown in the following table. In each case the estimating form is identical; only the economic interpretation of γ and λ changes.

Compustat FlowMnemonicAccumulated StockEconomic Content of λ
AdvertisingXADBrand equity / goodwillRetention of consumer awareness
Research & developmentXRDKnowledge / technology assetsObsolescence of technical know-how
Net PP&E investmentPPENTPhysical assetsPhysical depreciation rate
Capitalized softwareCOGS / separate itemSoftware assetsTechnological refresh rate

4. Econometric Considerations

The composite disturbance ε(t) = u(t) − λ u(t−1) is MA(1). Because R(t−1) is correlated with u(t−1), OLS applied to the Koyck form is inconsistent: the bias survives as N grows. A rival estimator is the instrumental-variable (IV) procedure of Liviatan (1963). The identifying assumption—that past advertising is correlated with past revenue but uncorrelated with the current composite error—holds when u(t) is serially uncorrelated and X(t) is predetermined. We employ the ARX algorithm in EdgarStat whenever the regression model includes a lagged dependent variable, such as R(t-1), plus an X-factor independent variable.

Serial correlation in the OLS residuals is tested by Durbin’s (1970) h-statistic, which replaces the invalid Durbin-Watson statistic when a lagged dependent variable is present:

h = \hat{\rho} \cdot \sqrt{\frac{N}{1 - N \cdot \mathrm{Var}(\hat{\lambda})}} \sim \mathcal{N}(0,1)

Annual data introduce a frequency-mismatch problem. Meta-compiled evidence (Clarke 1976) places the advertising effect half-life between two and six months for most consumer categories. At annual aggregation, intra-year decay is collapsed into the intercept δ₀ and the contemporaneous coefficient γ̂. The lagged dependent variable absorbs macroeconomic momentum unrelated to advertising, biasing λ̂ toward zero or even negative values. Annual data can identify the impact multiplier γ reliably; the carryover rate λ should be treated with caution and tested for its sign and plausibility. A weak-instrument F-statistic from the first stage of the IV procedure (rule of thumb: F ≥ 10) is a prerequisite diagnostic when the sample is small.

5. Empirical Example: The Gap, Inc. (Annual, 1976–2025)

ParameterR(t) equationP(t) equation
γ̂ (impact multiplier)1.885 (t = 1.7791)0.349 (t = 1.7785)
λ̂ (carryover rate, AR(1))0.8994 (t = 19.2835)0.7186 (t = 11.4104)
Adjusted R20.9940.8538

The impact multiplier γ̂ is statistically significant and empirically plausible in both R(t) and P(t) equations (we use 68% confidence level). For the revenue equation, one dollar of advertising generates $1.885 of contemporaneous revenue. For the profit equation, the multiplier of $0.349 per dollar of advertising is relevant to transfer pricing analysis under the CPM or TNMM, where the benchmark is an operating profit margin rather than a revenue multiplier. The P(t) specification is therefore the preferred equation for intangibles valuation purposes: advertising creates pricing power and profit margin, not merely sales volume. The regression results were computed using the ARX algorithm inside EdgarStat, and the t-statistics are adjusted Newey-West.

References

Fisher, Irving. 1925. “Our Unstable Dollar and the So-Called Business Cycle.” Journal of the American Statistical Association 20 (150): 179–202.

Theoretical and empirical. Fisher introduced geometrically weighted distributed lags to model the relationship between price level changes and business conditions, using U.S. price and output series. He is the proto-innovator of the geometric weighting scheme that Koyck later formalized into a general parameterization. Fisher did not derive the algebraic transformation that renders the model estimable.

Koyck, Leendert M. 1954. Distributed Lags and Investment Analysis. Amsterdam: North-Holland.

Theoretical, with empirical application to Dutch fixed capital investment data. Koyck’s specific and non-trivial innovation is the algebraic transformation that converts an equation with infinitely many regressors into a three-parameter autoregressive form. This distinguishes him from Fisher: Koyck used geometric weights and showed how to eliminate them. The model was developed for investment analysis, not advertising; its application to advertising came a decade later (Palda, 1964). Koyck is the innovator of the transformation; Fisher is the innovator of the geometric-weight concept.

Nerlove, Marc. 1958. Distributed Lags and Demand Analysis for Agricultural and Other Commodities. Agricultural Handbook No. 141. Washington, DC: USDA.

Theoretical and empirical; agricultural commodity data from U.S. government sources. Nerlove derived the identical estimating equation—a first-order autoregression in the dependent variable and the current regressor—grounded on a partial-adjustment model adjusted to a target level. Because the derivation is independent of Koyck’s, and rests on optimizing behavior rather than a statistical parameterization, Nerlove is a co-innovator of the estimating form. Includes a review of Fisher and Koyck.

Nerlove, Marc, and Kenneth J. Arrow. 1962. “Optimal Advertising Policy under Dynamic Conditions.” Economica 29 (114): 129–142.

Theoretical. Nerlove and Arrow introduced the goodwill-stock model of advertising: expenditure accumulates into an amortizable intangible, G(t) = (1−δ)G(t−1) + γ·Flow(t). This is the theoretical scaffold that justifies applying the Koyck estimating form to advertising data. Like boys in toyland, the authors display a naïve infatuation with optimality and an indulgent recourse to integral calculus to dress up a simple recursion already present in Koyck (1954).

Liviatan, Nissan. 1963. “Consistent Estimation of Distributed Lags.” International Economic Review 4 (1): 44–52.

Theoretical. Liviatan identified the inconsistency of OLS applied to the Koyck estimating form (due to the MA(1) composite error) and proposed the instrumental-variable (IV) remedy: instrument the lagged dependent variable with the lagged exogenous variable. This is a methodological follow-on to Koyck.

Palda, Kristian S. 1964. The Measurement of Cumulative Advertising Effects. Englewood Cliffs, NJ: Prentice-Hall.

Empirical. Palda was the first to apply the Koyck distributed-lag model specifically to advertising data, using annual sales and advertising expenditure for the Lydia E. Pinkham Medicine Company (1908–1960). He demonstrated statistically significant carryover and estimated a long-run advertising multiplier. Palda is the innovator of the empirical advertising-spillover literature; but he lacked statistical discipline, not estimating his dynamic specification.

Almon, Shirley. 1965. “The Distributed Lag Between Capital Appropriations and Expenditures.” Econometrica 33 (1): 178–196.

Empirical. Almon proposed polynomial distributed lags as an alternative to geometric (Koyck) weighting, applied to U.S. quarterly capital appropriations and expenditures data. Her approach imposes a polynomial shape on the lag weights rather than geometric decay, allowing for non-monotone lag distributions. Almon is an independent innovator of a competing parameterization, not a follower of Koyck; the two approaches answer different prior beliefs about lag-weight shape.

Durbin, James. 1970. “Testing for Serial Correlation in Least-Squares Regression When Some of the Regressors Are Lagged Dependent Variables.” Econometrica 38 (3): 410–421.

Theoretical. Durbin derived the h-statistic as a large-sample test for first-order serial correlation when the equation contains lagged dependent variables, replacing the Durbin-Watson statistic, which is asymptotically invalid in that setting. This is a methodological contribution that resolves a diagnostic gap in estimating Koyck and Nerlove equations. Durbin is a methodological innovator; his paper has no predecessor for this specific problem.

Clarke, Darral G. 1976. “Econometric Measurement of the Duration of Advertising Effect on Sales.” Journal of Marketing Research 13 (4): 345–357.

Empirical; meta (compilation) analysis of 69 published advertising econometric studies across consumer packaged goods, durables, and services. Clarke documented that the measured advertising effect half-life is typically two to six months, and that annual data aggregate intra-year decay into the contemporaneous coefficient, biasing the estimated carryover rate λ toward higher values. This is the key empirical finding motivating caution when the Koyck model is estimated on annual Compustat data.

Griliches, Zvi. 1979. “Issues in Assessing the Contribution of Research and Development to Productivity Growth.” Bell Journal of Economics 10 (1): 92–116.

Theoretical and empirical; industry-level and firm-level R&D and productivity data for U.S. manufacturing. Griliches established that R&D expenditure should be treated as a capital investment and that the perpetual-inventory method (equivalent to the Koyck recurrence Stock(t) = λ·Stock(t−1) + γ·Flow(t)) is the appropriate way to construct an R&D capital stock from flow data. His framework generalizes the Nerlove-Arrow goodwill model to all intangible investment flows—including XAD, XRD, and capitalized software development costs. Griliches is the innovator of the R&D-capital-stock literature and a follower of Koyck and Nerlove-Arrow on the modeling structure.

Sethuraman, Raj, Gerard J. Tellis, and Richard A. Briesch. 2011. “How Well Does Advertising Work? Generalizations from Meta-Analysis of Brand Advertising Elasticities.” Journal of Marketing Research 48 (3): 457–471.

Empirical; meta (compilation) analysis of 751 short-run and 402 long-run brand advertising elasticity estimates from 56 studies. Sethuraman et al. documented that short-run advertising elasticity averages approximately 0.12 across product categories, and that long-run elasticities (capturing carryover) are larger. The apparel and specialty retail range relevant to the Gap example is at the upper end of the cross-category distribution. This paper is the standard empirical reference for benchmarking estimated advertising multipliers. Meta-analysis is the quantitative synthesis of empirical results across multiple independent published studies that estimate the same underlying parameter — typically an effect size such as an elasticity, correlation, or regression coefficient.