Title: | Quantification of Qualitative Survey Data |
---|---|
Description: | Provides different functions for quantifying qualitative survey data. It supports the Carlson-Parkin method, the regression approach, the balance approach and the conditional expectations method. |
Authors: | Joachim Zuckarelli <[email protected]> |
Maintainer: | Joachim Zuckarelli <[email protected]> |
License: | GPL-3 |
Version: | 0.2.0 |
Built: | 2025-03-28 05:06:46 UTC |
Source: | https://github.com/jsugarelli/quantification |
This package provides different functions for quantifying qualitative survey data. It supports the Carlson-Parkin method (developed by Carlson/Parkin (1985)), the regression approach (developed by Pesaran (1984)), the balance approach (see Batchelor (1984)) and the conditional expectations method.
These methods are usually applied in economics to quantify qualitative inflation expectations collected through surveys (e.g. consumer surveys). However, the same approaches can be used to quantify any qualitative expectation about the change of a variable that is quantitative in nature.
The four functions of the package – cp
for the Carlson-Parkin method, ra
for the regression approach, bal
for the balance approach and ce
for the conditional expectations method – allow the user to customize a wide range of parameters and make use of certain extensions of the original methods. Apart from that all the functions deliver per default two versions of quantified expectations: one under the assumption that the survey respondents form expectations over the absolute change of the variable in question and one under the assumption of expectations over the relative change. The functions also provide the user with standard measures for the forecast quality of the quantified expectations enabling the user to quickly assess the effect of a change in the quantification method used.
Your comments and suggestions are highly appreciated.
Joachim Zuckarelli
Maintainer: Joachim Zuckarelli <[email protected]>
Batchelor, R.A. (1984), Quantitative vs. qualitative measures of inflation expectations, Oxford Bulletin of Economics and Statistics 48 (2), 99–120.
Carlson, J. A./Parkin, M. (1975), Inflation expectations, Economica 42, 123–138.
Henzel, S./Wollmershaeuser, T. (2005), Quantifying inflation expectations with the Carlson-Parkin method: A survey-based determination of the just noticeable difference, Journal of Business Cycle Measurement and Analysis 2, 321–352.
Nardo, M. (2003), The quantification of qualitative survey data: a critical assessment, Journal of Economic Surveys 17 (5), 645–668.
Pesaran, M. (1984), Expectations formation and macroeconomic modelling, in: Malgrange, M. (1984), Contemporary macroeconomic modelling, 27–55.
Zuckarelli, J. (2015): A new method for quantification of qualitative expectations, Economics and Business Letters 3(5), Special Issue Energy demand forecasting, 123-128.
bal
implements the balance approach for the quantification of qualitative survey data. A description of the method can be found in Batchelor (1986).
bal(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), growth.limit = NA, suppress.warnings = FALSE)
bal(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), growth.limit = NA, suppress.warnings = FALSE)
y.series |
a numerical vector containing the variable whose change is the subject of the qualitative survey question. If, for example the survey asks participants to assess whether inflation will increase, decrease or stay the same, |
survey.up |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.same |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.down |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
forecast.horizon |
a numeric value defining the number of periods the survey question looks in to the future. If the data in |
first.period |
an optional numeric value indexing the first period for which survey data in |
last.period |
an optional numeric value indexing the last period for which survey data in |
growth.limit |
serves to limit the effect of outliers when expectations are quantified under the assumption that survey respondents form expectations on the percentage change of |
suppress.warnings |
a logical value indicating if runtime warnings shall be displayed ( |
bal
provides two alternative versions of quantified expectations, depending on the assumed expectation formation process of survey respondents. The basic common assumption of the balance method is that survey participants are asked to assess whether variable y
will go up or down or stay the same. Survey respondents can now form expectations on either the absolute or the relative change of y
which differ because the scaling factors (thetas) used to scale the difference between the shares of 'up' and the 'down' respondents are calculated differently in each case. The bal
function calculates both versions.
The survey result vectors survey.up
, survey.down
and survey.same
as well as the variable y.series
must be of the same length and must cover the forecasted horizon (i.e. last.period
+ forecast.horizon
length(survey.up)
).
Data in survey.up
, survey.down
and survey.same
outside the survey period interval [first.period, last.period]
are ignored. Similiarly, y.series
data with a period index greater than last.period
is ignored.
survey.up
, survey.down
and survey.same
need not sum up to 100%
or 1
(which may happen, for example, if the survey has a 'Don't know' answer option).
y.e.mean.abs
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
y.e.mean.perc
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.abs
: a numeric vector containing the quantified mean absolute change of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.perc
: a numeric vector containing the quantified mean percentage change of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.sd.abs
: a numeric vector containing the standard deviation of the absolute change expectation for variable y
in the population. Assumes that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.sd.perc
: a numeric vector containing the standard deviation of the absolute change expectation for variable y
in the population. Assumes that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
theta.abs
: a numeric vector containing the estimated factor which scales the difference between the shares of 'up' respondents and 'down' respondents assuming that survey respondents form expectations on the absolute change in variable y
. For all periods which are not in scope of the survey the value is NA
.
theta.perc
: a numeric vector containing the estimated factor which scales the difference between the shares of 'up' respondents and 'down' respondents assuming that survey respondents form expectations on the relative change in variable y
. For all periods which are not in scope of the survey the value is NA
.
nob
: a numeric value showing the number of periods for which expectations have been quantified.
mae.abs
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the absolute change in variable y
.
rmse.abs
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the absolute change in variable y
.
mae.perc
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the relative change in variable y
.
rmse.perc
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the relative change in variable y
.
Zuckarelli, Joachim (2014). Quantification of qualitative survey data in R.
R package version 1.0.0. http://CRAN.R-project.org/package=quantification
Joachim Zuckarelli, [email protected]
Batchelor, R.A. (1984), Quantitative vs. qualitative measures of inflation expectations, Oxford Bulletin of Economics and Statistics 48 (2), 99–120.
quantification-package
, cp
, ra
, ce
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call bal for quantification quant.bal<-bal(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4)
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call bal for quantification quant.bal<-bal(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4)
ce
implements the Conditional Expectations approach for the quantification of qualitative survey data. The method calculates expectations on a distribution of past realizations of the variable of interest (variable y
), conditional on the expectation of either an increase or a decrese in y
. These conditional expectations are then weighted with the share of survey respondents expecting variable y
to rise or fall, respectively. For details see
ce(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 11, last.period = (length(survey.up) - forecast.horizon), exp.horizon.type = "moving", mov.horizon.length = 10, fix.horizon.start = 1, fix.horizon.end = 10, distrib.param = "mean", suppress.warnings = FALSE)
ce(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 11, last.period = (length(survey.up) - forecast.horizon), exp.horizon.type = "moving", mov.horizon.length = 10, fix.horizon.start = 1, fix.horizon.end = 10, distrib.param = "mean", suppress.warnings = FALSE)
y.series |
a numerical vector containing the variable whose change is the subject of the qualitative survey question. If, for example the survey asks participants to assess whether inflation will increase, decrease or stay the same, |
survey.up |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.same |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.down |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
forecast.horizon |
a numeric value defining the number of periods the survey question looks in to the future. If the data in |
first.period |
an optional numeric value indexing the first period for which survey data in |
last.period |
an optional numeric value indexing the last period for which survey data in |
exp.horizon.type |
an optional character vector indicating the type of experience horizon to be used. The experience horizon is the time period over which the distribution of variable
Default value is " |
mov.horizon.length |
an optional numeric value indicating the length of the (moving) forecast horizon. Is only considered when |
fix.horizon.start |
an optional numeric value indicating the first period of the (fixed) forecast horizon. Is only considered when |
fix.horizon.end |
an optional numeric value indicating the last period of the (fixed) forecast horizon. Is only considered when |
distrib.param |
an optional character vector indicating the distribution parameter that shall be used for calculating conditional expectations based on the distribution of variable |
suppress.warnings |
a logical value indicating if runtime warnings shall be displayed ( |
The survey result vectors survey.up
, survey.down
and survey.same
as well as the variable y.series
must be of the same length and must cover the forecasted horizon (i.e. last.period
+ forecast.horizon
length(survey.up)
).
Data in survey.up
, survey.down
and survey.same
outside the survey period interval [first.period, last.period]
are ignored. Similiarly, y.series
data with a period index greater than last.period
is ignored.
survey.up
, survey.down
and survey.same
need not sum up to 100%
or 1
(which may happen, for example, if the survey has a 'Don't know' answer option).
ce
returns a list containing the quantified survey data and some meta information. The list has the following elements:
y.e
: a numeric vector containing the quantified expectations of the variable y
.
nob
: a numeric value showing the number of periods for which expectations have been quantified.
mae
: a numeric value showing the mean absolute error (MAE) of expectations.
rmse
: a numeric value showing the root mean squared error (RMSE) of expectations.
Zuckarelli, Joachim (2014). Quantification of qualitative survey data in R.
R package version 1.0.0. http://CRAN.R-project.org/package=quantification
Joachim Zuckarelli, [email protected]
Zuckarelli, J. (2015): A new method for quantification of qualitative expectations, Economics and Business Letters 3(5), Special Issue Energy demand forecasting, 123-128.
quantification-package
, cp
, bal
, ra
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.5, 1.5, 1.5, 1.1, 0.9, 1.3, 1.3, 1.2, 1.7, 1.7, 1.5, 2, 1.4, 1.9, 1.9, 2.3, 2.8, 2.5, 2.1, 2.1, 1.9, 1.9, 1.5, 1.6, 2.1, 1.8, 2.1, 1.5, 1.3, 1.1, 1.1, 1.3, 1.3, 1.3, 1.1, 1.1, 1, 1.2, 1.1, 0.9) answer.up<-c(72.7, 69.7, 60.9, 53.7, 54.9, 54.8, 56.1, 51.7, 62.2, 54.2, 39.8, 18.6, 5.4, 8.2, 8.6, 8.5, 16, 18.9, 7.7, 6.5, 6.4, 7, 7.4, 6.8, 9.5, 17.1, 13.1, 21.5, 22.7, 26.9, 32.4, 20.2, 20.4, 15.8, 11.4, 7.9, 11.3, 10, 11.3, 9.7) answer.same<-c(24.1, 22.8, 24.3, 26.2, 31.1, 35.4, 33, 35.5, 27.4, 24.8, 32.1, 44.8, 41.8, 37.9, 33.2, 30.9, 29.9, 22.1, 17.2, 15.5, 21.8, 25.2, 23.2, 24.2, 32.9, 31.2, 42.2, 50.5, 52.5, 56.3, 53.8, 62.8, 65.6, 63, 60.3, 61.1, 57.8, 63, 61.4, 61.9) answer.down<-c(3.2, 7.5, 14.8, 20.1, 14, 9.8, 10.9, 12.8, 10.4, 21, 28.1, 36.6, 52.8, 53.9, 58.2, 60.6, 54.1, 59, 75.1, 78, 71.8, 67.8, 69.4, 69, 57.6, 51.7, 44.7, 28, 24.8, 16.8, 13.8, 17, 14, 21.2, 28.3, 31, 30.9, 27, 27.3, 28.4) ## Call ce for quantification quant.ce<-ce(inflation, answer.up, answer.same, answer.down, first.period=30, last.period=36, forecast.horizon=4, exp.horizon.type = "fix", fix.horizon.start = 1, fix.horizon.end = 29)
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.5, 1.5, 1.5, 1.1, 0.9, 1.3, 1.3, 1.2, 1.7, 1.7, 1.5, 2, 1.4, 1.9, 1.9, 2.3, 2.8, 2.5, 2.1, 2.1, 1.9, 1.9, 1.5, 1.6, 2.1, 1.8, 2.1, 1.5, 1.3, 1.1, 1.1, 1.3, 1.3, 1.3, 1.1, 1.1, 1, 1.2, 1.1, 0.9) answer.up<-c(72.7, 69.7, 60.9, 53.7, 54.9, 54.8, 56.1, 51.7, 62.2, 54.2, 39.8, 18.6, 5.4, 8.2, 8.6, 8.5, 16, 18.9, 7.7, 6.5, 6.4, 7, 7.4, 6.8, 9.5, 17.1, 13.1, 21.5, 22.7, 26.9, 32.4, 20.2, 20.4, 15.8, 11.4, 7.9, 11.3, 10, 11.3, 9.7) answer.same<-c(24.1, 22.8, 24.3, 26.2, 31.1, 35.4, 33, 35.5, 27.4, 24.8, 32.1, 44.8, 41.8, 37.9, 33.2, 30.9, 29.9, 22.1, 17.2, 15.5, 21.8, 25.2, 23.2, 24.2, 32.9, 31.2, 42.2, 50.5, 52.5, 56.3, 53.8, 62.8, 65.6, 63, 60.3, 61.1, 57.8, 63, 61.4, 61.9) answer.down<-c(3.2, 7.5, 14.8, 20.1, 14, 9.8, 10.9, 12.8, 10.4, 21, 28.1, 36.6, 52.8, 53.9, 58.2, 60.6, 54.1, 59, 75.1, 78, 71.8, 67.8, 69.4, 69, 57.6, 51.7, 44.7, 28, 24.8, 16.8, 13.8, 17, 14, 21.2, 28.3, 31, 30.9, 27, 27.3, 28.4) ## Call ce for quantification quant.ce<-ce(inflation, answer.up, answer.same, answer.down, first.period=30, last.period=36, forecast.horizon=4, exp.horizon.type = "fix", fix.horizon.start = 1, fix.horizon.end = 29)
cp
implements the method for the quantifcation of qualitative survey data proposed by Carlson/Parkin (1985). Additionally, it provides certain extensions of the Carlson-Parkin approach (e.g. other distributions than the normal distribution, indifference limens depending on the level of the forecasted variable).
cp(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), limen.type = "carlson.parkin", const.limen = 0, user.symm.limen = 0, user.upper.limen = 0, user.lower.limen = 0, correct.zero = TRUE, correct.by = 0.01, growth.limit = NA, distrib.type = "normal", distrib.mean = 0, distrib.sd = 1, distrib.log.location = 0, distrib.log.scale = 1, distrib.t.df = (last.period - first.period), suppress.warnings = FALSE)
cp(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), limen.type = "carlson.parkin", const.limen = 0, user.symm.limen = 0, user.upper.limen = 0, user.lower.limen = 0, correct.zero = TRUE, correct.by = 0.01, growth.limit = NA, distrib.type = "normal", distrib.mean = 0, distrib.sd = 1, distrib.log.location = 0, distrib.log.scale = 1, distrib.t.df = (last.period - first.period), suppress.warnings = FALSE)
y.series |
a numerical vector containing the variable whose change is the subject of the qualitative survey question. If, for example the survey asks participants to assess whether inflation will increase, decrease or stay the same, |
survey.up |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.same |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.down |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
forecast.horizon |
a numeric value defining the number of periods the survey question looks in to the future. If the data in |
first.period |
an optional numeric value indexing the first period for which survey data in |
last.period |
an optional numeric value indexing the last period for which survey data in |
limen.type |
an optional character vector describing the type of indifference limen that shall be used for quantification. Possible values are:
|
const.limen |
an optional numeric value containing the symmetric, time-invariant user-defined indifference limen. Must be provided when " |
user.symm.limen |
an optional numeric vector containing the symmetric, time-varying indifference limen. Must be provided when " |
user.upper.limen |
an optional numeric vector containing the upper, time-varying indifference limen. Must be provided when " |
user.lower.limen |
an optional numeric vector containing the lower, time-varying indifference limen. Must be provided when " |
correct.zero |
an optional logical value steering the automatic correction of zero-values in |
correct.by |
an optional parameter indicating the amount by which |
growth.limit |
serves to limit the effect of outliers when expectations are quantified under the assumption that survey respondents form expectations on the percentage change of |
distrib.type |
an optional character vector describing the type of distribution used for quantification. Possible values are:
|
distrib.mean |
an optional numerical value defining the mean of the normal distribution (used in case |
distrib.sd |
an optional numerical value defining the standard deviation of the normal distribution (used in case |
distrib.log.location |
an optional numerical value defining the location of the logistic distribution (used in case |
distrib.log.scale |
an optional numerical value defining the scale of the logistic distribution (used in case |
distrib.t.df |
an optional numerical value defining the degrees of freedom (df) of the t distribution (used in case |
suppress.warnings |
a logical value indicating if runtime warnings shall be displayed ( |
cp
provides two alternative versions of quantified expectations, depending on the assumed expectation formation process of survey respondents. The basic common assumption of the Carlson-Parkin method is that survey participants are asked to assess whether variable y
will go up or down or stay the same. Survey respondents can now form expectations on either the absolute or the relative change of y
which differ because the indifferent limens used for quantification are calculated differently in each case. The cp
function calculates both versions.
The survey result vectors survey.up
, survey.down
and survey.same
as well as the variable y.series
must be of the same length and must cover the forecasted horizon (i.e. last.period
+ forecast.horizon
length(survey.up)
).
Data in survey.up
, survey.down
and survey.same
outside the survey period interval [first.period, last.period]
are ignored. Similiarly, y.series
data with a period index greater than last.period
is ignored.
first.period
must be greater than forecast.horizon
, because indifference limens use the current change of variable y
for calibration. In order to calculate the change in y
for the survey obersavation with index first.period
the observation of y
with index first.period - forecast.horizon
is required.
survey.up
, survey.down
and survey.same
need not sum up to 100%
or 1
(which may happen, for example, if the survey has a 'Don't know' answer option).
The Weber-Fechner option (see Henzel/Wollmershaeuser (2005) for details) weighs the 'expectations' term in the traditional Carlson-Parkin limen calculation with the current inflation rate. The resulting time-invariant value is applied as a proportionality factor to current inflation leading to a time-varying, inflation-proportional indifference limen.
When the indifference limen is user-defined (i.e. limen.type="symm.series"
or limen.type="asymm.series"
) then each limen value needs to be placed in such a way in the limen vector(s) (i.e. user.limen
or user.upper.limen
/ user.lower.limen
) that it has the same index as the survey observation (in survey.up
, survey.down
and survey.same
) to which it belongs.
cp
returns a list containing the quantified survey data and some meta information. The list has the following elements:
y.e.mean.abs
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
y.e.mean.perc
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.abs
: a numeric vector containing the quantified mean absolute change of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.perc
: a numeric vector containing the quantified mean percentage change of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.sd.abs
: a numeric vector containing the standard deviation of the absolute change expectation for variable y
in the population. Assumes that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.sd.perc
: a numeric vector containing the standard deviation of the absolute change expectation for variable y
in the population. Assumes that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
limen.abs
: a numeric vector containing the estimated (or user-defined) indifference limens assuming that survey respondents form expectations on the absolute change in variable y
. For all periods which are not in scope of the survey the value is NA
.
limen.perc
: a numeric vector containing the estimated (or user-defined) indifference limens assuming that survey respondents form expectations on the relative change in variable y
. For all periods which are not in scope of the survey the value is NA
.
nob
: a numeric value showing the number of periods for which expectations have been quantified.
mae.abs
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the absolute change in variable y
.
rmse.abs
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the absolute change in variable y
.
mae.perc
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the relative change in variable y
.
rmse.perc
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the relative change in variable y
.
Zuckarelli, Joachim (2014). Quantification of qualitative survey data in R.
R package version 0.1.0. http://CRAN.R-project.org/package=quantification
Joachim Zuckarelli, [email protected]
Carlson, J. A./Parkin, M. (1975), Inflation expectations, Economica 42, 123–138.
Henzel, S./Wollmershaeuser, T. (2005), Quantifying inflation expectations with the Carlson-Parkin method: A survey-based determination of the just noticeable difference, Journal of Business Cycle Measurement and Analysis 2, 321–352.
Nardo, M. (2003), The quantification of qualitative survey data: a critical assessment, Journal of Economic Surveys 17 (5), 645–668.
quantification-package
, bal
, ra
, ce
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call cp for quantification quant.cp.limens<-cp(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4) ## With Weber-Fechner limens instead of Carson-Parkin limens quant.wf.limens<-cp(inflation, answer.up, answer.same, answer.same, first.period=5, last.period=7, forecast.horizon=4, limen.type="weber.fechner")
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call cp for quantification quant.cp.limens<-cp(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4) ## With Weber-Fechner limens instead of Carson-Parkin limens quant.wf.limens<-cp(inflation, answer.up, answer.same, answer.same, first.period=5, last.period=7, forecast.horizon=4, limen.type="weber.fechner")
ra
implements the regression approach developed by Pesaran (1984).
ra(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), distrib.type = "normal", distrib.mean = 0, distrib.sd = 1, distrib.log.location = 0, distrib.log.scale = 1, distrib.t.df = (first.period - last.period), growth.limit = NA, symmetry.error = "white", suppress.warnings = FALSE)
ra(y.series, survey.up, survey.same, survey.down, forecast.horizon, first.period = 1, last.period = (length(survey.up) - forecast.horizon), distrib.type = "normal", distrib.mean = 0, distrib.sd = 1, distrib.log.location = 0, distrib.log.scale = 1, distrib.t.df = (first.period - last.period), growth.limit = NA, symmetry.error = "white", suppress.warnings = FALSE)
y.series |
a numerical vector containing the variable whose change is the subject of the qualitative survey question. If, for example the survey asks participants to assess whether inflation will increase, decrease or stay the same, |
survey.up |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.same |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
survey.down |
a numerical vector containing the number or the share of survey respondents expecting the variable contained in |
forecast.horizon |
a numeric value defining the number of periods the survey question looks in to the future. If the data in |
first.period |
an optional numeric value indexing the first period for which survey data in |
last.period |
an optional numeric value indexing the last period for which survey data in |
distrib.type |
an optional character vector describing the type of distribution used for quantification. Possible values are:
|
distrib.mean |
an optional numerical value defining the mean of the normal distribution (used in case |
distrib.sd |
an optional numerical value defining the standard deviation of the normal distribution (used in case |
distrib.log.location |
an optional numerical value defining the location of the logistic distribution (used in case |
distrib.log.scale |
an optional numerical value defining the scale of the logistic distribution (used in case |
distrib.t.df |
an optional numerical value defining the degrees of freedom (df) of the t distribution (used in case |
growth.limit |
serves to limit the effect of outliers when expectations are quantified under the assumption that survey respondents form expectations on the percentage change of |
symmetry.error |
an optional character vector indicating the type of standard error used for testing the symmetry of the estimated upper and lower indifference limens. Can be either " |
suppress.warnings |
a logical value indicating if runtime warnings shall be displayed ( |
ra
estimates the time-invariant, asymmetric indifference limens using OLS regression with non-robust standard errors.
The function ra
provides two alternative versions of quantified expectations, depending on the assumed expectation formation process of survey respondents. The basic common assumption of the regression approach is that survey participants are asked to assess whether variable y
will go up or down or stay the same. Survey respondents can now form expectations on either the absolute or the relative change of y
. The reg
function calculates both versions.
The survey result vectors survey.up
, survey.down
and survey.same
as well as the variable y.series
must be of the same length and must cover the forecasted horizon (i.e. last.period
+ forecast.horizon
length(survey.up)
).
Data in survey.up
, survey.down
and survey.same
outside the survey period interval [first.period, last.period]
are ignored. Similiarly, y.series
data with a period index greater than last.period
is ignored.
survey.up
, survey.down
and survey.same
need not sum up to 100%
or 1
(which may happen, for example, if the survey has a 'Don't know' answer option).
y.e.mean.abs
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
y.e.mean.perc
: a numeric vector containing the quantified mean expectations of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.abs
: a numeric vector containing the quantified mean absolute change of the variable y
, assuming that survey respondents form expectations on the absolute change in y
. For all periods which are not in scope of the survey the value is NA
.
delta.y.e.mean.perc
: a numeric vector containing the quantified mean percentage change of the variable y
, assuming that survey respondents form expectations on the relative change in y
. For all periods which are not in scope of the survey the value is NA
.
upper.limit.abs
: a numeric value containing the estimated upper indifference limen when survey respondents form expectations on the absolute change in variable y
.
lower.limit.abs
: a numeric value containing the estimated upper indifference limen when survey respondents form expectations on the absolute change in variable y
.
upper.limit.per
: a numeric value containing the estimated upper indifference limen when survey respondents form expectations on the relative change in variable y
.
lower.limit.perc
: a numeric value containing the estimated upper indifference limen when survey respondents form expectations on the relative change in variable y
.
nob
: a numeric value showing the number of periods for which expectations have been quantified.
mae.abs
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the absolute change in variable y
.
rmse.abs
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the absolute change in variable y
.
mae.perc
: a numeric value showing the mean absolute error (MAE) of expectations when survey respondents form expectations on the relative change in variable y
.
rmse.perc
: a numeric value showing the root mean squared error (RMSE) of expectations when survey respondents form expectations on the relative change in variable y
.
symmetry.abs
: a numeric value containing the p-value of the test for symmetry of the estimated indifference limens when survey respondents form expectations on the absolute change in variable y
. The standard error used for testing depends on the argument symmetry.error
.
symmetry.perc
: a numeric value containing the p-value of the test for symmetry of the estimated indifference limens when survey respondents form expectations on the relative change in variable y
. The standard error used for testing depends on the argument symmetry.error
.
Zuckarelli, Joachim (2014). Quantification of qualitative survey data in R.
R package version 1.0.0. http://CRAN.R-project.org/package=quantification
Joachim Zuckarelli, [email protected]
MacKinnon, J.G./White, H. (1985), Some heteroskedasticity-consistent covariance matrix estimators with immproved finite sample properties, Journal of Econometrics 29, 305–325.
Pesaran, M. (1984), Expectations formation and macroeconomic modelling, in: Malgrange, M. (1984), Contemporary macroeconomic modelling, 27–55.
quantification-package
, cp
, bal
, ce
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call ra for quantification quant.ra<-ra(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4, symmetry.error="small.sample")
## Data preparation: generate a sample dataset with inflation and survey data inflation<-c(1.7, 1.9, 2, 1.9, 2, 2.1, 2.1, 2.1, 2.4, 2.3, 2.4) answer.up<-c(67, 75.1, 76.4, 72.4, 69.7, 49.7, 45.2, 31.6, 14.9, 19.3, 19.2) answer.same<-c(30.1, 19.6, 19.5, 21.3, 20.1, 33.1, 34.4, 33.5, 44.6, 38.1, 35.3) answer.down<-c(2.9, 5.3, 4.1, 6.3, 10.2, 17.2, 20.4, 34.9, 40.5, 42.6, 45.5) ## Call ra for quantification quant.ra<-ra(inflation, answer.up, answer.same, answer.down, first.period=5, last.period=7, forecast.horizon=4, symmetry.error="small.sample")