Markov Chain Models for Rolling Cross-section Data: How Campaign
by user
Comments
Transcript
Markov Chain Models for Rolling Cross-section Data: How Campaign
Markov Chain Models for Rolling Cross-section Data: How Campaign Events and Political Awareness Aect Vote Intentions and Partisanship in the United States and Canada by Walter R. Mebane, Jr.y and Jonathan Wandz April 7, 1997 Prepared for delivery at the 1997 Annual Meeting of the Midwest Political Science Association, April 10{12, Palmer House Hilton, Chicago, IL, Political Methodology Section. Thanks to Jasjeet Sekhon for helpful discussion of several important points. Thanks to Jonathan Cowden for letting tron help macht, p2c2e and tempter with the computing. The authors share equal (or at least equally distributed) responsibility for all errors. y Associate Professor, Department of Government, Cornell University, [email protected]. z Doctoral Student, Department of Government, Cornell University, [email protected]. Abstract Markov Chain Models for Rolling Cross-section Data: How Campaign Events and Political Awareness Aect Vote Intentions and Partisanship in the United States and Canada We use a new approach we have developed for estimating discrete, nite-state Markov chain models from \macro" data to analyze the dynamics of individual choice probabilities in two collections of rolling cross-sectional survey data that were designed to support investigations of what happens to voters' information and preferences during campaigns. Using data from the 1984 American National Election Studies Continuous Monitoring Study, we show that not only did individual party identication vary substantially during the year, but the dynamics of party identication changed signicantly in response to the conclusion of the Democratic party's nomination contest. Party identication appears to have measurement error only when the model misspecies the dynamics. There are rapid oscillations among some categories of partisanship that may reect individual stances regarding not only competition between the parties but also competition among party factions. Using data from the 1993 Canadian Election Study, we show that the critical events that shaped voting intentions in the election varied tremendously depending on an individual's level of political awareness, and that the eects of awareness varied across regions of the country. Do electoral campaigns matter? Theories of resources and information (\momentum") in American presidential elections suggest some ways that they do (Brams 1978; Aldrich 1980; Gurian 1987; Bartels 1988), and both spatial theory (Hinich and Munger 1997) and popular wisdom suggest that in all kinds of elections campaigns have a great potential to make a dierence at least some of the time. But some have suggested ways that volatility one may observe during campaigns may be more apparent than real (Green and Palmquist 1990; Gelman and King 1993). We use a new approach we have developed for estimating discrete, nite-state Markov chain (DFMC) models from \macro" data to analyze the dynamics of individual choice probabilities in two collections of rolling cross-sectional survey data that were designed to support investigations of what happens to voters' information and preferences during campaigns.1 One set of data comes from the United States and one from Canada. From both countries we nd strong evidence that the answer is, \Yes, campaigns do matter." First we estimate the degree of instability and the amount of measurement error in individual Americans' party identications during 1984, using the 1984 American National Election Studies Continuous Monitoring Study (ANES-CMS) (Miller and the National Election Studies 1985). Green and Palmquist (1990) argue that individual party identication exhibits a high degree of stability once measurement error is taken into account. We nd not only that individuals' partisan propensities varied substantially during the year, but that the dynamics changed signicantly in response to the conclusion of the Democratic party's nomination contest. During the early part of the primary season, many Republicans were briey shifting into \Democrat Leaner" status. But as soon as the contest between Mondale and Hart was eectively decided in favor of the former, Republican irtations with a Democrat identication ceased, to be replaced by a slow but steady 1 In the usage of the econometrics literature on estimation for Markov chain parameters, \macro" data are data in which the state-to-state transitions over time for each individual are not observed, but rather one observes at each of a number of times a distinct cross-section of individuals and their current states. 1 stream of Democrat Leaners becoming Republicans. There is an appearance of substantial measurement error in the survey items that measure party identication only when the model is incorrectly specied to ignore the qualitative change in the dynamics. Second we study changes in individual Canadians' voting intentions during the campaign period of the 1993 Canadian federal election, using the 1993 Canadian Election Study. Several authors have pointed to a few key dates during the campaign at which events occurred that they say signicantly eroded support for the incumbent Progressive Conservative party (Frizzel, Pammet and Westall 1994; Johnston, Nevitte and Brady 1994). We verify that the dynamics of support outside of Quebec for the Progressive Conservative, Liberal and Reform parties did change at the suggested dates. But the critical events that shaped voting intentions in the election varied tremendously depending on an individual's level of political awareness, and the eects of awareness varied across regions of the country. Previous analyses of Markov chain models of voter transitions have focused on transitions between two elections, using either panel survey data (Miller 1972; Browne and Payne 1986) or aggregate vote returns (McCarthy and Ryan 1977). Studies of the former class have an advantage of observing transitions between states, but both types have suered from small numbers of time periods. The economics literature on Markov chain models treats estimation problems more akin to the ones we encounter in trying to model unobserved transitions over numerous time periods (Lee et al. 1970; McRae 1977; Rosenqvist 1986, Mot 1993). Our new approach avoids the combinatorial diculties that prevented Rosenqvist and McRae from using full maximum likelihood techniques. Most directly comparable is the work by Mot (1993) that uses maximum likelihood to estimate a binary state model for female labor supply. But the assumptions Mot makes about the initial probability vector constrains his model to the analysis of binary states and would be untenable for 2 most political science research.2 We use maximum likelihood to estimate parameters that represent the initial probability vector and each row of the transition matrices as functions of individual-level characteristics.3 We use a multinomial logit functional form to specify the initial probability vector and each row of each transition matrix. An important feature of our model is that the initial probability vector is estimated as a function of all of the data, rather than simply the observations from the rst time period. The initial probability estimates are therefore fully ecient; estimates of the distribution of voters at the beginning of the Markov chain are not determined solely by the small survey samples obtained for the rst time period. By allowing transition matrices to change during the time covered by the data, we can test whether campaign events changed the structure of individuals' decision making. In particular, we can test whether there were changes in the rate at which individuals moved between voting preferences or changed partisanship. Party Identication in the United States Green and Palmquist (1990) call into question work by many authors4 who in various ways have argued against the original conception of party identication (PID) as a highly stable individual orientation that does not respond signicantly to short-term political stimuli (Converse, Campbell, 2 Mot constrains the initial probabilities distribution between the two states to be one and zero. He argues, \as most outcome variables are dened pi0 will be zero|an individual is unemployed at the beginning of an unemployment spell, not working prior to entering the labor force, unmarried at the beginning of the lifetime, and so on" (Mot 1993, fn 17). 3 To solve the severely nonconcave optimization problem we use GENOUD (Mebane and Sekhon 1996), a computer program that combines evolution programming (drawing on both genetic algorithm and homotopy search techniques) with quasi-Newton methods. 4 Green and Palmquist cite Jackson 1975a, Jackson 1975b, Meier 1975, Page and Jones 1979, Markus and Converse 1979, Fiorina 1981, Erikson 1982, Franklin and Jackson 1983, Franklin 1984, and Brody and Rothenberg 1988. 3 Miller and Stokes 1960). Green and Palmquist argue that impressions of instability and reactivity in PID are in large part due to failures to take into account measurement error in the PID scales. They base their argument on a \quasi-Markov simplex" (Joreskog 1970) model for measurement error introduced by Wiley and Wiley (1970), which they apply to data from the 1980 ANES Major Panel Study (ANES-MPS) to estimate the variance in the measurement error in the seven-point PID scale.5 They use the variance estimate to correct for measurement error in lagged PID when estimating several models that bear on some of the claims about particular \short-term forces" that aect PID. The typical nding is that eects that appear statistically signicant and substantively large when measurement error is ignored become insignicant and small when they use the error variance estimates from their quasi-Markov simplex technique. For testing the hypothesis that PID is stable but measured with error, the most important problem with Green and Palmquist's approach is that it requires the assumption that instability aects all kinds of partisans equally. This assumption is a consequence of their assumption that PID is an interval-level variable. The assumption of interval-level measurement emulates the practice of most (not all!) of the authors that Green and Palmquist target, and so can be defended to some extent as necessary for a fair comparison between the original results and their new ones (Miller 1987). Nevertheless the assumption is highly problematic. The problem is that at any given time some kinds of partisans may be stable in their orientations while others are not. During a particular election year it may be, for instance, that Democrats are likely to become Republicans but Republicans are unlikely to return the favor and become Democrats. Treating PID as an interval-level variable implies that any instability in PID must act on all of the variable's values in a uniform manner. To see this, imagine that the model for instability in an individual's true, interval-level PID at time t, yt , is yt = ayt,1 + bxt, for coecients a and b and random variable xt. 5 Green and Palmquist also use data from other ANES panel studies for some of their empirical demonstrations. 4 Such a form covers the models that Green and Palmquist use. There is perfect stability if a = 1 and b = 0; otherwise there is instability. The problem with such a model is that any particular type of partisanship can be unstable only if all types of partisanship are unstable. It is not possible, in such a model, for only Democrats to have unstable loyalties. Either every kind of partisan is subject to instability, or no kind is.6 Instability in PID is best thought of as a propensity for individuals to change from one kind of partisanship to another. PID should be thought of as a set of discrete categories, perhaps ordered categories. For PID to be unstable and reactive to short-run political stimuli means that individuals change categories in response to new political events or new political information. For such a purely categorical PID to be susceptible to measurement error means that at any particular time an individual may be observed in a category dierent from the one that actually characterizes the individual. To distinguish error from truth it is necessary to specify properties of the true values that the erroneous components do not possess. A basic distinction is to specify that the errors of observation at dierent times are independent while the true values are not. The basic Markov assumption for the true values' serial dependence is that the current true value for an individual depends only on the true value for that individual in the immediately preceding time period. The quasi-Markov simplex model that Green and Palmquist use has such a structure for PID presumed measurable at the interval level, with the assumption that the measurement errors are not only serially independent but identically distributed over time. A Markov chain model for categorical PID with independent and identically distributed observation errors may be written as follows. Let it be the probability that some individual has PID of 6 The limitation to uniform eects would disappear in a suciently non-linear model of PID's relationships with other variables. But such a model would require even stronger assumptions about PID's measurement scale. To specify the values of PID as signicant in non-linear relationships would require PID to be measured on at least a ratio scale. 5 type i at time t 2 f0; 1; 2; : : : g. To use the conventional terminology for a Markov chain, we say that it is the probability that the individual is in state i at time t. The vector t = (1t; 2t; : : :; 7t)> contains the probabilities that the individual is in each of the seven states corresponding to the conventional seven levels of PID;7 t is the distribution at t of the process that generates the true values. Let P be a 7 7 matrix in which element pji is the conditional probability that the individual is in state i at time t + 1, given that the individual is in state j at time t. As conditional probabilities, the values in each row of P sum to one: i=1 pji = 1; P is P7 the transition probability matrix (see the Appendix for more details). The distribution of the true values at time t is given by t> = 0>Pt , where 0 is the initial distribution, 0> is the transpose of 0 and Pt indicates t postmultiplications of 0> by P. To represent observation error, each element mji of the 7 7 matrix M is the conditional probability that an individual is observed in state i, given that the individual P is truly in state j ; naturally, 7i=1 mji = 1. The observed distribution at t is ~t> = t> M.8 There is no observation error if M = I, the identity matrix. The conventionally assumed ordinality of the PID categories9 suggests that it is appropriate to use a tridiagonal form for P. In such a form, transitions are possible only between immediately 7 In the ANES-CMS data only 40 of the 3496 survey respondents did not select one of the seven conventional levels of PID. We omit those 40 respondents from our analysis and ignore those types of responses here. 8 M is identiable only up to a row permutation. To identify the parameters of M we use a specication in which the diagonal value is the largest value in each row. 9 The conventional order is \Strong Democrat," \Democrat," \Democrat Leaner," \Independent," \Republican Leaner," \Republican," \Strong Republican". 6 adjacent states: 2 p11 p12 0 0 6 6p21 p22 p23 0 6 6 6 0 p32 p33 p34 6 P = 666 0 0 p43 p44 6 0 0 p54 6 0 6 6 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 7 7 7 7 7 7 7 7 7 7 7 7 7 675 p45 : p55 p56 0 p65 p66 p 0 0 p76 p77 For instance, to go from state 2 (Democrat) to state 5 (Republican Leaner), an individual must rst occupy states 3 (Democrat Leaner) and 4 (Independent). For daily observations, the minimum time to go from being a Democrat to being a Republican Leaner is therefore three days. Unconstrained maximum likelihood estimates (MLEs) of the three wave-to-wave transition matrices for PID using the 1980 ANES-MPS data provide indirect support for the tridiagonal form.10 In Table 1 one can see that for all three matrices the diagonal value is the largest value in each row, and the values o the three central diagonals are generally small. Especially in view of the fact that we estimate P for transitions on a time scale of single days, while the values in Table 1 represent transitions over periods of months, the transitions observed during 1980 give no reason to doubt the suciency of using the tridiagonal form for the 1984 data. *** Table 1 about here *** 10 Data from the ANES surveys were made available by the Inter-university Consortium for Political and Social Research and by the Data Archive of the Cornell Institute for Social and Economic Research. The ANES-MPS and ANES-CMS data were originally collected by the Center for Political Studies of the Institute for Social Research, The University of Michigan, for the National Election Studies, under the overall direction of Warren E. Miller. Maria Elena Sanchez was director of studies in 1980. Santa Traugott was director of studies in 1984. The data were collected under grants from the National Science Foundation. Neither the original collectors of data nor the Consortium bear any responsibility for the analyses or interpretations presented here. 7 Dynamics of Party Identication During 1984 The principal political event of 1984 whose eects we examine is the conclusion of the Democratic nomination contest. The conclusion of the contest is not the same as the end of the nomination process. Counting delegates and superdelegates, Walter Mondale eectively locked up the nomination by mid-May, even though primaries then remained pending in several large states, many of which were ultimately won by Gary Hart (Bartels 1988, 247{252). The question is whether that mid-May conclusion of the contest changed the dynamics of individuals' partisan self-identications. Hart had run as a \new Democrat," with a \campaign of ideas" rather than a \campaign of personality" and with clear intentions to reform the Democratic party in ways that would have greatly reduced the inuence of traditional interest groups (Hart 1983; 1993, 209{219). Mondale stood as much more the heir to the aura of Roosevelt's New Deal, and especially with his embrace of protectionist \domestic content" legislation that would have restricted competition in automobile production, he appeared captive to traditionally Democratic labor unions. Before mid-May, an observer could have perceived a realistic chance for the more reformist alternative to gain the nomination. It was therefore uncertain what kind of Democratic party would exist for the general election and beyond. After mid-May there was no more uncertainty. Anyone who may have been attracted by the prospect of Hart-style reform in the Democratic party's organization or policy positions should therefore have lost interest in the party after mid-May. For individuals to react in such a way to the conclusion of the 1984 Democratic nomination contest would appear to require that they were basing their partisan self-identications at least in part on well-informed and forward-looking judgments. Finding evidence that individuals did react in such a way would support arguments that an individual's PID summarizes a history of relatively sophisticated judgments about issues, performance and candidates (e.g. Jackson 1975a; Page and Jones 1979; Fiorina 1981; Franklin and Jackson 1983). 8 We use four Markov chain models estimated using the daily cross-sections obtained in the 1984 ANES-CMS to test whether PID varied over time during 1984, and if so whether the dynamics changed in mid-May to reect the foreclosed possibility that the Democratic party would undergo substantial \reform." The rst interview in the ANES-CMS data occurred on January 11, 1984, the last on December 7, 1984. In the Markov chain models, January 11 is day t = 1. In two of the models there is a single transition probability matrix for the entire span of 331 days. In the other two models the transition matrix changes at day 120 (May 10), using the method described in the Appendix.11 Of each of these pairs of models, one allows for measurement error while in the other all states are assumed to be perfectly observed. As described in the Appendix, we use a multinomial logit functional form to specify the initial probability vector and each row of each transition matrix in each model. The measurement error matrices M are parameterized so that the conditional probabilities decrease in each row as one moves away from the main diagonal.12 Likelihood ratio (LR) tests support the hypothesized mid-May change in the dynamics and suggest that once the dynamics are specied to include that change there is no longer any indication that PID is subject to measurement error. From Table 2 one can see that with only one transition matrix for the entire time period, there appears to be signicant measurement error: the test statistic is 16.64, which by a 21 distribution is signicant at any reasonable test level.13 The 11 May 10 is two days after a set of primaries on May 8 (IN, MD, NC, OH). Mondale won MD and NC and lost very narrowly to Hart in IN (40.9 to 41.8) and OH (40.3 to 42.1). The results put Mondale over the top in delegates and superdelegates (Bartels 1988, 251{252). 12 For each state i there is one parameter, i 2 f,15; 15g. Dene ai = 1=(1 + exp(i )) and then mii = 1 and mij = ajii,jj if i 6= j . The conditional probabilities in each row of M are mij = mij = P7j=1 mij . 13 There is a complication because several parameters are xed rather than freely estimated in the nal specication for each model. This occurs whenever a value in the initial distribution or a conditional probability value in a transition matrix or measurement error matrix is zero or one. In this case the Markov chain has parameters on the boundary of the parameter space. The number of free parameters is smaller than in the case of an interior solution. The rank of the hessian matrix evaluated at the MLE is an estimate of the number of free parameters. If the estimates of the 9 measurement error matrix produced by the MLEs for the one-matrix model is striking: 2 :6215 6 6 :2009 6 6 6 :0380 6 c=6 M 6 :1429 6 6 6 :0000 6 6 4 :0000 :0164 :2357 :3329 :1520 :1429 :0000 :0000 :0282 :0894 :2009 :6083 :1429 :0000 :0000 :0486 :0339 :1213 :1520 :1429 :0000 :0000 :0838 :0129 :0732 :0380 :1429 1:0000 :0000 :1445 :0049 :0442 :0095 :1429 :0000 1:0000 :2491 :0019 3 :0267 777 :0024 777 :1429 777 : :0000 777 :0000 75 :4294 The matrix indicates that the Independent state is subject to the maximum possible amount of measurement error. Someone whose true state is Independent is equally likely to be observed in any state. Four of the other states also exhibit large amounts of measurement error. A Democrat has only a 33 percent chance of being correctly observed in that state. A Strong Republican has only a 43 percent chance of being correctly observed. The pattern of the errors is dicult to understand. Why are Democrats observed so imperfectly while Republicans are observed perfectly? Why are Strong Republicans so much more dicult to observe in their true states than are Republicans or Republican Leaners? It is not reasonable to attribute such an asymmetric pattern to imperfections in the survey instrumentation. Whatever the origin of such errors, it is dicult to believe that it is apt to describe them as \measurement" errors. *** Table 2 about here *** The LR tests suggest that there appears to be such a stupendous amount of measurement error only because the model with one transition matrix is grossly misspecifying the dynamics. Comparing the one-matrix model to the two-matrix model under the assumption of no measurement error gives a test statistic value of 23.32, which by the 23 distribution is statistically signicant at o-boundary parameters are asymptotically normal, then the LR test statistics are asymptotically chi-squared with degrees of freedom equal to the dierence in the number of free parameters in the respective nested models. We have used this theory to conduct our hypothesis tests. 10 any reasonable test level. Comparing the two models that allow measurement error to one another gives a test statistic value of 7.17, which is not signicant by the 24 distribution. The test for the presence of measurement error given two transition matrices is .49, which is not even remotely c signicant by the 22 distribution. If the amount of measurement error were truly as large as the M estimates suggest, we would not expect the estimated eect of the error to vanish so completely just because the transition matrix is allowed to change at the conclusion of the Democratic nomination contest. In combination with the substantive implausibility of the \measurement" error pattern c , the LR tests convince us that both of the models that allow for measurement error shown in M should be rejected in favor of the model that has two transition matrices and no measurement error. The transition matrices for the two-matrix, no-measurement-errors model do exhibit the kind of pattern to be expected if the Democratic party appeared attractive to non-Democrats during the period when there were reasonable prospects that it might be reformed, but repulsive to them once the reform prospects were terminated by the conclusion of the nomination contest. The MLEs in Table 3 produce the initial distribution and transition probability matrices in Table 4. The fact that all the values in the lower diagonal of the transition matrix for days 1{119 are positive while in that matrix the value for the probability of a transition from Democrat to Democrat Leaner (^p1:23) is zero means that no Strong Democrat or Democrat abandoned the party during that period, while non-Democrats moved both toward a Democrat PID and away from it. During this period, every non-Democrat had a small but positive probability of converting to a Democrat or Strong Democrat identication. As of day 120 the situation reverses. During that period all the values in the upper diagonal of the transition matrix are positive while the value for the probability of a transition from Independent to Democrat Leaner (^p2:43) is zero. That means that neither Independents nor any kind of Republican took on any kind of Democrat identication after the Democratic nomination contest concluded. Strong Democrats, Democrats and Democrat Leaners, on the other hand, all 11 had small but positive probabilities of converting to some kind of Republican identication. *** Tables 3 and 4 about here *** One way to get a sense of the practical signicance of the mid-May change in the dynamics is to contrast the distribution the MLEs predict for PID on election day using both transition matrices to a simulation of what the distribution would have been had the rst transition matrix continued unchanged until then. The former distribution is > = :14 :17 :11 :09 :17 :17 :15 ^300 while the latter we may write as > = :20 :24 :15 :09 :11 :14 :08 : ~300 Given the mid-May change, we expect Strong Democrats, Democrats and Democrat Leaners to comprise 42 percent of the population. In the simulated case in which movement toward Democratic identication continued until election day, 59 percent of the population has one of those three types of identication. Had the simulated case occurred with the typical conditional probabilities of voting Democratic given individual PID continuing to apply, the result would have been a landslide Democratic victory. To get a better sense of the rapidity of movement that the estimated transition matrices suggest was occurring, consider the quartiles of the distribution of hitting times shown in Table 5. The hitting time ji is the time it takes for an individual who starts in state j to enter state i for the rst time. Each nite hitting time has a geometric distribution that can be computed from the transition probabilities (Feller 1968, 443; Resnick 1992, 98{110). Table 5 shows the rst quartiles and the medians of those distributions, based on the transition probabilities produced by the MLEs. The hitting times in many instances show quite rapid transitions. During the period when 12 the Democratic nomination contest was alive, for instance, about a quarter of Republicans took just seven days to pass through the Republican Leaner and Independent states and arrive in the Democrat Leaner state, while about half of Republicans took just 18 days to do so. But having arrived at a Democratic Leaner identication, individuals did not take long to abandon it. About a quarter of Democrat Leaners took just ve days to arrive in the Republican state; about half took 14 days. After the Democratic contest concluded, no Independent, Republican Leaner, Republican or Strong Republican ever became a Democrat Leaner, but circulation among the Independent, Republican Leaner and Republican states became if anything more rapid. *** Table 5 about here *** On the Democratic side the hitting times show moderately rapid oscillations during the early period between the Strong Democrat and Democrat states that accelerated to become almost as rapid as possible at the conclusion of the nomination contest. The median times that show daily oscillations between the two states in the later period could mean that there was an extreme degree of indecision on the part of Democrats and Strong Democrats about the strength of their commitment to their party, or it could mean simply that the two states had ceased to have distinctive meanings to individuals once the only presidential choice of any consequence had become the one between Walter Mondale and Ronald Reagan. But the most interesting hitting time pertaining to Democratic identiers is probably the time it took to leave the Democratic fold entirely by going from the Democratic Leaner state to the Independent state. The median value of 204 days indicates that by election day just less than half of those who were Democrat Leaners at the conclusion of the Democratic nomination contest had become some kind of Republican. The daily oscillations between the Strong Democrat and Democrat states after the conclusion of the Democratic nomination contest, and the nearly as rapid wandering among the Independent, Republican Leaner and Republican states, are remarkable. The rapid movements may indicate 13 that after all PID was aected by a kind of observational error, albeit not serially independent and temporally identically distributed measurement error such as M represents. To say that the Strong Democrat and Democrat states had ceased to be distinctly meaningful is to say that to individuals there was only one state|\General Election Democrat" (GEDem), say|even though the survey instrumentation continued to record responses in two dierent categories. Similarly, on the other side there were not in fact the three states of Independent, Republican Leaner and Republican but only one, say \Non-Movement-Conservative Republican" (NMCRep). The most important fact about this error|if that's what it is|is that it is endogenous to the political process. The error is not purely or merely an artifact of the survey interviewing instrumentation, but rather reects the developing political process and the strategies individuals are using within that process. The key detail here is that the Strong Democrat and Democrat categories do not appear to collapse into one another until the Democratic nomination contest concludes and therefore the issue of drawing internal distinctions|distinctions among Democratic party factions|had become as a practical matter moot. It is reasonable to speculate that the distinctions that remained separated subjective party members, the GEDems, from the Democrat Leaners who did not consider themselves party members but did feel likely to vote for the party's presidential candidates. On the Republican side, the separation between the NMCReps and Strong Republicans persists throughout the span of the data, from January through the general election in November. It seems reasonable to guess that this pattern reects the then-continuing war within the Republican party between Movement Conservatives and Republican moderates.14 The rapid oscillations may reect individual stances regarding not only competition between the parties but also competition among party factions. The change in the dynamics of PID at the conclusion of the Democratic nomination contest is 14 This interpretation extends the argument of Keith, Magleby, Nelson, Orr, Westlye and Wolnger (1992) that independent \Leaners" are in eect partisans. 14 characteristic not only of the MLE but of virtually all the parameter vectors contained in the 95 percent condence ellipsoid (95% CE) of the joint sampling distribution of the parameter estimates. The plots in Figures 1, 2 and 3 illustrate this. To generate the gures we sampled 50 parameter vectors from the boundary of the 95% CE.15 The gures show trajectories of the PID distribution over the 331 days covered by the 1984 ANES-CMS data, for the MLE and six of the 50 sampled vectors. Each curve in each plot shows the trajectory given the initial distribution and the transition probability matrices produced by one parameter vector; lines in dierent plots that have the same dash style represent dierent components of the distribution for the same parameter vector. The solid black lines show the components of the distribution for the vector of parameter MLEs. The six other vectors were chosen to illustrate the maximal range of variation in the distribution trajectories produced by parameter vectors on the boundary of the 95% CE. From the set of 50 sampled parameter vectors, we chose the vectors that produced either the largest or the smallest average probability over the 331 days for at least one of the categories of PID. *** Figures 1, 2 and 3 about here *** All the curves show a change in the slope of the trajectory at the conclusion of the Democratic nomination contest. In every case, the proportion of \Democrats" (Strong, not-Strong or Leaner) is increasing up to day 119 and decreasing thereafter, while the proportion of \Republicans" is decreasing up to day 119 but thereafter increasing. The variations in the heights of the curves represent only dierent allocations of probability among the three kinds of Democrat or among the 15 Using the robust estimate ^ = @ 2 log L ,1 , @ log L , @ log L > @ 2 log L ,1 > > @ !^ @ !^ @ !^ @ !^ @ !^ @ !^ of the asymptotic covariance matrix of the free parameters ! (White 1994, 92), we randomly sampled vectors from the multivariate normal distribution with mean !^ (the MLE) and covariance matrix ^ . For each sampled vector !~ we computed the statistic ~2 = (!~ , !^ )> ^ ,1 (!~ , !^ ). Sampling was continued until we obtained 50 vectors such that j~2 , 2(1,) j < :02, where 2(1,) denotes the critical value for an = :05 level test using the 2 distribution. 15 three kinds of Republican. 1993 Canadian Federal Election Of the political changes that occurred during the 1993 Canadian federal election, the near electoral annihilation of the Progressive Conservative (PC) party is one of the most remarkable. The PC party won parliamentary majorities in the two preceding elections, with the latter setting a historic record 211 of 295 seats. On October 25, 1993|election day|the Progressive Conservative party was reduced to 2 parliamentary seats. A number of explanations have been oered for the precipitous decline in PC support, including the loss of Campbell's credibility, higher levels of media scrutiny, increasing regional polarization, and campaign strategy disasters. Even if strategic weaknesses of the PC party existed prior to the beginning of the campaign, the early public opinion polls did not foretell their inevitable demise. The events of the campaign mattered. The focus here is two fold: to test the impact of selected campaign events on vote intentions, and to contrast the behavior of dierent awareness groups. In the context of vote analysis, DFMC models have a number of appealing characteristics. DFMC models enable tests of hypotheses about the eects of specic events. More generally, DFMC models allow estimation of properties of the process that generates the changes in voter support observed during the campaign. While other statistical methods such as vector autoregression and loess can model only aggregate daily proportions, DFMC models provide estimates of the individuallevel movements between states by directly estimating the transition matrix or matrices. Attempts to explain the relative importance of campaign events in the 1993 Canadian federal election require an analysis that can test whether potential voters acted dierently after each particular event. Only if the probability of transition between parties signicantly changed can one claim that an event had a signicant impact on vote intentions. 16 Timing There is a general consensus about what were the major events during the 1993 campaign. Most accounts rely on anecdotal evidence to assess the consequences of the events, however, so the extent to which individual campaign events contributed to the decline in support for the PC party have remained untested. In the most comprehensive analysis, Johnston, Nevitte and Brady (1994) identify four key points of change in the campaign, using graphical analysis of the 1993 Canadian Election Survey (CES) data. In order of ascending importance, the dates are: September 11{15 (day 1{5 of the CES), October 3{4 (Party Leader Debates, day 25{26 of the CES),16 October 13{ 14 (PC Chretien advertisement debacle, day 35 of the CES), and September 20{22 (Conservative Party \secret plan to cut social programmes," day 10{12 of the CES).17 They argue that the drop September 20{22 \may not have been enough to deny the Conservatives control of the opposition, perhaps not even enough to guarantee the Liberals a majority, but was a cardinal event in that other events hinged on it." (3) By employing a DFMC model we are able to estimate transition matrices for each of the periods and test the signicance of these events with respect to their impact on the distribution of daily vote intentions. The null hypothesis is that a single transition matrix adequately represents campaign dynamics and thus P1r = P2r = : : : PKr for all K . The substantive interpretation of this hypothesis is that the Progressive Conservatives were destined from the beginning of the campaign to be reduced to near electoral extinction: none of the campaign events signicantly increased or decreased the 16 LeDuc (1994) notes that \one should not conclude that the debates determined the outcome of the election. Various polls taken at dierent times during the campaign show that the conservatives had already entered a steep slide in public support well before the debates took place..." (137) 17 Woolstencroft (1994) describes in detail the controversy over Kim Campbell's statement on September 23 that \an election is no time to talk about social policy." Johnston, Nevitte and Brady (1994) point out that the drop in PC vote intentions had largely occurred prior to September 23. 17 magnitude of their defeat. Although there are numerous alternative hypotheses,18 we restrict ourselves to the four most substantively interesting dates outlined by Johnston, Nevitte and Brady (1994). Political Awareness Political awareness is dened as an individual's cognitive engagement in politics, \as against emotional or aective engagement, or no engagement at all" (Zaller 1992). With Zaller we view an individual's level of political awareness as aecting both the probability that the individual receives political information and the individual's ability to process and contextualize new information. We can test for heterogeneity of behavior associated with awareness in two ways: by estimating separate transition probabilities for each level of awareness; and by allowing the transition matrices for each group to change at dierent dates. The null hypothesis here is that both the transition matrices and the dates of change are the same for dierent levels of political awareness. There are three broad alternative hypotheses with substantively dierent interpretations. First, the transition matrices may dier while the dates of change are uniform. This could reect that low awareness individuals are not signicantly delayed in responding to events, but do not have the cognitive abilities or contextual information to react in the same manner as the highly aware. Second, the transition matrices may be the same but the dates dier. As opposed to the rst alternative, low awareness people do not react dierently than the highly aware to events, but their limited exposure to the media impedes the rate at which they receive the information the event provides. Third, it may be that neither transition probabilities nor dates of change are shared by dierent awareness groups. This nal alternative encompasses all remaining possible patterns of 18 There are in fact P45 ,45 r=1 r dierent ways to select the number of transition matrices to estimate and the date of their change. A search over dates is certainly possible for a xed number of transition matrices, however, the practical implementation is a project for future research. 18 behavior. The empirical results suggest that patterns akin to the logic of the second hypothesis are dominant. Events early in the campaign quickly sort out the positions of high awareness people, while it is not until much later that less aware people begin to make the same choices. In some cases the disparity in vote distributions between awareness groups becomes small by election day, but individuals in the two groups have arrived at similar points by very dierent paths. To measure political awareness we draw on three factual survey questions that probed each respondent's knowledge of the 1993 rates of Canadian ination and unemployment, as well as of the size of the federal government's decit. These questions have the appealing property of separating respondents into awareness groups whose proportions remain stable over the entire period of the campaign. The more politically-oriented awareness questions all have increasing correct response rates over the campaign; to use these would likely confound tests of homogeneity with the changing composition of each group. In addition, there is a growing consensus that \factual awareness is the best single indicator of [political] sophistication and its related concepts" (Delli Carpini and Keeter 1993). For this analysis, respondents have been divided into two roughly equally sized groups, designated as high and low awareness. Data Data for measuring vote intentions, as well as other respondent characteristics, are drawn from the Canadian Election Study, 1993.19 The CES was a rolling cross-sectional survey that spanned 45 days of the 1993 Canadian federal election campaign period. The survey interviewed 70-90 new 19 The data were collected by the Institute for Social Research, York University for Richard Johnston, Andre Blais, Henry Brady, Elisabeth Gidenl, and Neil Nevitte. The project was funded by the Social Sciences and Humanities Research Council of Canada. The investigators, SSHRCC and the Institute bear no responsibility for the analyses and interpretations presented here. 19 respondents daily from a random sample of eligible voters across Canada. The campaign-period survey began 3 days after the ocial election writ was dropped and terminated the day preceding the election. Although the PC party was represented nationally, we believe that Quebec and the maritime provinces have unique electoral environments and therefore exclude them from the present analysis. The sample is restricted to Western Canada. Heterogeneity in voter preferences between these provinces is modeled by use of dummy variables for BC and Ontario residency. Canonical Form For the case of Canadian multiparty elections, there is no obvious ordering of the parties and therefore no ordinal structure to the states of the Markov chain. We adopt a transition matrix with the form 2 p11 0 0 p14 6 6 P = 666 00 p022 p0 pp24 33 34 4 p41 p42 p43 p44 3 7 7 7 7 7 5 (1) The rst three diagonal elements (p11, p22 and p33) represent the probability of maintaining the same vote intention between intervals. This form is generalizable to any number of states, but we focus on the vote proportions for three parties: PC, Liberal, and Reform. We reduce the number of parameters by constraining all transitions to take place through the last category. The elements in the last column of the rst three rows represent the probability of renouncing ones' vote intention and moving to the residual category. The elements in the last row represent the probability of entering the state corresponding to each column. All other cells are zero. This abstraction requires defection to another party to occur over the course of two intervals, but avoids imposing an ordinal structure on the parties. The state corresponding to the last row and column includes primarily 20 those who respond \Don't know" or abstain as their vote intention.20 For estimation, the rst three rows are parameterized as binary logits and the fourth row is a multinomial logit (see the Appendix for details). Results We estimate a Markov chain for each of the two levels of political awareness with dummy variables for Ontario and BC. Tests between non-pooled models with dierent specications indicate that the days at which transition matrix changes occur are not the same for each level of awareness.21 Table 6 provides a summary of solution values. For the present analysis we focus on the estimates from models with three transition matrices, where high aware individuals have the potential to change matrices at days 10 and 25 of the CES, and less aware individuals have the potential to change at days 25 and 35. *** Table 6 about here *** Figure 4 and Figure 5 plot the time paths of the estimated vote proportions in BC and Ontario, respectively, for each awareness level. These case studies provide insight into two key elements that the DFMC model takes into account. The model not only reveals dierent vote proportions in each province and awareness group, but distinct patterns of behavior over the course of the campaign. 20 For the present analysis we also include the New Democratic vote in this residual category. This arbitrary residual category is not of interest itself, but constructed such that we can include respondents in at least one state. In so doing we attempt to minimize the additional problems inherent in open systems, where individuals could exit from the states analyzed, by not allowing the pool of potential respondents analyzed by the DFMC model to change systematically over the course of the campaign. We feel that for this initial analysis, the benets of this strategy outweigh the potential violations of lumpability assumptions 21 The LR test of the pooled against non-pooled estimation does not reject the hypothesis that they are the same. As noted in the analysis of PID, we treat the accuracy of these LR tests with circumspection. 21 *** Figure 4 and Figure 5 about here *** The estimated distribution of intentions to vote for the PC party in BC was below the historic mean22 for the early portion of the campaign,23 but stable for both awareness groups. A loss of some of the 12 PC seats would be expected, but annihilation was not imminent. The debates did provide a ripple among the low awareness group, while for the high aware the event resulted in a quick and near complete abandonment of the party. The small percent of individuals who remained in the party following the debates were loyalists who were not subsequently dissuaded by the Chretien ad debacle. In contrast, the support for the party among the less aware was cut nearly in half following day 35. A similar pattern among awareness groups is also evident for Liberal and Reform support. The Reform party enjoys signicant growth among the less aware following the party leader's debates, although the gains recede following day 35. For the aware, all changes in transitions are accomplished by day 25. PC vote intentions among the less aware in Ontario resemble the behavior seen among their BC counterparts: the debates gave a considerable boost primarily at the expense of the Liberals. However, these new converts, along with the other low awareness Tories, quickly ed following the negative press coverage of the Chretien ad debacle at day 35. The debates had only a slight eect on high awareness voters, the major drop having already occurred around day 10. What would have been the outcome on election day had these events not occurred? Among the innite number of counterfactuals, we propose just one to highlight the importance of these events: projecting the election day results with and without the nal transition matrix change for each 22 23 See Fiegert 1989, Table 13-1 By estimating the initial probability distribution, the DFMC model is particularly good at tting the early data points in the survey. Due to concerns over smaller samples and extreme values in the early days we do not place too much substantive weight on the initial time path (day 1{3), especially since the stationary distribution (Feller 1968, 394) is achieved by day three in most cases. 22 group. Using the same three-transition matrix estimates that created the time paths in Figure 4 and Figure 5, we can calculate \hypothetical" election day results by using only a subset of the estimated matrices. Instead of using the third transition matrix we continue to use the second, and thus track the proportions as if each group's nal event never happened. Table 7 shows election day proportions for both the true and hypothetical estimates by province and awareness group. The nal transition changes cost the PC party a majority among less aware individuals in both Ontario and BC. *** Table 7 about here *** Figure 6 summarizes the dierences in behavior by awareness group. We subtract the less aware daily vote proportions, again using the same estimates used in Figure 4 and Figure 5, from those of the high aware. In BC, there is remarkable convergence by election day among PC and Liberal supporters. In Ontario, the results for convergence are less convincing, although individuals in each group are closer than they had been even a week prior to election day. Although low awareness individuals are usually heading in the same general direction as their aware counterparts, election day still nds these two groups with substantially dierent vote proportions. *** Figure 6 about here *** To answer the question of which were the critical events in the 1993 election requires conditioning on awareness levels. September 20 was important for only half of the population: those with the interest, abilities and background necessary to observe and evaluate the early crises in the PC party. Events in the nal two weeks of the campaign, in contrast, caused no signicant change in the preferences of the aware group. For the less aware, it was not until media coverage and interest peaked during the party leader's debates that opinion began to be reshaped in anticipation of the upcoming election. Extreme volatility characterizes the vote intentions of the less aware, with up 23 to 20 point shifts following the debates and reversals by election day. Events early in the campaign quickly sort out the positions of high awareness people, while it is not until much later that less aware people begin to make the same choices. The question that cannot be resolved by the present analysis is why individuals are making these changes in response to events. Are the qualitative dierences in the events revealing distinct decision rules based on either style or substance? Were aware people who abandoned the PC party early in the campaign responding to the Progressive Conservative policy failures, while less aware people were merely reacting to a supercial media frenzy near the end of the campaign? Or are less aware individuals rationally relying on media cues about the prospects and abilities of each party: a party with a viable chance at governing would not be so brutally treated in the nal days of the campaign. The condemnations of the anti-Chretien advertisement, both by the media and from within the PC party itself, could be a clear signal to those who had not been closely attuned to the campaign that the Progressive Conservatives were not a viable party. The DFMC model provides a framework for the further investigation of these possibilities. Including a richer array of covariates from both survey and media data, and allowing for the simultaneous estimation of endogenous variables, are the next extensions for the analysis of pooled cross-sectional data. Appendix: Estimating Markov Chains We use the theory of Finite Markov Chains (Feller 1968; Kemeny and Snell 1960; Bartholomew 1982) to analyze the movement of individuals between discrete states over time. The fundamental property of a Markov chain is that the probability of event Yit at time t depends solely upon the event of the directly preceding time, Yjt,1 . Thus Pr(Yit jYjt,1 ) = Pr(YitjYjt,1 ; Ykt,2) = Pr(Yit jYjt,1; Ykt,2; Ylt,3) = etc : : : 24 The probability of an ordered set of sequences is obtained by the multiplication law of conditional probability, Pr(Y0 ; Y1 ; :::; YT ) = Pr(Y0 )Pr(Y1 jY0 ) Pr(YT jYT ,1 ). To calculate in generality for any time (t = 1; 2; :::; T ) the probabilities of being in one of a nite number of states (i = 1; 2; :::; m), one needs only to know the initial distribution 0> = 10 20 30 : : : m0 and the transition probability matrix, 2 3 p11 p12 : : : p1m 6 p 7 6 21 p22 : : : p2m 7 P = 64 where .. . .. . ... .. . 7 5 pm1 pm2 : : : pmm Pm i=1 i0 = 1 and i0 0 for i = 1; :::; m, and i=1 pji = 1 for j = 1; :::; m and pji 0 Pm for i; j = 1; 2; :::; m. The constraints reect the assumptions that the states are mutually exclusive and exhaustive, and that probabilities cannot be negative. Repeatedly multiplying the initial probability vector by t transition matrices gives the probabilities for each state at time t, t> = 0>Pt Allowing the transition matrix to change in K successive time periods requires the calculation of multiple matrixes P1, P2, ... PK each covering t1 ; t2; :::tK days respectively. An individual P interviewed on day t = kk=1,1 tk + w, w tk , would have probabilities t calculated as follows, t> = 0> ,1 kY k=1 Ptkk ! Pwk (2) The greatest diculty with using pooled cross-sectional surveys is that the transition of each individual between states is not observed. We use maximum likelihood to estimate the initial probability vector and transition matrices that generate the series of vectors of probabilities t> that presumably describe the distribution of survey respondents on each day. For a random sample of R independent observations the log-likelihood is log L = R X m X r=1 i=1 yritr log ritr (3) 25 where yritr = 1 if respondent r is in state i on the day of interview tr , and yritr = 0 otherwise. rtr = (r1tr ; : : :; rmtr )> is the vector of probabilities for respondent r. We use a multinomial logit (MNL) (Maddala 1983, Theil 1969) form to represent the initial probabilities, each row of the transition matrices, and the jump vectors. A major advantage of the MNL parameterization is that the sum of probabilities are constrained to equal one.24 Each MNL can be estimated as a function of individual respondent attributes. The result is transition matrices and prior probabilities that are functions of individual level characteristics. The initial probability that respondent r is in state i is calculated by ri0 = 1 exp x>r i and rm0 = Pm,1 > 1 + s=1 exp xr s 1 + s=1 exp x>r s Pm,1 (i = 1; 2; :::; m , 1) (4) where xr is a vector of observations on the variables x for individual r, and i is a vector of coecients for state i. Similarly each row h of the kth transition matrix is calculated as, exp log (pk,1;rhi =pk,1;rhm ) + x>r khi (i = 1; 2; :::; m , 1) and (5) pkrhi = Pm,1 1 + s=1 exp flog (pk,1;rhs =pk,1;rhm ) + x>r khs g pkrhm = 1 1 + s=1 exp flog (pk,1;rhs =pk,1;rhm ) + x>r khs g Pm,1 Note that log (pk,1;rhi =pk,1;rhm ) is the log odds of a transition by individual r in state h to state i rather than to state m. We change each transition probability by adding the sum of the products of the variables in x>r and the coecients khi to the value of the log odds according to the previous transition matrix. The probability in the cell does not change if khi = 0. 24 Similar uses of the MNL form in Markov chain models occur in MacRae (1977) and Brown and Payne (1986). 26 References Aldrich, John H. 1980. Before the Convention: Strategies and Choices in Presidential Nomination Campaigns. Chicago: University of Chicago Press. Bartels, Larry M. 1988. Presidential Primaries and the Dynamics of Public Choice. Princeton: Princeton University Press. Bartholomew, David J. 1982. Stochastic Models for Social Processes. New York: Wiley. Brams, Steven. 1978. The Presidential Election Game. New Haven: Yale University Press. Brody, Richard A., and Lawrence S. Rothenberg. 1988. \The Instability of Partisanship: An Analysis of the 1980 Presidential Election." British Journal of Political Science 18:445{465. Brown, Philip J. and Clive D. Payne. 1986. \Aggregate Data, Ecological Regression, and Voting Transitions." Journal of the American Statistical Association 81(394):452{460. Campbell, Angus, Philip E. Converse, Warren E. Miller and Donald E. Stokes. 1960. The American Voter. New York: Wiley. Delli Carpini and Keeter. 1994. \Measuring Political Knowledge: Putting First Things First." American Journal of Political Science 35(4):1179{1206 Erikson, Robert S. 1982. \The `Uncorrelated Errors' Approach to the Problem of Causal Feedback." Journal of Politics 44:863{881. Feller, William. 1968. An Introduction to Probability Theory and Its Applications. Vol. 1, 3 ed., revised printing. New York: Wiley. Fiegart, Frank. 1989. Canada Votes: 1935{1988. Durham: Duke University Press. 27 Fiorina, Morris P. 1981. Retrospective Voting in American National Elections. New Haven: Yale University Press. Franklin, Charles H. 1984. \Issue Preferences, Socialization, and the Evolution of Party Identication." American Journal of Political Science 28:459{478. Franklin, Charles H., and John E. Jackson. 1983. \The Dynamics of Party Identication." American Political Science Review 77:957{73. Gelman, Andrew, and Gary King. 1993. \Why are American Presidential Election Polls So Variable When Votes are So Predictable?" British Journal of Political Science 23:409{451. Green, Donald Philip, and Bradley Palmquist. 1990. \Of Artifacts and Partisan Instability." American Journal of Political Science 34:872{902. Gurian, Paul-Henri. 1987. \Resource Allocation Strategies in Presidential Nomination Campaigns." American Journal of Political Science 31:802{821. Hart, Gary. 1983. A New Democracy. New York: William Morrow and Company. Hart, Gary. 1993. The Good Fight. New York: Random House. Hinich, Melvin J., and Michael C. Munger. 1997. Analytical Politics. New York: Cambridge University Press. Jackson, John E. 1975a. \Issues, Party Choices and Presidential Votes." American Journal of Political Science 19:161{185. Jackson, John E. 1975b. \Issues and Party Alignment." In Louis Maisel and Paul M. Sacks, eds. The Future of Political Parties. Beverly Hills: Sage. 28 Joreskog, Karl G. 1970. \Estimation and Testing of Simplex Models." British Journal of Mathematical and Statistical Psychology 23:121{145. Johnson, Norman and Samuel Kotz. 1969. Discrete Distributions. Houghton Miin. Johnston, Richard, Andre Blais, Henry Brady, Elisabeth Gidengil and Neil Nevitte. 1995. Canadian Election Study, 1993. Ann Arbor: Inter-university Consortium for Political and Social Research. Machine-Readable Data File. Johnston, Richard, et al. 1992. Canadian National Election Study, 1988. Ann Arbor: Interuniversity Consortium for Political and Social Research. Machine-Readable Data File. Johnston, Richard, Neil Nevitte and Henry Brady. 1994. \Campaign Dynamics in 1993: Liberals, Conservatives, and Reform.". Paper presented at the Annual Meeting of the Canadian Political Science Association. Keith, Bruce, David Magleby, Candice Nelson, Elizabeth Orr, Mark Westlye, and Raymond Wolnger. 1992. The Myth of the Independent Voter. Berkeley: University of California Press. Kemeny, John G. and J. Laurie Snell. 1960. Finite Markov Chains. Princeton, N.J.: Van Nostrand. LeDuc, Lawrence. 1994. The Leaders' Debates: Critical Event or Non-Event? Carleton University Press chapter 9, pp. 9{26. Lee, Judge, and Zellner. 1977. Estimating the Parameters of the Markov Probability Model from Aggregate Time Series Data. Number 65 in \Contributions to Economic Analysis" 2 ed. North-Holland Publishing Co. McCarthy, Colm and Terence Ryan. 1977. \Estimates of Voter Transition Probabilities from the British General Election of 1974." Journal of the Royal Society of Statistics 140(1):78{85. 29 MacRae, Elizabeth Chase. 1977. \Estimation of Time-Varying Markov Processes with Aggregate Data." Econometrica 45(1):183{198. Maddala, G. S. 1983. Limited-Dependent and Qualitative Variables. Econometric Society Monographs No. 3 Cambridge University Press. Magnus, Jan and Heinz Neudecker. 1988. Matrix Dierential Calculus with Applications in Statistics and Econometrics. Wiley. Markus, Gregory B., and Philip E. Converse. 1979. \A Dynamic Simultaneous Equation Model of Electoral Choice." American Political Science Review 73:1055{1070. Mebane, Walter R. Jr., and Jasjeet Sekhon. 1996. GENOUD: GENetic Optimization Using Derivatives. Computer program. Meier, Kenneth J. 1975. \Party Identication and Vote Choice: The Causal Relationship." Western Political Quarterly 28:496{505. Miller, Richard W. 1987. Fact and Method: Explanation, Conrmation and Reality in the Natural and the Social Sciences. Princeton: Princeton University Press. Miller, Warren E., and the National Election Studies. 1983. American National Election Study, 1980: Major Panel File [computer le]. Ann Arbor, MI: Center for Political Studies, Uni- versity of Michigan. [original producer]. Second ICPSR ed. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [producer and distributor]. Miller, Warren E., and the National Election Studies. 1985. American National Election Study, 1984: Continuous Monitoring Survey [computer le]. Ann Arbor, MI: Center for Political Studies, University of Michigan. [original producer]. First ICPSR ed. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [producer and distributor]. 30 Mott, Robert. 1993. \Identication and estimation of dynamic models with a time series of repeated cross-sections." Journal of Econometrics 59:99{123. Northrup, David. 1989? Technical Documentation: The 1988 Canadian Election Study. Technical report Institute for Social Science Research, York University. Page, Benjamin I., and Calvin C. Jones. 1979. \Reciprocal Eects of Policy Preferences, Party Loyalties and the Vote." American Political Science Review 73:1071{1089. Resnick, Sidney I. 1992. Adventures in Stochastic Processes. Boston: Birkhauser. Rosenqvist, Gunnar. 1986. Micro and Macro Data in Statistical Inference on Markov Chains. Number 36 in \Ekonomi Och Samhalle" Helsingfors: Swedish School of Economics and Business Administration. Theil, Henri. 1969. \A Multinomial Extension of the Linear Logit Model." International Economic Review 10(3):251{259. White, Halbert. 1994. Estimation, Inference and Specication Analysis. New York: Cambridge University Press. Wiley, James A., and Mary Glenn Wiley. 1974. \A Note on Correlated Errors in Repeated Measurements." Sociological Methods and Research 3:172{188. Woolstencroft, Peter. 1994. `Doing Politics Dierently': The Conservative Party and the Campaign of 1993. Carleton University Press chapter 2, pp. 9{26. 31 Table 1: Unconstrained Wave-to-wave Transition Matrix Estimates, 1980 ANES Major Panel Study January-February to June SDem Strong Democrat .7214 Democrat .1364 Democrat Leaner .0361 Independent .0179 Republican Leaner .0120 Republican .0000 Strong Republican .0000 Dem .2143 .6515 .2892 .0982 .0843 .0217 .0149 DemL .0500 .1061 .4337 .1875 .0361 .0072 .0149 Ind .0000 .0202 .0964 .4196 .0723 .0580 .0000 RepL .0071 .0253 .1325 .1875 .4819 .0942 .0149 Rep .0071 .0404 .0120 .0714 .2771 .6304 .2239 SRep .0000 .0202 .0000 .0179 .0361 .1884 .7313 Dem .1870 .7119 .2222 .1061 .0920 .0221 .0274 DemL .0407 .0565 .4691 .1515 .1034 .0074 .0000 Ind .0000 .0395 .1358 .5758 .1149 .0147 .0548 RepL .0000 .0113 .0864 .1667 .5287 .1250 .0274 Rep .0000 .0282 .0247 .0000 .1494 .6838 .1370 SRep .0000 .0056 .0123 .0000 .0115 .1397 .7534 September to November (Post-election) SDem Dem DemL Strong Democrat .7886 .1707 .0081 Democrat .1978 .6484 .0440 Democrat Leaner .0541 .1351 .5541 Independent .0267 .0667 .0933 Republican Leaner .0000 .0482 .0241 Republican .0000 .0248 .0248 Strong Republican .0000 .0000 .0139 Ind .0081 .0549 .1622 .6667 .0964 .0331 .0000 RepL .0000 .0220 .0811 .1067 .6506 .0826 .0417 Rep .0244 .0275 .0135 .0400 .1446 .6116 .1111 SRep .0000 .0055 .0000 .0000 .0361 .2231 .8333 June to September SDem Strong Democrat .7724 Democrat .1469 Democrat Leaner .0494 Independent .0000 Republican Leaner .0000 Republican .0074 Strong Republican .0000 Note: Maximum likelihood estimates. Number of cases for each pair of waves: Jan-Feb/Jun, 821; Jun/Sep, 743; Sep/Nov, 730. Table 2: Log Likelihoods for Markov Chain Specications for Party Identication During 1984 Specication No Measurement Error One Transition Matrix Two Transition Matrices With Measurement Error One Transition Matrix Two Transition Matrices Note: Number of cases: 3456. Parameters ,2 log-likelihood Total Free 13267.82 13244.50 13251.18 13244.01 18 30 25 37 13 16 14 18 Table 3: Parameter Estimates for the Two Transition Matrix Model without Measurement Error, 1984 ANES-CMS Data parameter 1 2 3 4 5 6 111 121 122 132 133 143 144 154 155 165 166 176 211 221 222 232 233 243 244 254 255 265 266 276 MLE SE {.043 .289 .569 .211 {4.997 | .382 .486 {9.992 | .948 .309 3.938 .140 15.912 | 19.991 | {4.165 1.066 1.984 .412 {1.394 .403 {11.938 | .643 .135 {2.674 3.299 14.000 | 14.977 | {6.338 .909 {13.999 | {9.946 | {15.500 | 1.586 3.918 3.442 .491 {10.979 | {1.000 | {.541 .234 {7.000 | {8.102 .490 {9.022 .495 {7.000 | Note: The second transition matrix begins at day t = 120 (May 10, 1984). SEs are robust. SEs shown as \|" indicate a xed (boundary-constrained) parameter. Number of cases: 3456. Table 4: Party Identication Markov Chain Estimates During 1984 SDem Initial Distribution .1232 Dem DemL Ind RepL Rep SRep .2271 .0009 .1884 .0000 .3319 .1286 Transition Probability Matrices Days 1{119 SDem Dem DemL Ind RepL Rep SRep Strong Democrat .9809 .0191 .0000 .0000 .0000 .0000 .0000 Democrat .0166 .9834 .0000 .0000 .0000 .0000 .0000 Democrat Leaner .0000 .0019 .8774 .1207 .0000 .0000 .0000 Independent .0000 .0000 .1987 .0000 .8013 .0000 .0000 Republican Leaner .0000 .0000 .0000 .6402 .0232 .3366 .0000 Republican .0000 .0000 .0000 .0000 .2733 .7267 .0000 Strong Republican .0000 .0000 .0000 .0000 .0000 .0018 .9982 Days 120{331 SDem Dem DemL Ind RepL Rep SRep Strong Democrat .0000 1.0000 .0000 .0000 .0000 .0000 .0000 Democrat .8121 .1858 .0021 .0000 .0000 .0000 .0000 Democrat Leaner .0000 .0003 .9953 .0044 .0000 .0000 .0000 Independent .0000 .0000 .0000 .0000 1.0000 .0000 .0000 Republican Leaner .0000 .0000 .0000 .5255 .0000 .4745 .0000 Republican .0000 .0000 .0000 .0000 .4849 .5138 .0013 Strong Republican .0000 .0000 .0000 .0000 .0000 .0000 1.0000 Note: Probabilities are evaluated at the parameter MLEs. Day 1 is January 11, 1984. Election day is day 300 (November 6, 1984). Table 5: Party Identication Hitting Time Distributions During 1984, First Quartiles and Medians First Quartile Days 1{119 SDem Dem DemL Ind RepL Rep SRep Strong Democrat 0 15 1 1 1 1 1 Democrat 17 0 1 1 1 1 1 Democrat Leaner 524 507 0 2 3 5 1 Independent 530 512 5 0 1 3 1 Republican Leaner 531 513 6 1 0 2 1 Republican 532 514 7 2 1 0 1 Strong Republican 695 678 171 165 164 163 0 Days 120{331 SDem Dem DemL Ind RepL Rep SRep Strong Democrat 0 1 251 335 335 336 892 Democrat 1 0 250 335 335 336 892 Democrat Leaner 61 61 0 84 84 85 641 Independent 1 1 1 0 1 2 557 Republican Leaner 1 1 1 1 0 1 556 Republican 1 1 1 2 1 0 555 Strong Republican 1 1 1 1 1 1 0 Median Days 1{119 SDem Dem DemL Ind RepL Rep SRep Strong Democrat 0 36 1 1 1 1 1 Democrat 42 0 1 1 1 1 1 Democrat Leaner 1263 1222 0 5 8 14 1 Independent 1277 1235 13 0 2 8 1 Republican Leaner 1279 1238 15 2 0 6 1 Republican 1282 1240 18 5 2 0 1 Strong Republican 1676 1634 412 399 396 394 0 Days 120{331 SDem Dem DemL Ind RepL Rep SRep Strong Democrat 0 1 604 808 809 811 2150 Democrat 1 0 604 807 808 810 2149 Democrat Leaner 147 147 0 204 204 207 1545 Independent 1 1 1 0 1 3 1342 Republican Leaner 1 1 1 2 0 2 1341 Republican 1 1 1 4 1 0 1338 Strong Republican 1 1 1 1 1 1 0 Note: Hitting time distributions are evaluated at the parameter MLEs. Day 1 is January 11, 1984. Election day is day 300 (November 6, 1984). Table 6: Log Likelihoods for Markov Chain Specications for Vote Proportions in Canada, 1993 Parameter Counts Specication ,2 log-likelihood Total Free Pooled, Four Transitons Matrices Changes at Days 10,25,35 5736:57 108 88 More Aware, Four Transitons Matrices Changes at Days 10,25,35 2734:1 81 60 Less Aware, Four Transitons Matrices Changes at Days 10,25,35 2970:01 81 61 More Aware, Three Transitons Matrices Changes at Days 10,25 2743:0 63 49 Less Aware, Three Transitons Matrices Changes at Days 25,35 2984:63 63 53 Note: Number of cases: 1212 (Unware), 1068 (Aware). Table 7: Dierence between Expected Vote Proportions on Election Day, With and Without Last Transition Matrix Change, by Province and Awareness Level in Canada 1993 PC BC Liberal Reform Other PC :24 :35 ,0:11 :49 :29 :20 :02 :32 ,0:29 :47 :23 :24 :32 :41 :20 :41 :11 ,0:00 :12 :20 ,0:08 :44 :33 :10 Less Aware Change Day(s) 25 & 35 :06 :21 25 only :13 :24 Dierence ,0:06 ,0:03 More Aware Change Day(s) 10 & 25 :05 :22 10 only :24 :14 Dierence ,0:19 :08 Ontario Liberal Reform Other :08 :13 ,0:05 :43 :32 :10 :19 :26 :14 :33 :05 ,0:07 Figure 1: Trajectories of the Distribution of PID for the MLEs and Selected Parameter Sets from the 95% Condence Ellipsoid, Democrat PIDs 0.2 0.0 0.1 Proportion 0.3 Strong Democrat 0 100 200 300 200 300 Day 0.2 0.0 0.1 Proportion 0.3 Democrat 0 100 Day 0.2 0.1 0.0 Proportion 0.3 Democrat Leaner 0 100 200 Day 300 Figure 2: Trajectories of the Distribution of PID for the MLEs and Selected Parameter Sets from the 95% Condence Ellipsoid, Independent PID 0.2 0.1 0.0 Proportion 0.3 Independent 0 100 200 Day 300 Figure 3: Trajectories of the Distribution of PID for the MLEs and Selected Parameter Sets from the 95% Condence Ellipsoid, Republican PIDs 0.2 0.0 0.1 Proportion 0.3 Republican Leaner 0 100 200 300 200 300 Day 0.2 0.0 0.1 Proportion 0.3 Republican 0 100 Day 0.2 0.1 0.0 Proportion 0.3 Strong Republican 0 100 200 Day 300 Figure 4: B.C. Daily Vote Proportions by Party and Awareness, DFMC estimates 1.0 1.0 0.8 PC / More Aware 0.8 PC / Less Aware • 0.2 •• • • •• •• • • 20 • •• •• •• • •••• 30 40 0.6 • •• •• • 0 • • • • • • 10 • • • • • • • •• • • 20 ••••••• •• • 30 40 Day Liberal / Less Aware Liberal / More Aware 0.8 0.8 • • • • • • • • • • • 10 ••• •• 20 • • •• • • •• 30 • 40 • •• • •• 0 • • • 10 • • • • • 20 • 40 Reform / More Aware 1.0 Reform / Less Aware • 0.8 • • 0.0 ••• 0 ••• 10 • • • ••••• • • • • 20 • • • • 30 Day • •• 40 •• 0.6 • • •• • • • • • • • •• 0.2 0.2 • • • •• • • • • • 0.0 0.4 • • • • 0.4 Proportion 0.6 0.8 • • • 30 Day • • • Day • • • • • • •• ••• • • • • • 0.0 •• • • • •• 1.0 0.0 • • •• 0 • • • • • •• • • • 0.2 • 0.2 • • • 0.4 Proportion 0.6 • 0.6 • • 0.4 Proportion • • Day • Proportion • • • • • • • • • • • • ••• • 10 1.0 0 • • • 1.0 0.0 • • • 0.2 • • • 0.0 •• • • 0.4 Proportion 0.6 • 0.4 Proportion • •• 0 • • •• 10 • • • • • • • • 20 • • •• • • • 30 Day • • •• • • • 40 • • Figure 5: Ontario Daily Vote Proportions by Party and Awareness, DFMC estimates 1.0 1.0 0.8 PC / More Aware 0.8 PC / Less Aware • • • • • 0.0 • • • • • • • • • • • • • • 10 • •• 20 • • • 30 • • • • • • • • • •• • • • • • •• 10 • • • • 0 • • • •• 20 •• •• 30 • •• 40 Day Liberal / Less Aware Liberal / More Aware 0.8 • • • 0.2 • • • • • 0.0 • •• • • • • 10 20 30 40 • • ••• • • • • • •• • • • 0 • • • • • • • 10 20 30 40 Day Reform / Less Aware Reform / More Aware 0.8 0.8 1.0 Day 1.0 • • • • • • • 0 • • • • • • • • • • • • • • • • • •• • • • • • • • 0.2 • • • 0.0 0.4 •• •• • • 0.6 • • • • • • • Proportion 0.6 • • • 0.4 0.8 1.0 Day • Proportion 0.6 40 • • • • • • • • • 1.0 0 • •••• • • • •• • • • • • 0.2 0.2 • • • 0.0 • • 0.4 Proportion 0.6 • 0.4 Proportion • • • •• • ••• 0 10 • • •••• ••• •• • • •• ••• 20 • 30 Day • ••• • • • •• 40 • • • 0.6 0.2 • • • • • ••••••• • 0 • • • • •• • • 10 • •• •• • • • • • • • 0.0 0.2 • • 0.4 Proportion 0.6 0.4 • 0.0 Proportion • • • • • • • • • •• 20 30 Day • • 40 • • -0.3 -0.3 0.0 0.1 0 0.2 0.3 10 10 -0.3 -0.3 0.0 0.1 20 30 20 Day 30 40 0.0 0.1 0.2 0.2 0.3 0.3 30 -0.1 Difference in Proportion -0.1 20 0.1 0.3 0 10 0.0 0.2 Difference in Proportion 0 -0.1 Difference in Proportion -0.1 Difference in Proportion -0.3 -0.3 0.0 0.1 -0.1 0.0 0.1 Difference in Proportion -0.1 Difference in Proportion 0.2 0.2 0.3 0.3 Figure 6: Dierence in Daily Vote Proportions for BC and Ontario, High minus Low Awareness DFMC estimates BC PC Ontario PC Day 40 0 40 0 0 10 10 10 20 Day 30 BC Liberal Ontario Liberal Day 20 Day 30 BC Reform Ontario Reform 20 Day 30 40 40 40