Tutorial 6: Population Model(2013-7-31) |
|
Home 1. Install R 2. Install TAM 3. Rasch Model 4. CTT, Fit 5. Partial Credit Model 6. Population Model 7. DIF More resources |
Summary of TutorialThis tutorial shows how to obtain population statistics of latent trait, for example, mean and variance of ability estimates for a population or sub-population of students. The tutorial shows a number of R commands for computing frequencies, cross-tabulation and group statistics. For an extension, this tutorial explains and contrasts different IRT models for obtaining population estimates. The R script file for this tutorial can be downloaded through the link Tutorial6.R . Data filesThe data file is from TIMSS 2011 mathematics tests, released items, blocks 1 and 14, for Taiwan and Australian students. It is the same data file as for Tutorial 5. However, the item responses have already been scored. The four blank records have also been removed. The test paper and the data file can be downloaded through the following links: TIMSS released items block 1TIMSS 2011 Taiwan and Australian data in csv format - scored data R script fileLoad R librariesBefore TAM functions can be called, the TAM library has to be loaded into the R workspace. Line 2 loads the TAM library. Read data file
Line 5 sets the working directory and line 6 reads the data file
"TIMSS_AUS_TWN_scored.csv"
into a variable called "resp". Line 7 extracts the item responses into
a variable called "scored", as columns 12 - 14 contain country code,
gender code and book ID respectively. Run IRT analysis (MML)Before describing the IRT output, we give a brief explanation of the Marginal Maximum Likelihood (MML) estimation method. In the simple Rasch model, there is a mathematical function (a logistic function) for modelling the probability of success of a person on an item, given the person's ability and the item difficulty, as shown below: mod1 <- tam(scored) The TAM function "summary" provides a summary of results (see line 11). summary(mod1) An excerpt of the summary output is shown below: Note that as σ2 is a parameter of the IRT model, it is "directly" estimated from the item response data. It is not estimated based on a two-step process where student abilities are first individually estimated, and then the variance of the estimated abilities is computed.
To illustrate this difference, a joint maximum likelihood (JML) estimation is run (see line 14),
as JML makes no assumption about the population model. In summary, if you are interested in reporting population charateristics such as the mean and variance of the ability distribution, percentiles and histograms, etc., it will be better to use MML, since there is a population model assumption in computing the likelihood of item responses and the estimation of population characteristics is "direct" and not a two-step process. On the other hand, if you are only interested in providing individual student ability estimates and/or item estimates, then JML method is a better choice. Comparing two groupsIn the data set for this tutorial, item responses for Taiwan and Australia are included. Suppose we are interested in comparing the ability distributions of these two countries. Before running an IRT analysis, we will first explore the raw data in a little more depth.
Lines 18 to 21 set up a "country" variable, where Australia (country code 36) is coded as country 1, and Taiwan (country code 158) is coded as country 2. Compute raw scores
We will first take a look at the distribution of test scores for each country. Line 24 computes the raw score on the test for each student. It can be seen from this frequency table that the performances of the two countries differ greatly, with Taiwan students performing much higher on this test. Lines 31-34 plot two histograms of test scores on the same graph in a new window. As the histograms show, there is a considerable floor and ceiling effect if we use test raw scores to build population distributions of abilities. The use of MML estimation method with a population model assumption can address some of the problems caused by floor and ceiling effects.
For comparing two or more groups of students, the population model needs to take into account of different distributions for the groups.
For example, for two groups of students, the population model may assume two separate normal distributions as illustrated below: In this way, the mean and variance for each group are explicitly modelled, and estimations of these parameters are "direct" in the sense that they are directly estimated from the item responses. In contrast, if we use the IRT model estimated in "mod1" (see line 10 of R script file), then only an overall mean and variance are estimated in the IRT run, and not the group means and variances. Parameters for groups will need to be estimated through the generation of ability estimates first. If model 1 is run (mod1 in the R script file) where the population model makes no distinction between the two groups, the model is actually "mis-specified" for any comparison of the two groups.
Line 41 of the R script file runs an IRT model where the population model assumes separate normal distributions for the two countries. The estimated variances for the two groups are 1.54 and 2.78 respectively, and the mean scores are 0 and 2.06 (under the heading of regression coefficients). The TAM default is to set the mean for group 1 to zero. The results show that not only the two countries differ in mean performance, but the variances are quite different too. ExtensionIn this section, a simulation study is presented to compare JML and MML in recovering population parameters. See R code from line 44 to line 63. Item responses are simulated to conform to the Rasch model. First, use JML to estimate item and person parameters. The mean and variance of the ability distribution is computed from WLE ability estimates. The estimated variance is larger than simulated variance (1.4 versus 1). This is a two-step process where the abilities of individuals are estimated, and then population characterisitcs are formed based on individual ability estimates.In contrast, use the MML estimation method. The mean and variance of the ability distribution are parameters of the IRT model, so they are "directly" estimated. In this example, the estimated variance is 1.01. The mean is set to zero as this is the default constraint. As an exercise, vary the sample size (N) and test length (I) to see how these affect the estimation of the mean and variance of the ability distribution uing JML and MML.For example, change I to 5 and 40. |