Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 2 |
Descriptor
Source
Applied Measurement in… | 1 |
Applied Psychological… | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
National Center for Education… | 1 |
Author
Bock, R. Darrell | 3 |
Zimowski, Michele F. | 3 |
Wainer, Howard | 2 |
Brennan, Robert L. | 1 |
Cheng, Philip E. | 1 |
Chun Wang | 1 |
Hamilton, Laura | 1 |
Jewsbury, Paul A. | 1 |
Klein, Stephen P. | 1 |
Liou, Michelle | 1 |
Ping Chen | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 7 |
Journal Articles | 4 |
Opinion Papers | 3 |
Reports - Research | 3 |
Speeches/Meeting Papers | 2 |
Reports - Descriptive | 1 |
Education Level
Elementary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Secondary Education | 1 |
Audience
Researchers | 2 |
Policymakers | 1 |
Location
Ohio | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 14 |
What Works Clearinghouse Rating
Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…
Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing
Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…
Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics

Cheng, Philip E.; Liou, Michelle – Applied Psychological Measurement, 2000
Reviewed methods of estimating theta suitable for computerized adaptive testing (CAT) and discussed the differences between Fisher and Kullback-Leibler information criteria for selecting items. Examined the accuracy of different CAT algorithms using samples from the National Assessment of Educational Progress. Results show when correcting for…
Descriptors: Ability, Adaptive Testing, Algorithms, Computer Assisted Testing
Wainer, Howard; Thissen, David – 1992
If examinees are permitted to choose to answer a subset of the questions on a test, just knowing which questions were chosen can provide a measure of proficiency that may be as reliable as would have been obtained from the test graded traditionally. This new method of scoring is much less time consuming and expensive for both the examinee and the…
Descriptors: Adaptive Testing, Cost Effectiveness, Responses, Scoring

Brennan, Robert L. – Applied Measurement in Education, 1992
A conceptual framework and heuristic model for considering the existence, magnitude, and consequences of context effects are presented through an extension of some generalizability theory concepts. Context effects are often misunderstood, and current measurement models have serious limitations for examining them. Their importance needs to be…
Descriptors: Adaptive Testing, Context Effect, Equated Scores, Equations (Mathematics)
Ysseldyke, James E.; And Others – 1994
This report is a summary of a March 1994 meeting to agree on guidelines for making inclusion and accommodation decisions concerning students with disabilities in national and state large-scale assessments. Much of the discussion at the meeting focused on the National Assessment of Educational Progress. Factors that lead to the exclusion of…
Descriptors: Academic Standards, Adaptive Testing, Decision Making, Disabilities
Tatsuoka, Kikumi K. – 1982
This study introduced a probabilistic model utilizing item response theory (IRT) for dealing with a variety of misconceptions. The model can be used for evaluating the transition behavior of error types, advancement of learning stages, or the stability and persistence of particular misconceptions. Moreover, it apparently can be used for relating…
Descriptors: Adaptive Testing, Elementary Secondary Education, Error Patterns, Evaluation Methods
Reckase, Mark D. – 1986
The current technology of computerized testing is discussed, and a few comments are made on how such technology might be used for assessing school-related skills as part of the National Assessment of Educational progress (NAEP). The critical feature of computerized assessment procedures is that the test items are presented in interactive fashion,…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Disabilities
Bock, R. Darrell; Zimowski, Michele F. – National Center for Education Statistics, 2003
This report examines the potential of adaptive testing, two?-stage testing in particular, for improving the data quality of the National Assessment of Educational Progress (NAEP). Following a discussion of the rationale for adaptive testing in assessment and a review of previous studies of two-?stage testing, this report describes a 1993 Ohio…
Descriptors: National Competency Tests, Test Validity, Feasibility Studies, Educational Assessment
Bock, R. Darrell; Zimowski, Michele F. – 2003
This paper discusses the rationale for enhancing the current National Assessment of Educational Progress (NAEP) design by adding a capacity for adaptive testing. Items are tailored to the achievement level of the student in adaptive testing. The report describes a 1993 Ohio field trail of two-stage assessment carried out by the National Opinion…
Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Field Tests
Klein, Stephen P.; Hamilton, Laura – 1999
This paper reviews the salient characteristics of the current inventory of large-scale achievement tests, analyzes two recent proposals for creating a national testing program, and describes a new approach to statewide and nationwide testing that RAND is examining. Section 2 discusses the criteria typically used for evaluating large-scale testing…
Descriptors: Academic Achievement, Achievement Tests, Adaptive Testing, Computer Assisted Testing
Wainer, Howard; And Others – 1992
Four researchers at the Educational Testing Service describe what they consider some of the most vexing research problems they face. While these problems are not completely statistical, they all have major statistical components. Following the introduction (section 1), in section 2, "Problems with the Simultaneous Estimation of Many True…
Descriptors: Adaptive Testing, Bayesian Statistics, Educational Research, Estimation (Mathematics)
Bock, R. Darrell; Zimowski, Michele F. – 1998
This report examines the potential of adaptive testing, two-stage testing in particular, for improving the data quality of the National Assessment of Educational Progress (NAEP). Following a discussion of the rationale for adaptive testing in assessment and a review of previous studies of two-stage testing, this report describes a 1993 Ohio field…
Descriptors: Adaptive Testing, Data Analysis, Educational Assessment, Elementary Secondary Education
Walberg, Herbert J. – 1985
The value of statistical research depends on valid comparisons which can usefully influence educational policy. Educational research needs to extend the measures of learning (such as the National Assessment of Educational Progress) through nationally-calibrated absolute measures and through computer-assisted and adaptive testing. Direct sampling…
Descriptors: Academic Achievement, Academic Standards, Adaptive Testing, Computer Assisted Testing