ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	9

Descriptor

Item Response Theory	15
Sampling	11
Item Sampling	6
Models	5
Error of Measurement	4
Test Items	4
Bayesian Statistics	3
Computation	3
Evaluation Methods	3
Mathematics Tests	3
Research Methodology	3
Sample Size	3
Achievement Tests	2
Comparative Analysis	2
Computer Software	2
Educational Improvement	2
Elementary Secondary Education	2
Equations (Mathematics)	2
Foreign Countries	2
High Schools	2
Inferences	2
Measurement Techniques	2
National Competency Tests	2
Psychometrics	2
Questionnaires	2
More ▼

Source

Journal of Educational and…	2
Applied Psychological…	1
Council of Chief State School…	1
Educational Measurement:…	1
Educational and Psychological…	1
Journal of Educational…	1
National Center for Education…	1
Psicologica: International…	1
Psychological Review	1
Psychometrika	1
Sociological Methods &…	1
Studies in Educational…	1
More ▼

Publication Type

Reports - Descriptive	15
Journal Articles	11
Guides - General	1
Opinion Papers	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Secondary Education	2
Elementary Education	1
Grade 4	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Location

China

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
SAT (College Admission Test)	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Direct link

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

TIMSS 2015: Illustrating Advancements in Large-Scale International Assessments

Peer reviewed

Direct link

Martin, Michael O.; Mullis, Ina V. S. – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments of student achievement such as International Association for the Evaluation of Educational Achievement's Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study and Organization for Economic Cooperation and Development's Program for International…

Descriptors: Achievement Tests, International Assessment, Mathematics Tests, Science Achievement

An Efficient Alternative Mixed Randomized Response Procedure

Peer reviewed

Direct link

Singh, Housila P.; Tarray, Tanveer A. – Sociological Methods & Research, 2015

In this article, we have suggested a new modified mixed randomized response (RR) model and studied its properties. It is shown that the proposed mixed RR model is always more efficient than the Kim and Warde's mixed RR model. The proposed mixed RR model has also been extended to stratified sampling. Numerical illustrations and graphical…

Descriptors: Item Response Theory, Models, Efficiency, Comparative Analysis

A Strategy for Developing a Common Metric in Item Response Theory when Parameter Posterior Distributions Are Known

Peer reviewed

Direct link

Baldwin, Peter – Journal of Educational Measurement, 2011

Growing interest in fully Bayesian item response models begs the question: To what extent can model parameter posterior draws enhance existing practices? One practice that has traditionally relied on model parameter point estimates but may be improved by using posterior draws is the development of a common metric for two independently calibrated…

Descriptors: Item Response Theory, Bayesian Statistics, Computation, Sampling

Parsimonious Testing of Transitive or Intransitive Preferences: Reply to Birnbaum (2011)

Peer reviewed

Direct link

Regenwetter, Michel; Dana, Jason; Davis-Stober, Clintin P.; Guo, Ying – Psychological Review, 2011

Birnbaum raised important challenges to testing transitivity. We summarize why an approach based on counting response patterns does not solve these challenges. Foremost, we show why parsimonious tests of transitivity require at least 5 choice alternatives. While the approach of Regenwetter, Dana, and Davis-Stober achieves high power with modest…

Descriptors: Testing, Item Response Theory, Responses, Evaluation Methods

Addressing Two Commonly Unrecognized Sources of Score Instability in Annual State Assessments

Download full text

Doorey, Nancy A. – Council of Chief State School Officers, 2011

The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…

Descriptors: Testing, Sampling, Expertise, Testing Programs

A Rasch Perspective

Peer reviewed

Direct link

Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007

Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…

Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory

High School Longitudinal Study of 2009 (HSLS:09): Base-Year Data File Documentation. NCES 2011-328

Peer reviewed
PDF on ERIC

Download full text

Ingels, Steven J.; Pratt, Daniel J.; Herget, Deborah R.; Burns, Laura J.; Dever, Jill A.; Ottem, Randolph; Rogers, James E.; Jin, Ying; Leinwand, Steve – National Center for Education Statistics, 2011

The High School Longitudinal Study of 2009 (HSLS:09) is the fifth in a series of National Center for Education Statistics (NCES) secondary longitudinal studies. The core research questions for HSLS:09 explore secondary to postsecondary transition plans and the evolution of those plans; the paths into and out of science, technology, engineering,…

Descriptors: High Schools, Longitudinal Studies, Secondary Education, School Statistics

An Introduction to the DA-T Gibbs Sampler for the Two-Parameter Logistic (2PL) Model and beyond

Peer reviewed
PDF on ERIC

Download full text

Maris, Gunter; Bechger, Timo M. – Psicologica: International Journal of Methodology and Experimental Psychology, 2005

The DA-T Gibbs sampler is proposed by Maris and Maris (2002) as a Bayesian estimation method for a wide variety of "Item Response Theory (IRT) models". The present paper provides an expository account of the DA-T Gibbs sampler for the 2PL model. However, the scope is not limited to the 2PL model. It is demonstrated how the DA-T Gibbs…

Descriptors: Bayesian Statistics, Computation, Item Response Theory, Models

An NCME Instructional Module on Estimating Item Response Theory Models Using Markov Chain Monte Carlo Methods

Peer reviewed

Direct link

Kim, Jee-Seon; Bolt, Daniel M. – Educational Measurement: Issues and Practice, 2007

The purpose of this ITEMS module is to provide an introduction to Markov chain Monte Carlo (MCMC) estimation for item response models. A brief description of Bayesian inference is followed by an overview of the various facets of MCMC algorithms, including discussion of prior specification, sampling procedures, and methods for evaluating chain…

Descriptors: Placement, Monte Carlo Methods, Markov Processes, Measurement

RESGEN Item Response Generator. 1990 Version 1.01.

Download full text

Muraki, Eiji – 1992

RESGEN is a computer program designed to generate simulated latent trait distributions and then dichotomous or polytomous item responses based on item response models. The latent trait distributions can be univariate or multivariate normal, log-normal, uniform, or gamma. The item response models utilized in this program may have characteristics…

Descriptors: Computer Software, Computer Software Development, Item Response Theory, Models

Reliability as a Measurement Design Effect

Peer reviewed

Direct link

Adams, Raymond J. – Studies in Educational Evaluation, 2005

Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…

Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory

Using Set Covering with Item Sampling to Analyze the Infeasibility of Linear Programming Test Assembly Models

Peer reviewed

Direct link

Huitzing, Hiddo A. – Applied Psychological Measurement, 2004

This article shows how set covering with item sampling (SCIS) methods can be used in the analysis and preanalysis of linear programming models for test assembly (LPTA). LPTA models can construct tests, fulfilling a set of constraints set by the test assembler. Sometimes, no solution to the LPTA model exists. The model is then said to be…

Descriptors: Mathematical Applications, Simulation, Item Sampling, Item Response Theory

Analysis of Distractor Difficulty in Multiple-Choice Items

Peer reviewed

Direct link

Revuelta, Javier – Psychometrika, 2004

Two psychometric models are presented for evaluating the difficulty of the distractors in multiple-choice items. They are based on the criterion of rising distractor selection ratios, which facilitates interpretation of the subject and item parameters. Statistical inferential tools are developed in a Bayesian framework: modal a posteriori…

Descriptors: Multiple Choice Tests, Psychometrics, Models, Difficulty Level

The Effects of Sample Size and Matching Strategy on Mantel-Haenszel and Logit DIF Procedures.

Download full text

Schultz, Matthew T.; Geisinger, Kurt F. – 1992

Research efforts have established that the Mantel-Haenszel procedure (MHP) is an effective method for detecting the presence of test items exhibiting differential item functioning (DIF). While the MHP has been advocated for situations where item response theory based methods may not be usable, recent findings have suggested that the performance of…

Descriptors: College Entrance Examinations, Comparative Analysis, Control Groups, Equations (Mathematics)

Adams, Raymond J.	1
Baldwin, Peter	1
Bechger, Timo M.	1
Bolt, Daniel M.	1
Burns, Laura J.	1
Dana, Jason	1
Davis-Stober, Clintin P.	1
Dever, Jill A.	1
Doorey, Nancy A.	1
Geisinger, Kurt F.	1
Guo, Ying	1
Herget, Deborah R.	1
Huitzing, Hiddo A.	1
Ingels, Steven J.	1
Jiang, Yu	1
Jin, Ying	1
Kim, Jee-Seon	1
Leinwand, Steve	1
Maris, Gunter	1
Martin, Michael O.	1
Mullis, Ina V. S.	1
Muraki, Eiji	1
Ottem, Randolph	1
Pratt, Daniel J.	1
Regenwetter, Michel	1
More ▼