Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 0
  Since 2016 (last 10 years): 3
  Since 2006 (last 20 years): 8
Descriptor
  Difficulty Level: 10
  Error of Measurement: 10
  Psychometrics: 10
  Test Items: 9
  Item Response Theory: 7
  Equated Scores: 3
  Scoring: 3
  Test Construction: 3
  Test Validity: 3
  Elementary School Students: 2
  Fractions: 2
Source
  Grantee Submission: 2
  Applied Measurement in…: 1
  Behavioral Research and…: 1
  International Journal of…: 1
  International Journal of…: 1
  Journal of Educational…: 1
  Online Submission: 1
Author
  Paek, Insu: 2
  Schoen, Robert C.: 2
  Yang, Xiaotong: 2
  Alonzo, Julie: 1
  Antal, Judit: 1
  Duong, Minh Q.: 1
  Jiao, Hong: 1
  Jin, Ying: 1
  Kamata, Akihito: 1
  Liu, Kimy: 1
  Liu, Sicong: 1
Publication Type
  Reports - Research: 9
  Journal Articles: 4
  Speeches/Meeting Papers: 2
  Numerical/Quantitative Data: 1
  Reports - Descriptive: 1
  Tests/Questionnaires: 1
Education Level
  Elementary Education: 3
  Grade 3: 2
  Grade 4: 2
  Early Childhood Education: 1
  Elementary Secondary Education: 1
  Grade 1: 1
  Grade 2: 1
  Grade 5: 1
  Intermediate Grades: 1
  Junior High Schools: 1
  Kindergarten: 1
Audience
  Researchers: 1
Location
  Philippines: 1
Assessments and Surveys
  Medical College Admission Test: 1
Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020
Computer adaptive multistage testing (ca-MST), which takes advantage of computer technology and adaptive test forms, is widely used and has become a prominent topic in assessment and evaluation. This study aims to analyze the effect of different panel designs, module lengths, and different sequences of a-parameter values across stages and change in…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory
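The routing idea behind multistage designs like those studied above can be sketched in a few lines. This is a minimal illustration, not the study's actual panel design: the module names, item count, and cut scores are assumptions chosen for the example.

```python
# Minimal sketch of routing in a two-stage multistage test.
# Cut scores and the 10-item routing module are illustrative only.

def route(stage1_score: int, n_items: int = 10) -> str:
    """Route an examinee to an easy, medium, or hard second-stage
    module based on proportion correct in the routing module."""
    p = stage1_score / n_items
    if p < 0.4:
        return "easy"
    elif p < 0.7:
        return "medium"
    return "hard"

print(route(3))  # low score -> "easy"
print(route(9))  # high score -> "hard"
```

A real ca-MST panel would route on an IRT ability estimate rather than raw proportion correct, but the branching structure is the same.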
Schoen, Robert C.; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2018
This report provides evidence of the substantive and structural validity of the Knowledge for Teaching Elementary Fractions Test. Field-test data were gathered with a sample of 241 elementary educators, including teachers, administrators, and instructional support personnel, in spring 2017, as part of a larger study involving a multisite…
Descriptors: Psychometrics, Pedagogical Content Knowledge, Mathematics Tests, Mathematics Instruction
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014
In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…
Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory
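Common-item equating of the kind discussed above typically places one form's item parameters onto another's scale via a linear transformation estimated from the anchor block. The mean-sigma method below is a standard textbook approach, sketched with made-up anchor difficulties, not the article's data.

```python
# Mean-sigma linking from common (anchor) items: estimate the linear
# transformation theta_old = A * theta_new + B from the anchor items'
# difficulty means and standard deviations. Values are illustrative.
import statistics

b_anchor_old = [-1.0, -0.2, 0.3, 1.1]  # anchor difficulties, old-form scale
b_anchor_new = [-0.8, 0.0, 0.5, 1.3]   # same items, new-form calibration

A = statistics.stdev(b_anchor_old) / statistics.stdev(b_anchor_new)
B = statistics.mean(b_anchor_old) - A * statistics.mean(b_anchor_new)

def to_old_scale(b_new: float) -> float:
    """Place a new-form difficulty on the old form's scale."""
    return A * b_new + B

print(to_old_scale(0.5))  # 0.5 on the new scale maps to 0.3 on the old
```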
Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012
The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…
Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods
Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012
Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…
Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis
Magno, Carlo – Online Submission, 2009
The present report demonstrates the difference between the classical test theory (CTT) and item response theory (IRT) approaches using actual chemistry test data from junior high school students. The CTT and IRT results were compared across two samples and two forms of the test on item difficulty, internal consistency, and measurement errors. The specific…
Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries
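The CTT side of such a comparison starts from item difficulty as the proportion of examinees answering correctly. A minimal sketch on a toy 0/1 response matrix (illustrative data, not the report's chemistry test):

```python
# CTT item difficulty (p-value) = proportion correct per item.
# Rows are examinees, columns are items; toy data for illustration.

responses = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [0, 1, 0, 1],
    [1, 1, 1, 1],
]

n = len(responses)
p_values = [sum(row[j] for row in responses) / n
            for j in range(len(responses[0]))]
# Under CTT, a higher p-value means an easier item.
print(p_values)  # [0.75, 0.75, 0.25, 1.0]
```

Unlike IRT difficulty parameters, these p-values are sample-dependent, which is one of the contrasts such CTT-versus-IRT studies examine.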
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007
In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…
Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education
Wise, Lauress L. – 1986
A primary goal of this study was to determine the extent to which item difficulty was related to item position and, if a significant relationship was found, to suggest adjustments to predicted item difficulty that reflect differences in item position. Item response data from the Medical College Admission Test (MCAT) were analyzed. A data set was…
Descriptors: College Entrance Examinations, Difficulty Level, Educational Research, Error of Measurement
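The position-effect question above can be framed as a simple regression of item difficulty on serial position: a negative slope on p-values means items tend to look harder when they appear later. The numbers below are invented for illustration, not MCAT data.

```python
# Least-squares slope of item p-value on item position.
# A negative slope suggests later-placed items appear harder.
positions = [1, 2, 3, 4, 5]
p_values = [0.80, 0.76, 0.74, 0.70, 0.65]  # toy proportions correct

n = len(positions)
mx = sum(positions) / n
my = sum(p_values) / n
slope = sum((x - mx) * (y - my) for x, y in zip(positions, p_values)) \
        / sum((x - mx) ** 2 for x in positions)
intercept = my - slope * mx
print(slope)  # negative: difficulty increases with position
```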
Smith, Richard M. – 1983
Measurement disturbances, such as guessing, startup, and plodding, often result in an examinee's ability being either over- or under-estimated by the maximum likelihood estimation employed in latent trait psychometric models. Several authors have suggested methods to lessen the impact of unexpected responses on the ability estimation process. This…
Descriptors: Difficulty Level, Error of Measurement, Estimation (Mathematics), Goodness of Fit
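The maximum likelihood ability estimation that such disturbances bias can be sketched for the Rasch model with a few Newton-Raphson steps. Item difficulties and the response pattern below are illustrative, not from the paper.

```python
# Newton-Raphson MLE of ability (theta) under the Rasch model.
# A guessed-correct hard item or a careless miss on an easy item shifts
# this estimate, which is the disturbance problem the paper addresses.
import math

def rasch_p(theta: float, b: float) -> float:
    """Probability of a correct response given ability and difficulty."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def mle_theta(responses, bs, theta=0.0, iters=25):
    for _ in range(iters):
        ps = [rasch_p(theta, b) for b in bs]
        grad = sum(u - p for u, p in zip(responses, ps))  # score function
        info = sum(p * (1 - p) for p in ps)               # information
        theta += grad / info
    return theta

bs = [-1.5, -0.5, 0.0, 0.5, 1.5]        # illustrative difficulties
theta_hat = mle_theta([1, 1, 1, 0, 0], bs)
print(round(theta_hat, 2))
```

With three of five items correct on this symmetric difficulty set, the estimate lands a little above zero; a perfect or zero raw score would have no finite MLE, one reason robust alternatives are studied.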