ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Difficulty Level	8
Error of Measurement	8
Scores	8
Test Items	6
Item Response Theory	5
Computation	3
Accuracy	2
Comparative Analysis	2
Elementary School Students	2
Estimation (Mathematics)	2
Foreign Countries	2
Generalizability Theory	2
Test Construction	2
Test Reliability	2
Undergraduate Students	2
Adaptive Testing	1
Bayesian Statistics	1
Classification	1
College Entrance Examinations	1
Concept Formation	1
Correlation	1
Critical Reading	1
Diagnostic Tests	1
Engineering Education	1
English (Second Language)	1
More ▼

Source

College Entrance Examination…	1
ETS Research Report Series	1
Educational and Psychological…	1
IEEE Transactions on Education	1
Online Submission	1
Research Papers in Education	1

Publication Type

Reports - Research	7
Journal Articles	4
Speeches/Meeting Papers	2
Reports - Evaluative	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1

Audience

Location

Canada	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

Direct link

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

Generalizability of Scaling Gradients on Direct Behavior Ratings

Peer reviewed

Direct link

Chafouleas, Sandra M.; Christ, Theodore J.; Riley-Tillman, T. Chris – Educational and Psychological Measurement, 2009

Generalizability theory is used to examine the impact of scaling gradients on a single-item Direct Behavior Rating (DBR). A DBR refers to a type of rating scale used to efficiently record target behavior(s) following an observation occasion. Variance components associated with scale gradients are estimated using a random effects design for persons…

Descriptors: Generalizability Theory, Undergraduate Students, Scaling, Rating Scales

A Simulation Study to Explore Configuring the New SAT® Critical Reading Section without Analogy Items. Research Report No. 2004-2. ETS RR-04-01

Download full text

Liu, Jinghua; Feigenbaum, Miriam; Cook, Linda – College Entrance Examination Board, 2004

This study explored possible configurations of the new SAT® critical reading section without analogy items. The item pool contained items from SAT verbal (SAT-V) sections of 14 previously administered SAT tests, calibrated using the three-parameter logistic IRT model. Multiple versions of several prototypes that do not contain analogy items were…

Descriptors: College Entrance Examinations, Critical Reading, Logical Thinking, Difficulty Level

A Comparison of Rasch Person Analysis and Robust Estimators.

Smith, Richard M. – 1983

Measurement disturbances, such as guessing, startup, and plodding, often result in an examinee's ability being either over- or under-estimated by the maximum likelihood estimation employed in latent trait psychometric models. Several authors have suggested methods to lessen the impact of unexpected responses on the ability estimation process. This…

Descriptors: Difficulty Level, Error of Measurement, Estimation (Mathematics), Goodness of Fit

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

Anwyll, Steve	1
Bristow, M.	1
Chafouleas, Sandra M.	1
Christ, Theodore J.	1
Cook, Linda	1
Custer, Michael	1
Erkorkmaz, K.	1
Feigenbaum, Miriam	1
Glanville, Matthew	1
He, Qingping	1
Henning, Grant	1
Huissoon, J. P.	1
Jeon, Soo	1
Kim, Jongpil	1
Kim, Sooyeon	1
Liu, Jinghua	1
Moses, Tim	1
Opposs, Dennis	1
Owen, W. S.	1
Riley-Tillman, T. Chris	1
Smith, Richard M.	1
Stubley, G. D.	1
Waslander, S. L.	1
Yoo, Hanwook Henry	1
More ▼