Martin, Michael O., Ed.; von Davier, Matthias, Ed.; Mullis, Ina V. S., Ed. – International Association for the Evaluation of Educational Achievement, 2020
The chapters in this online volume comprise the TIMSS & PIRLS International Study Center's technical report of the methods and procedures used to develop, implement, and report the results of TIMSS 2019. There were various technical challenges because TIMSS 2019 was the initial phase of the transition to eTIMSS, with approximately half the…
Descriptors: Foreign Countries, Elementary Secondary Education, Achievement Tests, International Assessment
Han, Kyung T.; Wells, Craig S.; Hambleton, Ronald K. – Practical Assessment, Research & Evaluation, 2015
In item response theory test scaling/equating with the three-parameter model, the scaling coefficients A and B have no impact on the c-parameter estimates of the test items, since the c-parameter estimates are not adjusted in the scaling/equating procedure. The main research question in this study concerned how serious the consequences would be if…
Descriptors: Item Response Theory, Monte Carlo Methods, Scaling, Test Items
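The invariance the abstract refers to can be sketched directly from the 3PL model: under the linear scale transformation defined by coefficients A and B, the discrimination and difficulty parameters are rescaled while c is left untouched, and the response probability is unchanged. A minimal illustration (item parameters and A, B values here are illustrative, not taken from the study):

```python
import math

def p3pl(theta, a, b, c):
    """Three-parameter logistic probability of a correct response."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Illustrative item parameters on the base scale.
a, b, c = 1.2, 0.5, 0.2

# Illustrative linear scaling coefficients A and B.
A, B = 1.1, -0.3

# Transform the ability and the item parameters to the new scale.
theta = 0.8
theta_new = A * theta + B
a_new, b_new, c_new = a / A, A * b + B, c  # c is not adjusted

p_old = p3pl(theta, a, b, c)
p_new = p3pl(theta_new, a_new, b_new, c_new)
assert abs(p_old - p_new) < 1e-12  # probabilities agree on both scales
```

Because a(θ − b) = (a/A)(Aθ + B − (Ab + B)), the model-implied probability is identical on both scales, which is why the scaling step never touches c.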
Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017
Different item formats are often combined in large-scale assessments, and dimensionality analyses frequently indicate that such tests are multi-dimensional with respect to task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…
Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items, and a common-item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
Linacre, John Michael – 1991
A rating scale can be expressed as a chain of dichotomous items. The relationship between the dichotomies depends on the manner in which the rating scale is presented to the test taker. Three models for ordered scales are discussed. In the success model, which represents growth, the lowest or easiest category is presented first. If the test taker…
Descriptors: Difficulty Level, Equations (Mathematics), Mathematical Models, Rating Scales
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
Bay, Luz – 1998
A study was conducted to investigate the difference in student performance on multiple choice (MC) and constructed response (CR) items relative to the achievement levels of the National Assessment of Educational Progress (NAEP). The study included an investigation of how estimates of student performance were affected by item response theory (IRT)…
Descriptors: Academic Achievement, Comparative Analysis, Constructed Response, Cutting Scores
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions that arise as test practitioners monitor score scales derived from latent trait theory. Large-scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores