ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Difficulty Level	6
Equated Scores	6
Evaluation Methods	6
Test Items	5
Item Response Theory	3
Comparative Analysis	2
Item Analysis	2
Scoring	2
Statistical Analysis	2
Student Evaluation	2
Test Construction	2
Ability Grouping	1
Academic Standards	1
Benchmarking	1
Computation	1
Consultants	1
Context Effect	1
Correlation	1
Cutting Scores	1
Data Analysis	1
Data Collection	1
Decision Making	1
Educational Assessment	1
Educational Indicators	1
Educational Objectives	1
More ▼

Source

ACT, Inc.	1
ETS Research Report Series	1
European Journal of…	1
Ministerial Council on…	1
National Center for Research…	1

Author

Carlson, Alfred B.	1
Chen, Hanwei	1
Cui, Zhongmin	1
Donovan, Jenny	1
Gao, Xiaohong	1
Holland, Paul	1
Hutton, Penny	1
Lennon, Melissa	1
Michaelides, Michalis P.	1
Setiawan, Risky	1
Sinharay, Sandip	1
Smith, Robert L.	1
Wu, Margaret	1
Zhu, Rongchun	1
More ▼

Publication Type

Numerical/Quantitative Data	3
Reports - Evaluative	3
Reports - Research	3
Journal Articles	2

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 6	1

Audience

Location

Australia

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

A Comparison of Score Equating Conducted Using Haebara and Stocking Lord Method for Polytomous

Peer reviewed
PDF on ERIC

Download full text

Setiawan, Risky – European Journal of Educational Research, 2019

The purposes of this research are: 1) to compare two equalizing tests conducted with Hebara and Stocking Lord method; 2) to describe the characteristics of each equalizing test method using windows' IRTEQ program. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination…

Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Download full text

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

Effects of Misbehaving Common Items on Aggregate Scores and an Application of the Mantel-Haenszel Statistic in Test Equating. CSE Report 688

Download full text

Michaelides, Michalis P. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006

Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…

Descriptors: Equated Scores, Test Items, Item Response Theory, Evaluation Methods

Choice of Anchor Test in Equating. Research Report. ETS RR-06-35

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006

It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…

Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level

Using Judgmental Estimates of Item Difficulty To Assemble Test Forms with Equivalent Cut Scores. Research Memorandum.

Download full text

Smith, Robert L.; Carlson, Alfred B. – 1995

The feasibility of constructing test forms with practically equivalent cut scores using judges' estimates of item difficulty as target "statistical" specifications was investigated. Test forms with equivalent judgmental cut scores (based on judgments of item difficulty) were assembled using items from six operational forms of the…

Descriptors: Cutting Scores, Decision Making, Difficulty Level, Equated Scores

National Assessment Program--Science Literacy Year 6 Technical Report, 2006

Download full text

Wu, Margaret; Donovan, Jenny; Hutton, Penny; Lennon, Melissa – Ministerial Council on Education, Employment, Training and Youth Affairs (NJ1), 2008

In July 2001, the Ministerial Council on Education, Employment, Training and Youth Affairs (MCEETYA) agreed to the development of assessment instruments and key performance measures for reporting on student skills, knowledge and understandings in primary science. It directed the newly established Performance Measurement and Reporting Taskforce…

Descriptors: Foreign Countries, Scientific Literacy, Science Achievement, Comparative Analysis