Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Difficulty Level | 6 |
Equated Scores | 6 |
Evaluation Methods | 6 |
Test Items | 5 |
Item Response Theory | 3 |
Comparative Analysis | 2 |
Item Analysis | 2 |
Scoring | 2 |
Statistical Analysis | 2 |
Student Evaluation | 2 |
Test Construction | 2 |
More ▼ |
Source
ACT, Inc. | 1 |
ETS Research Report Series | 1 |
European Journal of… | 1 |
Ministerial Council on… | 1 |
National Center for Research… | 1 |
Author
Carlson, Alfred B. | 1 |
Chen, Hanwei | 1 |
Cui, Zhongmin | 1 |
Donovan, Jenny | 1 |
Gao, Xiaohong | 1 |
Holland, Paul | 1 |
Hutton, Penny | 1 |
Lennon, Melissa | 1 |
Michaelides, Michalis P. | 1 |
Setiawan, Risky | 1 |
Sinharay, Sandip | 1 |
More ▼ |
Publication Type
Numerical/Quantitative Data | 3 |
Reports - Evaluative | 3 |
Reports - Research | 3 |
Journal Articles | 2 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 6 | 1 |
Audience
Location
Australia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
What Works Clearinghouse Rating
Setiawan, Risky – European Journal of Educational Research, 2019
The purposes of this research are: 1) to compare two equalizing tests conducted with Hebara and Stocking Lord method; 2) to describe the characteristics of each equalizing test method using windows' IRTEQ program. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination…
Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items
Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010
The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level
Michaelides, Michalis P. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006
Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…
Descriptors: Equated Scores, Test Items, Item Response Theory, Evaluation Methods
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…
Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level
Smith, Robert L.; Carlson, Alfred B. – 1995
The feasibility of constructing test forms with practically equivalent cut scores using judges' estimates of item difficulty as target "statistical" specifications was investigated. Test forms with equivalent judgmental cut scores (based on judgments of item difficulty) were assembled using items from six operational forms of the…
Descriptors: Cutting Scores, Decision Making, Difficulty Level, Equated Scores
Wu, Margaret; Donovan, Jenny; Hutton, Penny; Lennon, Melissa – Ministerial Council on Education, Employment, Training and Youth Affairs (NJ1), 2008
In July 2001, the Ministerial Council on Education, Employment, Training and Youth Affairs (MCEETYA) agreed to the development of assessment instruments and key performance measures for reporting on student skills, knowledge and understandings in primary science. It directed the newly established Performance Measurement and Reporting Taskforce…
Descriptors: Foreign Countries, Scientific Literacy, Science Achievement, Comparative Analysis