Publication Date
In 2025 | 0
Since 2024 | 2
Since 2021 (last 5 years) | 18
Since 2016 (last 10 years) | 27
Since 2006 (last 20 years) | 46
Author
Anderson, Paul S. | 4
Ellington, Henry | 3
Wind, Stefanie A. | 3
Guo, Wenjing | 2
Kim, Doyoung | 2
Kim, Sooyeon | 2
Martinez, Michael E. | 2
Ward, William C. | 2
Akyildiz, Murat | 1
Alderson, J. Charles | 1
Ali, Usama S. | 1
Education Level
Secondary Education | 8
Elementary Secondary Education | 7
Higher Education | 7
Postsecondary Education | 7
Elementary Education | 6
Intermediate Grades | 5
High Schools | 4
Grade 4 | 3
Grade 8 | 3
Grade 3 | 2
Junior High Schools | 2
Audience
Teachers | 8
Students | 6
Practitioners | 5
Parents | 4
Administrators | 1
Location
Arizona | 5
Louisiana | 3
Canada | 2
Asia | 1
Bhutan | 1
Cambodia | 1
Hong Kong | 1
Israel | 1
Malaysia | 1
Maryland | 1
Mongolia | 1
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, test lengths, and numbers and locations of polytomous items. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
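The entry above evaluates designs by measurement precision, classification accuracy, and test security. A minimal sketch of how such simulation outcomes are often summarized (the data and variable names below are hypothetical, not from the study):

```python
import numpy as np

# Hypothetical output of a simulated mixed-format CAT: which items each
# examinee was administered, plus true and estimated classifications.
rng = np.random.default_rng(1)
admin = rng.integers(0, 2, size=(500, 40))    # examinees x items, 1 = administered
true_class = rng.integers(0, 2, size=500)     # true pass/fail status
est_class = true_class.copy()
est_class[rng.choice(500, size=25, replace=False)] ^= 1   # 5% misclassified

classification_accuracy = np.mean(true_class == est_class)
exposure_rates = admin.mean(axis=0)           # per-item exposure, a security index
print(f"accuracy={classification_accuracy:.3f}, "
      f"max exposure={exposure_rates.max():.2f}")
```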
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
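As a rough illustration of the floor effects at issue here, one can compute the share of examinees scoring at or below the chance level of a multiple-choice test; everything below is simulated and purely illustrative:

```python
import numpy as np

n_items, n_options = 40, 4
chance_score = n_items / n_options          # expected score from blind guessing

rng = np.random.default_rng(3)
scores = rng.binomial(n_items, 0.32, size=5000)   # hypothetical low-performing cohort

floor_rate = np.mean(scores <= chance_score)      # examinees the test cannot rank
print(f"{floor_rate:.1%} of examinees score at or below chance")
```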
Emma Walland – Research Matters, 2024
GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…
Descriptors: Educational Change, Test Items, Item Analysis, Scoring
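One simple way to operationalize speededness from scored response data alone is to count each examinee's trailing run of unanswered items as "not reached"; the coding scheme below (None = no response) is an assumption for illustration, not necessarily the study's:

```python
def not_reached(responses):
    """Length of the trailing run of missing responses for one examinee."""
    n = 0
    for r in reversed(responses):
        if r is not None:
            break
        n += 1
    return n

paper = [1, 0, 1, 1, 0, None, None, None]   # examinee stopped with 3 items left
print(not_reached(paper))                    # -> 3
# A high average not-reached count suggests the paper is speeded.
```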
Guo, Wenjing; Wind, Stefanie A. – Journal of Educational Measurement, 2021
The use of mixed-format tests made up of multiple-choice (MC) items and constructed response (CR) items is popular in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district- and state-level assessments in the United States. Rater effects, or raters' scoring tendencies that result in…
Descriptors: Test Format, Multiple Choice Tests, Scoring, Test Items
Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021
Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…
Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring
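The ipsativity problem mentioned above is easy to demonstrate: under a simple forced-choice scoring rule where the chosen statement's trait gets +1, every respondent's trait scores sum to the same constant. The scoring rule and data below are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n_traits, n_blocks, n_people = 4, 12, 5
# Each block pairs two distinct traits; the respondent picks one statement.
blocks = [rng.choice(n_traits, size=2, replace=False) for _ in range(n_blocks)]

scores = np.zeros((n_people, n_traits))
for person in range(n_people):
    for i, j in blocks:
        scores[person, i if rng.random() < 0.5 else j] += 1

# Every row sums to n_blocks, so scores convey only within-person rank order,
# which is what makes interindividual comparisons problematic.
print(scores.sum(axis=1))   # -> [12. 12. 12. 12. 12.]
```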
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
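For reference, the first model in the comparison above, the generalized partial credit model, has a closed form that is straightforward to compute; a minimal sketch, assuming the common a*(theta - b_k) step parameterization:

```python
import numpy as np

def gpcm_probs(theta, a, steps):
    """Category probabilities for one GPCM item with len(steps)+1 categories.

    theta : latent trait value; a : discrimination; steps : step parameters b_k.
    """
    # Exponents are cumulative sums of a*(theta - b_k); category 0 gets 0.
    z = np.concatenate(([0.0], np.cumsum(a * (theta - np.asarray(steps)))))
    z -= z.max()                     # stabilize before exponentiating
    p = np.exp(z)
    return p / p.sum()

print(gpcm_probs(0.5, a=1.2, steps=[-1.0, 0.0, 1.5]))   # 4-category item
```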
Wind, Stefanie A.; Guo, Wenjing – Educational Assessment, 2021
Scoring procedures for the constructed-response (CR) items in large-scale mixed-format educational assessments often involve checks for rater agreement or rater reliability. Although these analyses are important, researchers have documented rater effects that persist despite rater training and that are not always detected in rater agreement and…
Descriptors: Scoring, Responses, Test Items, Test Format
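The study's core point, that agreement statistics can mask systematic rater effects, shows up even in a toy example: a rater who is uniformly harsher by one scale point still attains perfect adjacent agreement. The data below are fabricated for illustration only:

```python
import numpy as np

rng = np.random.default_rng(5)
quality = rng.integers(1, 5, size=200)        # "true" essay quality on a 1-4 scale
rater_a = quality
rater_b = np.clip(quality - 1, 1, 4)          # systematically more severe

print("exact agreement:   ", np.mean(rater_a == rater_b))               # ~0.25
print("adjacent agreement:", np.mean(np.abs(rater_a - rater_b) <= 1))   # 1.0
print("severity gap:      ", rater_a.mean() - rater_b.mean())           # ~0.75
```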
Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022
The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…
Descriptors: Scoring, Test Items, Test Format, Raw Scores
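To make the scoring question concrete, here are two of the kinds of raw-score rules such a comparison might include (the rule names and details are illustrative, not necessarily the paper's exact methods):

```python
def score_all_or_nothing(selected, key):
    """Full credit only when the selected options exactly match the key."""
    return 1.0 if set(selected) == set(key) else 0.0

def score_plus_minus(selected, key):
    """+1 per correct selection, -1 per incorrect one, floored at zero,
    then rescaled to 0-1 for partial credit."""
    sel, k = set(selected), set(key)
    return max(len(sel & k) - len(sel - k), 0) / len(k)

key = {"A", "C", "D"}
print(score_all_or_nothing({"A", "C"}, key))   # 0.0
print(score_plus_minus({"A", "C", "B"}, key))  # (2 - 1) / 3 = 0.33...
```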
Kim, Dong-In; Julian, Marc; Hermann, Pam – Online Submission, 2022
In test equating, one critical property is group invariance: the equating function used to convert performance on each alternate form to the reporting scale should be the same across subgroups. To mitigate the impact of disrupted learning on the item parameters during the COVID-19 pandemic, a…
Descriptors: COVID-19, Pandemics, Test Format, Equated Scores
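A quick way to probe group invariance is to compute the equating function separately per subgroup and compare the converted scores. The mean-sigma linear equating below is a standard textbook method used here only as a sketch, with simulated data:

```python
import numpy as np

def linear_equate(x, new_scores, ref_scores):
    """Mean-sigma linear conversion of a new-form score x to the reference scale."""
    return ref_scores.mean() + (ref_scores.std() / new_scores.std()) * (x - new_scores.mean())

rng = np.random.default_rng(7)
for group, shift in [("group A", 0.0), ("group B", 2.0)]:
    new = rng.normal(50 + shift, 10, 2000)   # hypothetical new-form scores
    ref = rng.normal(52 + shift, 9, 2000)    # hypothetical reference-form scores
    # Diverging subgroup conversions of the same raw score signal a violation.
    print(group, round(linear_equate(60.0, new, ref), 2))
```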
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
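For orientation, the baseline operation being improved upon, fitting a conditional probability table to observations, can be done by smoothed maximum-likelihood counting; this is a generic sketch, not the dissertation's proposed technique:

```python
import numpy as np

def fit_cpt(pairs, n_parent_states, n_child_states, alpha=1.0):
    """Estimate P(child | parent) from (parent, child) observations
    with Laplace smoothing (alpha)."""
    counts = np.full((n_parent_states, n_child_states), alpha)
    for parent, child in pairs:
        counts[parent, child] += 1
    return counts / counts.sum(axis=1, keepdims=True)

data = [(0, 0), (0, 0), (0, 1), (1, 1), (1, 1), (1, 0)]
print(fit_cpt(data, n_parent_states=2, n_child_states=2))
```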
Care, Esther; Vista, Alvin; Kim, Helyn – UNESCO Bangkok, 2019
UNESCO's Asia-Pacific Regional Bureau for Education has been working on education quality under the name of 'transversal competencies' (TVC) since 2013. Many of these competencies have been included in national education policy and curricula of countries in the region, but now the importance accorded them is increasingly gaining attention. As…
Descriptors: Foreign Countries, Educational Quality, 21st Century Skills, Competence
Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing
Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022
A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…
Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring