ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	37

Descriptor

Evaluation Methods	65
Test Items	65
Scoring	46
Test Construction	21
Item Response Theory	15
Student Evaluation	15
Foreign Countries	12
Scores	11
Standard Setting (Scoring)	11
Psychometrics	10
Scoring Rubrics	10
Testing	10
Difficulty Level	9
Computer Assisted Testing	8
Interrater Reliability	8
Item Analysis	8
Multiple Choice Tests	8
Test Validity	8
Evaluation Criteria	7
Mathematics Tests	7
Measurement Techniques	7
Computation	6
Criterion Referenced Tests	6
Educational Assessment	6
Elementary Secondary Education	6
More ▼

Publication Type

Journal Articles	37
Reports - Research	29
Speeches/Meeting Papers	15
Reports - Evaluative	12
Reports - Descriptive	10
Guides - Classroom - Teacher	6
Books	2
Dissertations/Theses -…	2
Guides - General	2
Guides - Non-Classroom	2
Information Analyses	2
Multilingual/Bilingual…	2
Numerical/Quantitative Data	2
Opinion Papers	2
Tests/Questionnaires	2
More ▼

Education Level

Elementary Education	6
Higher Education	5
Postsecondary Education	5
Secondary Education	5
Grade 6	4
Grade 8	4
Elementary Secondary Education	3
Junior High Schools	3
Middle Schools	3
Grade 4	2
Early Childhood Education	1
Intermediate Grades	1
More ▼

Audience

Practitioners	6
Teachers	4
Administrators	2

Location

Canada	4
Australia	3
India	2
China	1
Hong Kong	1
Israel	1
Japan	1
Pennsylvania	1
South Korea	1
Taiwan	1
United Kingdom	1
United Kingdom (Great Britain)	1
United States	1
Virginia	1
More ▼

Laws, Policies, & Programs

Education for All Handicapped…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	3
Program for International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 65 results Save | Export

Examination of the Aggregate Scoring Method in a Judgment Concordance Test

Peer reviewed
PDF on ERIC

Download full text

Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023

The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…

Descriptors: Scoring, Tests, Evaluation Methods, Test Items

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Investigation of Rater Tendencies and Reliability in Different Assessment Methods with Many Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Koçak, Duygu – International Electronic Journal of Elementary Education, 2020

One of the most commonly used methods for measuring higher-order thinking skills such as problem-solving or written expression is open-ended items. Three main approaches are used to evaluate responses to open-ended items: general evaluation, rating scales, and rubrics. In order to measure and improve problem-solving skills of students, firstly, an…

Descriptors: Interrater Reliability, Item Response Theory, Test Items, Rating Scales

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Direct link

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

Classroom Assessment in Higher Education

Peer reviewed

Direct link

Rao, N. J.; Banerjee, Shilpi – Higher Education for the Future, 2023

Classroom assessment is the process of documenting the knowledge, skills, attitudes and beliefs of learners. It provides essential feedback to both instructors and students to improve their teaching methods for guiding and motivating students to be actively involved in their learning. Assessment drives learning. Formative assessments enable the…

Descriptors: Higher Education, Student Evaluation, Evaluation Methods, Formative Evaluation

Beyond Agreement: Exploring Rater Effects in Large-Scale Mixed Format Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Guo, Wenjing – Educational Assessment, 2021

Scoring procedures for the constructed-response (CR) items in large-scale mixed-format educational assessments often involve checks for rater agreement or rater reliability. Although these analyses are important, researchers have documented rater effects that persist despite rater training and that are not always detected in rater agreement and…

Descriptors: Scoring, Responses, Test Items, Test Format

Impacts of Scoring Methods on Multiple-Select Multiple-Choice Item Statistics

Direct link

Alicia A. Stoltenberg – ProQuest LLC, 2024

Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…

Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelist's ability to perform well the Bookmark method, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

COVID-19 Impact on Group Invariance Property of Equating

Download full text

Kim, Dong-In; Julian, Marc; Hermann, Pam – Online Submission, 2022

In test equating, one critical equating property is the group invariance property which indicates that the equating function used to convert performance on each alternate form to the reporting scale should be the same for various subgroups. To mitigate the impact of disrupted learning on the item parameters during the COVID-19 pandemic, a…

Descriptors: COVID-19, Pandemics, Test Format, Equated Scores

A Comparison of Score Equating Conducted Using Haebara and Stocking Lord Method for Polytomous

Peer reviewed
PDF on ERIC

Download full text

Setiawan, Risky – European Journal of Educational Research, 2019

The purposes of this research are: 1) to compare two equalizing tests conducted with Hebara and Stocking Lord method; 2) to describe the characteristics of each equalizing test method using windows' IRTEQ program. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination…

Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items

Evaluation of the Impact of Equating Approach on the Parameter and Student Score Stability Using Pre- and Post-Equated Designs in the Post-Pandemic Environment

Download full text

Tomkowicz, Joanna; Kim, Dong-In; Wan, Ping – Online Submission, 2022

In this study we evaluated the stability of item parameters and student scores, using the pre-equated (pre-pandemic) parameters from Spring 2019 and post-equated (post-pandemic) parameters from Spring 2021 in two calibration and equating designs related to item parameter treatment: re-estimating all anchor parameters (Design 1) and holding the…

Descriptors: Equated Scores, Test Items, Evaluation Methods, Pandemics

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Download full text

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

Direct link

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Modification and Validation of the Mixed-Format Engineering Concept Assessment for Middle School Students Using Many-Facet Rasch Measurement

Peer reviewed

Direct link

Koskey, Kristin L. K.; Makki, Nidaa; Ahmed, Wondimu; Garafolo, Nicholas G.; Visco, Donald P., Jr. – School Science and Mathematics, 2020

Integrating engineering into the K-12 science curriculum continues to be a focus in national reform efforts in science education. Although there is an increasing interest in research in and practice of integrating engineering in K-12 science education, to date only a few studies have focused on the development of an assessment tool to measure…

Descriptors: Middle School Students, Engineering, Design, Science Education

Analyzing the Role of Science Practices in ACS Exam Items

Peer reviewed

Direct link

Reed, Jessica J.; Brandriet, Alexandra R.; Holme, Thomas A. – Journal of Chemical Education, 2017

Recent efforts to reform K-12 science curricula, embedded within the "NRC Framework for K-12 Science Education" and the "Next Generation Science Standards," have focused on unifying core disciplinary content with crosscutting concepts that span across science disciplines and scientific practices. With these reforms comes the…

Descriptors: Science Education, Chemistry, Elementary Secondary Education, Science Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational and Psychological…	4
Online Submission	4
Assessment	2
Educational Assessment	2
Educational Measurement:…	2
International Journal of…	2
Journal of Chemical Education	2
Practical Assessment,…	2
ProQuest LLC	2
ASCD	1
Alberta Journal of…	1
Applied Measurement in…	1
Applied Psychological…	1
Clearing House	1
Educational Testing Service	1
Engineering Education	1
English Teaching Forum	1
European Journal of…	1
Evaluation and the Health…	1
Grantee Submission	1
Higher Education for the…	1
Instructional Science	1
International Electronic…	1
International Journal of…	1
International Online Journal…	1
More ▼

Hambleton, Ronald K.	3
Reckase, Mark D.	3
Friedman, Greg	2
Kim, Dong-In	2
McGinty, Dixie	2
Michaels, Hillary	2
Neel, John H.	2
Ochieng, Charles	2
Yen, Shu Jing	2
Ahmed, Wondimu	1
Alicia A. Stoltenberg	1
Babcock, Ben	1
Bakla, Arif	1
Banerjee, Shilpi	1
Bhaskar, R.	1
Birenbaum, Menucha	1
Boccaccini, Marcus T.	1
Bohn, Larry	1
Brandriet, Alexandra R.	1
Brookhart, Susan M.	1
Burfitt, Joan	1
Chernyshenko, Oleksandr S.	1
Coker, Donald R.	1
Davey, Tim	1
More ▼