Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
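The aggregate scoring idea described above can be sketched in Python. The convention used here — each response option weighted by its share of expert endorsements, scaled so the modal option earns full credit — is a common one for concordance tests; it is an assumption for illustration, not necessarily the exact scheme the authors evaluate.

```python
from collections import Counter

def aggregate_weights(expert_choices):
    """Derive option weights for one item from an expert panel's
    responses: each option's weight is its endorsement count
    divided by the modal (most-chosen) option's count."""
    counts = Counter(expert_choices)
    modal = max(counts.values())
    return {option: n / modal for option, n in counts.items()}

def item_score(weights, examinee_choice):
    """An examinee earns the weight of the option they chose;
    options no expert endorsed earn zero credit."""
    return weights.get(examinee_choice, 0.0)
```

For example, if 7 of 10 experts choose "+1", 2 choose "0", and 1 chooses "-1", an examinee answering "0" earns 2/7 of a point — which is exactly why the panel's composition and average performance matter so much.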
van Rijn, Peter W.; Attali, Yigal; Ali, Usama S. – Journal of Experimental Education, 2023
We investigated whether and to what extent different scoring instructions, timing conditions, and direct feedback affect performance and speed. An experimental study manipulating these factors was designed to address these research questions. According to the factorial design, participants were randomly assigned to one of twelve study conditions.…
Descriptors: Scoring, Time, Feedback (Response), Performance
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
Almehrizi, Rashid S. – Applied Measurement in Education, 2021
KR-21 reliability and its extension (coefficient α) give the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…
Descriptors: Test Reliability, Scores, Scoring, Computation
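The KR-21 estimate mentioned above depends only on the number of items k, the mean, and the variance of the summed scores. A minimal sketch, using the standard formula r = k/(k−1) · (1 − M(k−M)/(k·σ²)); whether to use population or sample variance is a convention choice, and the population form is used here:

```python
def kr21(total_scores, k):
    """KR-21 reliability for summed scores on k dichotomous items,
    assuming equal item difficulty (randomly parallel forms)."""
    n = len(total_scores)
    mean = sum(total_scores) / n
    # Population variance of the summed scores.
    var = sum((x - mean) ** 2 for x in total_scores) / n
    return (k / (k - 1)) * (1 - mean * (k - mean) / (k * var))
```

Because KR-21 substitutes the mean difficulty for the per-item difficulties that coefficient α uses, it typically gives a lower bound on α when item difficulties vary.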
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated into a sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
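The sentence-level aggregation this entry proposes is simple to express: dichotomous gap scores are summed within each sentence, yielding one polytomous score per sentence. A minimal sketch, with data shapes assumed for illustration:

```python
from collections import defaultdict

def sentence_polytomous_scores(gap_scores, gap_to_sentence):
    """Sum gap-level scores (1 = gap correctly restored, 0 = not)
    within each sentence to form polytomous sentence scores."""
    totals = defaultdict(int)
    for gap_id, correct in gap_scores.items():
        totals[gap_to_sentence[gap_id]] += int(correct)
    return dict(totals)
```

Each sentence's score then ranges from 0 to the number of gaps it contains, which is what lets it be modeled as a polytomous item rather than a bundle of dependent dichotomous ones.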
Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023
The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…
Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items
Myszkowski, Nils – Journal of Intelligence, 2020
Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…
Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory
Clements, Douglas H.; Banse, Holland; Sarama, Julie; Tatsuoka, Curtis; Joswick, Candace; Hudyma, Aaron; Van Dine, Douglas W.; Tatsuoka, Kikumi K. – Mathematical Thinking and Learning: An International Journal, 2022
Researchers often develop instruments using correctness scores (and a variety of theories and techniques, such as Item Response Theory) for validation and scoring. Less frequently, observations of children's strategies are incorporated into the design, development, and application of assessments. We conducted individual interviews of 833…
Descriptors: Item Response Theory, Computer Assisted Testing, Test Items, Mathematics Tests
Palermo, Corey; Bunch, Michael B.; Ridge, Kirk – Journal of Educational Measurement, 2019
Although much attention has been given to rater effects in rater-mediated assessment contexts, little research has examined the overall stability of leniency and severity effects over time. This study examined longitudinal scoring data collected during three consecutive administrations of a large-scale, multi-state summative assessment program.…
Descriptors: Scoring, Interrater Reliability, Measurement, Summative Evaluation
Slepkov, Aaron D.; Godfrey, Alan T. K. – Applied Measurement in Education, 2019
The answer-until-correct (AUC) method of multiple-choice (MC) testing involves test respondents making selections until the keyed answer is identified. Despite attendant benefits that include improved learning, broad student adoption, and facile administration of partial credit, the use of AUC methods for classroom testing has been extremely…
Descriptors: Multiple Choice Tests, Test Items, Test Reliability, Scores
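Partial credit under answer-until-correct testing is commonly awarded as a linear function of the number of attempts used. The scheme below is one such convention, assumed for illustration rather than taken from the article:

```python
def auc_partial_credit(num_options, attempts_used):
    """Linear partial credit for an answer-until-correct item:
    1.0 on the first attempt, declining to 0.0 when every
    option had to be tried before the key was found."""
    if not 1 <= attempts_used <= num_options:
        raise ValueError("attempts_used must lie in [1, num_options]")
    return (num_options - attempts_used) / (num_options - 1)
```

On a four-option item this awards 1, 2/3, 1/3, and 0 for the first through fourth attempts, which is how AUC testing turns an all-or-nothing multiple-choice item into a graded one.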
Foster, Colin – International Journal of Science and Mathematics Education, 2022
Confidence assessment (CA) involves students stating alongside each of their answers a confidence rating (e.g. 0 low to 10 high) to express how certain they are that their answer is correct. Each student's score is calculated as the sum of the confidence ratings on the items that they answered correctly, minus the sum of the confidence ratings on…
Descriptors: Mathematics Tests, Mathematics Education, Secondary School Students, Meta Analysis
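The confidence assessment score this entry defines — confidence summed over correct answers minus confidence summed over incorrect ones — is direct to compute:

```python
def confidence_assessment_score(responses):
    """responses: iterable of (is_correct, confidence) pairs,
    with confidence rated 0 (low) to 10 (high). Confidence on
    correct answers is added; on incorrect answers, subtracted."""
    return sum(conf if correct else -conf for correct, conf in responses)
```

The subtraction is what discourages blind overconfidence: a student who is certain of a wrong answer loses as many points as a certain-and-correct student gains.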
Neitzel, Jennifer; Early, Diane; Sideris, John; LaForrett, Doré; Abel, Michael B.; Soli, Margaret; Davidson, Dawn L.; Haboush-Deloye, Amanda; Hestenes, Linda L.; Jenson, Denise; Johnson, Cindy; Kalas, Jennifer; Mamrak, Angela; Masterson, Marie L.; Mims, Sharon U.; Oya, Patti; Philson, Bobbi; Showalter, Megan; Warner-Richter, Mallory; Kortright Wood, Jill – Journal of Early Childhood Research, 2019
The Early Childhood Environment Rating Scales, including the "Early Childhood Environment Rating Scale--Revised" (Harms et al., 2005) and the "Early Childhood Environment Rating Scale, Third Edition" (Harms et al., 2015) are the most widely used observational assessments in early childhood learning environments. The most recent…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Scoring
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring