Showing 1 to 15 of 111 results
Peer reviewed
Direct link
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potentially damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
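For readers unfamiliar with the approach, effort-moderated scoring (Wise & DeMars, 2006) treats responses flagged as rapid guesses as if those items had not been administered, so they drop out of the IRT likelihood. Below is a minimal sketch, assuming a 2PL model, numpy arrays of responses and response times, and a fixed per-item response-time threshold; the threshold rule and variable names are illustrative, not taken from the study.

    import numpy as np
    from scipy.optimize import minimize_scalar

    def em_loglik(theta, responses, rts, a, b, rt_threshold):
        """Effort-moderated 2PL log-likelihood: rapid-guess responses
        (response time below the item threshold) are excluded, as if
        those items had not been administered."""
        effort = rts >= rt_threshold          # 1 = effortful, 0 = rapid guess
        p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
        ll = effort * (responses * np.log(p) + (1 - responses) * np.log(1 - p))
        return ll.sum()

    def em_theta(responses, rts, a, b, rt_threshold):
        """Maximum-likelihood effort-moderated ability estimate (sketch)."""
        res = minimize_scalar(lambda t: -em_loglik(t, responses, rts, a, b, rt_threshold),
                              bounds=(-4, 4), method="bounded")
        return res.x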
Peer reviewed
Direct link
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Peer reviewed
Direct link
Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022
The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…
Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences
Peer reviewed
Direct link
Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021
Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…
Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring
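The ipsativity problem mentioned above is easy to see with classical forced-choice scoring: when each block is ranked and rank points are summed per trait, every respondent ends up with the same total across traits, so only within-person comparisons are meaningful. A minimal illustration, with a hypothetical block structure and hypothetical rankings:

    # Two respondents rank the same 3 blocks; each block has one item per trait (A, B, C).
    # Rank points: most-like-me = 3 ... least-like-me = 1 (a common classical scheme).
    ranks_person1 = [{"A": 3, "B": 2, "C": 1},
                     {"A": 3, "B": 1, "C": 2},
                     {"A": 2, "B": 3, "C": 1}]
    ranks_person2 = [{"A": 1, "B": 2, "C": 3},
                     {"A": 2, "B": 1, "C": 3},
                     {"A": 1, "B": 3, "C": 2}]

    def trait_totals(blocks):
        """Sum rank points per trait across blocks (classical forced-choice scoring)."""
        totals = {}
        for block in blocks:
            for trait, pts in block.items():
                totals[trait] = totals.get(trait, 0) + pts
        return totals

    print(trait_totals(ranks_person1))  # {'A': 8, 'B': 6, 'C': 4} -> sums to 18
    print(trait_totals(ranks_person2))  # {'A': 4, 'B': 6, 'C': 8} -> also sums to 18

Because both profiles sum to the same constant, a high score on one trait necessarily comes at the expense of another, which is what makes interindividual comparisons problematic.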
Peer reviewed
Direct link
Almehrizi, Rashid S. – Applied Measurement in Education, 2021
KR-21 reliability and its extension (coefficient alpha) give the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…
Descriptors: Test Reliability, Scores, Scoring, Computation
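As a reminder of the quantities involved, KR-21 uses only the number of items and the mean and variance of the summed scores, while coefficient alpha additionally requires the item variances. A sketch of both, assuming dichotomous items scored 0/1 and numpy arrays as input (variable names are illustrative):

    import numpy as np

    def kr21(total_scores, k):
        """KR-21: k/(k-1) * (1 - M*(k-M)/(k*S^2)), from the test mean and variance only."""
        m, s2 = np.mean(total_scores), np.var(total_scores, ddof=1)
        return k / (k - 1) * (1 - m * (k - m) / (k * s2))

    def coefficient_alpha(item_matrix):
        """Coefficient alpha: k/(k-1) * (1 - sum of item variances / total-score variance)."""
        k = item_matrix.shape[1]
        item_vars = item_matrix.var(axis=0, ddof=1).sum()
        total_var = item_matrix.sum(axis=1).var(ddof=1)
        return k / (k - 1) * (1 - item_vars / total_var)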
Peer reviewed
Direct link
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is the sentence rather than the gap or the passage. That is, the gaps correctly reformulated in each sentence were aggregated into a sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
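The sentence-level scoring described above amounts to summing the correctly restored gaps within each sentence and treating that sum as a polytomous item score (a sentence with m gaps yields a score from 0 to m). A minimal sketch of the aggregation step, with hypothetical data; the study's own data preparation may differ:

    # Hypothetical gap-level scores for one examinee on one C-Test passage:
    # each tuple is (sentence_id, gap_correct), with 1 = gap restored correctly.
    gap_scores = [(1, 1), (1, 0), (1, 1),          # sentence 1: 3 gaps
                  (2, 1), (2, 1), (2, 1), (2, 0),  # sentence 2: 4 gaps
                  (3, 0), (3, 1)]                  # sentence 3: 2 gaps

    def sentence_scores(gaps):
        """Aggregate gap-level 0/1 scores into polytomous sentence scores."""
        totals = {}
        for sent, correct in gaps:
            totals[sent] = totals.get(sent, 0) + correct
        return totals

    print(sentence_scores(gap_scores))  # {1: 2, 2: 3, 3: 1} -> analyzed as polytomous items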
Peer reviewed
PDF on ERIC Download full text
Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023
The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…
Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability
Peer reviewed
PDF on ERIC Download full text
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Peer reviewed
PDF on ERIC Download full text
Myszkowski, Nils – Journal of Intelligence, 2020
Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version consisting of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…
Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory
Peer reviewed
PDF on ERIC Download full text
Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020
With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…
Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics
Peer reviewed
Direct link
Slepkov, Aaron D.; Godfrey, Alan T. K. – Applied Measurement in Education, 2019
The answer-until-correct (AUC) method of multiple-choice (MC) testing involves test respondents making selections until the keyed answer is identified. Despite attendant benefits that include improved learning, broad student adoption, and facile administration of partial credit, the use of AUC methods for classroom testing has been extremely…
Descriptors: Multiple Choice Tests, Test Items, Test Reliability, Scores
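A common way to award partial credit under answer-until-correct testing is to reduce the credit with each additional selection needed to reach the keyed answer. The exact weights vary by instructor, so the scheme below is only an illustrative assumption, not necessarily the one examined in the article:

    def auc_partial_credit(attempts_to_correct, n_options=4, weights=None):
        """Partial credit for an answer-until-correct item.

        attempts_to_correct: how many selections the examinee made before
        (and including) hitting the keyed answer. With the default weights,
        1 attempt -> full credit, later attempts -> progressively less.
        """
        if weights is None:
            weights = [1.0, 0.5, 0.25, 0.0]   # illustrative scheme, not from the article
        attempts = min(attempts_to_correct, n_options)
        return weights[attempts - 1]

    # Hypothetical record of attempts across five items for one examinee:
    total = sum(auc_partial_credit(a) for a in [1, 1, 2, 3, 1])
    print(total)  # 3.75 out of a maximum of 5.0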
Benton, Tom – Cambridge Assessment, 2018
One of the questions with the longest history in educational assessment is whether it is possible to increase the reliability of a test simply by altering the way in which scores on individual test items are combined to make the overall test score. Most often, the score available on each item is communicated to the candidate within a question…
Descriptors: Test Items, Scoring, Predictive Validity, Test Reliability
Peer reviewed
Direct link
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory, in order to draw attention to potential issues that may arise when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Peer reviewed
Direct link
International Journal of Testing, 2018
The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…
Descriptors: Translation, Test Construction, Testing, Scoring
Ji-young Shin – ProQuest LLC, 2021
The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are important for the validity and reliability of L2 EI tests, but less is known about them (Yan et al., 2016). Prompt linguistic features are also known…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics