ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	47
Since 2007 (last 20 years)	80

Descriptor

Test Items	148
Test Reliability	148
Scoring	114
Test Validity	82
Test Construction	68
Item Response Theory	37
Item Analysis	34
Psychometrics	33
Difficulty Level	29
Multiple Choice Tests	28
Scoring Formulas	25
Testing	25
Scores	24
Achievement Tests	20
Mathematics Tests	19
Foreign Countries	18
Test Bias	18
Test Format	17
Computer Assisted Testing	16
Higher Education	15
Interrater Reliability	15
Comparative Analysis	13
Language Tests	13
Scoring Rubrics	13
Test Interpretation	13
More ▼

Publication Type

Journal Articles	82
Reports - Research	80
Reports - Evaluative	33
Reports - Descriptive	17
Speeches/Meeting Papers	17
Tests/Questionnaires	13
Numerical/Quantitative Data	11
Guides - Non-Classroom	6
Information Analyses	3
Opinion Papers	3
Collected Works - General	2
Guides - Classroom - Teacher	2
Books	1
Dissertations/Theses -…	1
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Secondary Education	20
Elementary Education	16
Higher Education	14
High Schools	12
Postsecondary Education	11
Early Childhood Education	9
Elementary Secondary Education	9
Middle Schools	9
Junior High Schools	7
Primary Education	7
Intermediate Grades	6
Grade 4	5
Grade 3	4
Grade 5	4
Grade 7	4
Kindergarten	4
Grade 1	3
Grade 2	3
Grade 6	3
Grade 8	3
Grade 9	2
Adult Education	1
More ▼

Audience

Practitioners	5
Teachers	3
Researchers	2

Location

Florida	5
Nebraska	5
California	4
New Mexico	3
Canada	2
Germany	2
Ohio	2
Turkey	2
Alabama	1
China	1
Idaho	1
Iran	1
Israel	1
Maryland	1
Nebraska (Lincoln)	1
New York	1
North Dakota	1
Oman	1
Taiwan	1
Tennessee	1
Texas	1
United Kingdom (England)	1
United Kingdom (London)	1
Washington	1
West Virginia	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 148 results Save | Export

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Direct link

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Differential Item Functioning Analysis of the Fundamental Concepts for Organic Reaction Mechanisms Inventory

Peer reviewed

Direct link

Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022

The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…

Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences

Can High-Dimensional Questionnaires Resolve the Ipsativity Issue of Forced-Choice Response Formats?

Peer reviewed

Direct link

Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021

Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…

Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring

The Competent Computational Thinking Test (cCTt): A Valid, Reliable and Gender-Fair Test for Longitudinal CT Studies in Grades 3-6

Peer reviewed

Direct link

Laila El-Hamamsy; María Zapata-Cáceres; Estefanía Martín-Barroso; Francesco Mondada; Jessica Dehler Zufferey; Barbara Bruno; Marcos Román-González – Technology, Knowledge and Learning, 2025

The introduction of computing education into curricula worldwide requires multi-year assessments to evaluate the long-term impact on learning. However, no single Computational Thinking (CT) assessment spans primary school, and no group of CT assessments provides a means of transitioning between instruments. This study therefore investigated…

Descriptors: Cognitive Tests, Computation, Thinking Skills, Test Validity

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Coefficient [beta] as Extension of KR-21 Reliability for Summed and Scaled Scores for Polytomously-Scored Tests

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Measurement in Education, 2021

KR-21 reliability and its extension (coefficient [alpha]) gives the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…

Descriptors: Test Reliability, Scores, Scoring, Computation

Development and Validity Testing of the School Health Score Card

Peer reviewed

Direct link

Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018

Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…

Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Development of a Protein Concept Inventory: A Proposal for Item Scoring and Responding

Peer reviewed
PDF on ERIC

Download full text

Güntay Tasçi – Science Insights Education Frontiers, 2024

The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…

Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology

A Mokken Scale Analysis of the Last Series of the Standard Progressive Matrices (SPM-LS)

Peer reviewed
PDF on ERIC

Download full text

Myszkowski, Nils – Journal of Intelligence, 2020

Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item long measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…

Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Examining the Achievement Test Development Process in the Educational Studies

Peer reviewed
PDF on ERIC

Download full text

Sahin, Melek Gülsah; Yildirim, Yildiz; Boztunç Öztürk, Nagihan – Participatory Educational Research, 2023

Literature review shows that the development process of an achievement test is mainly investigated in dissertations. Moreover, preparing a form that will shed light on developing an achievement test is expected to guide those who will administer the test. In this line, the current study aims to create an "Achievement Test Development Process…

Descriptors: Achievement Tests, Test Construction, Records (Forms), Mathematics Achievement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Journal of Psychoeducational…	9
Applied Psychological…	6
ETS Research Report Series	5
Grantee Submission	5
Educational and Psychological…	4
Nebraska Department of…	4
Online Submission	4
Applied Measurement in…	3
Journal of Educational…	3
Psychometrika	3
Assessment & Evaluation in…	2
Educational Assessment	2
Educational Measurement:…	2
Evaluation and the Health…	2
International Journal of…	2
Journal of Chemical Education	2
Journal of Experimental…	2
Language Testing	2
New Meridian Corporation	2
New Mexico Public Education…	2
OECD Publishing	2
ACT, Inc.	1
Advances in Health Sciences…	1
American Journal of…	1
American Language Review	1
More ▼

Schoen, Robert C.	7
Yang, Xiaotong	4
Anderson, Daniel	3
Bauduin, Charity	3
Burton, Richard F.	3
Paek, Insu	3
Stansfield, Charles W.	3
Dorans, Neil J.	2
Downey, Ronald G.	2
Guo, Hongwen	2
Haladyna, Thomas M.	2
Huynh, Huynh	2
Liu, Sicong	2
Schrader, William B.	2
Segall, Daniel O.	2
Slepkov, Aaron D.	2
Ahmed, Wondimu	1
Aiken, Lewis R.	1
Albanese, Mark A.	1
Alderson, J. Charles	1
Aleamoni, Lawrence M.	1
Almehrizi, Rashid S.	1
Alqarni, Abdulelah Mohammed	1
Anderson, Paul S.	1
More ▼

SAT (College Admission Test)	4
ACT Assessment	3
Program for International…	2
Raven Progressive Matrices	2
Test of English as a Foreign…	2
ACT Interest Inventory	1
Advanced Placement…	1
Alberta Grade Twelve Diploma…	1
Autism Diagnostic Observation…	1
Clinical Evaluation of…	1
Comprehensive Tests of Basic…	1
Computer Attitude Scale	1
Cornell Critical Thinking Test	1
Dynamic Indicators of Basic…	1
Graduate Management Admission…	1
Graduate Record Examinations	1
International Association for…	1
International English…	1
Kaufman Test of Educational…	1
Matching Familiar Figures Test	1
National Assessment of…	1
Preliminary Scholastic…	1
Progress in International…	1
Strengths and Difficulties…	1
Teaching and Learning…	1
More ▼