ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	42
Since 2017 (last 10 years)	98
Since 2007 (last 20 years)	249

Descriptor

Testing	249
Test Reliability	164
Test Validity	124
Reliability	76
Test Construction	71
Foreign Countries	68
Scoring	64
Scores	55
Language Tests	44
Validity	39
Psychometrics	38
Student Evaluation	37
Test Items	32
Academic Achievement	31
English (Second Language)	31
Evaluation Methods	31
Item Response Theory	30
Second Language Learning	26
Comparative Analysis	25
Correlation	24
Test Bias	24
Computer Assisted Testing	23
Error of Measurement	22
Measures (Individuals)	20
Interrater Reliability	19
More ▼

Publication Type

Journal Articles	186
Reports - Research	115
Reports - Evaluative	64
Reports - Descriptive	41
Numerical/Quantitative Data	17
Tests/Questionnaires	12
Dissertations/Theses -…	9
Opinion Papers	9
Guides - Non-Classroom	7
Information Analyses	6
Books	5
Guides - Classroom - Teacher	4
Speeches/Meeting Papers	4
Guides - General	2
Collected Works - General	1
Collected Works - Proceedings	1
Dissertations/Theses -…	1
More ▼

Education Level

Higher Education	50
Postsecondary Education	42
Elementary Education	33
Secondary Education	33
Elementary Secondary Education	27
Early Childhood Education	25
Middle Schools	24
Junior High Schools	20
Grade 7	19
Primary Education	19
Grade 6	17
Intermediate Grades	16
Grade 5	15
Grade 8	15
Grade 4	14
Grade 3	13
High Schools	13
Kindergarten	9
Preschool Education	7
Grade 9	5
Adult Education	3
Grade 10	3
Grade 1	2
Grade 11	2
Adult Basic Education	1
More ▼

Audience

Teachers	5
Administrators	2
Practitioners	2
Policymakers	1
Students	1

Location

New York	10
Canada	6
Illinois	6
Turkey	6
China	5
United Kingdom (England)	5
United States	5
Australia	4
Florida	4
Maryland	4
United Kingdom	4
California	3
Delaware	3
Indonesia	3
Iran	3
Michigan	3
Nebraska	3
Nigeria	3
Ohio	3
Pennsylvania	3
Russia	3
Taiwan	3
Texas	3
Washington	3
Bangladesh	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Elementary and Secondary…	3
Race to the Top	3
Individuals with Disabilities…	2
Americans with Disabilities…	1
Debra P v Turlington	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 249 results Save | Export

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Direct link

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

A Practical Comparison of Decision Consistency Estimates

Peer reviewed
PDF on ERIC

Download full text

Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making

A Meta-Analysis of Self-Assessment and Language Performance in Language Testing and Assessment

Peer reviewed

Direct link

Li, Minzi; Zhang, Xian – Language Testing, 2021

This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…

Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Inter-Rater Reliability in Comprehensive Examination Scoring: The Case for Consistent and Collaborative Rater Training and Calibration

Download full text

Saenz, David Arron – Online Submission, 2023

There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability as research indicates several factors including frequency and timing play crucial roles towards ensuring inter-rater reliability. Additionally, increasing amounts research indicate possible links in…

Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics

Test Review: Raven's 2 Progressive Matrices, Clinical Edition (Raven's 2)

Peer reviewed

Direct link

McLeod, Justin W.H.; McCrimmon, Adam W. – Journal of Psychoeducational Assessment, 2021

The "Raven's 2 Progressive Matrices Clinical Edition" (Raven's 2; Raven, Rust, Chan, & Zhou, 2018), published by NCS Pearson, is an individually administered nonverbal assessment of general cognitive ability developed to measure "educative abilities," defined as the ability to think clearly and solve complex problems in…

Descriptors: Test Reviews, Intelligence Tests, Testing, Test Reliability

Utilizing Deep Learning AI to Analyze Scientific Models: Overcoming Challenges

Peer reviewed

Direct link

Tingting Li; Kevin Haudek; Joseph Krajcik – Journal of Science Education and Technology, 2025

Scientific modeling is a vital educational practice that helps students apply scientific knowledge to real-world phenomena. Despite advances in AI, challenges in accurately assessing such models persist, primarily due to the complexity of cognitive constructs and data imbalances in educational settings. This study addresses these challenges by…

Descriptors: Artificial Intelligence, Scientific Concepts, Models, Automation

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Project Development for Blood Bank Application and Convertor for Software Testing

Peer reviewed
PDF on ERIC

Download full text

Rosziati Ibrahim; Mizani Mohamad Madon; Zhiang Yue Lee; Piraviendran A/L Rajendran; Jahari Abdul Wahab; Faaizah Shahbodin – International Society for Technology, Education, and Science, 2023

This paper discusses the steps involve in project development for developing the mobile application, namely Blood Bank Application and developing the convertor for software testing. The project development is important for Computer Science students for them to learn the important steps in developing the application and testing the reliability of…

Descriptors: Program Administration, Educational Technology, Computer Software, Testing

Assessments Play an Important Role in Serving Students. What's Next: Policy Recommendations from the George W. Bush Institute

Download full text

Anne Wicks; Robin Berkley – George W. Bush Institute, 2025

Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…

Descriptors: Student Evaluation, Testing, Tests, Standardized Tests

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Parents Can Accurately and Reliably Administer an Online Dyslexia Evaluation Tool

Peer reviewed

Direct link

Hurford, David P.; Wines, Autumn – Australian Journal of Learning Difficulties, 2022

The purpose of the present study was to examine the potential that parents could effectively administer an online dyslexia evaluation tool (ODET) to their children. To this end, four groups consisting of parents and trained staff were compared. Sixty-three children (36 females and 27 males) participated. The children in each group were assessed…

Descriptors: Test Reliability, Computer Assisted Testing, Dyslexia, Screening Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 17

Journal of Psychoeducational…	19
ProQuest LLC	9
Language Testing	8
New York State Education…	8
Canadian Journal of School…	7
Language Assessment Quarterly	6
Online Submission	6
ETS Research Report Series	5
International Journal of…	5
Journal of Educational…	5
Partnership for Assessment of…	4
Regional Educational…	4
Educational Measurement:…	3
English Teaching Forum	3
Measurement in Physical…	3
Nebraska Department of…	3
Practical Assessment,…	3
Advances in Health Sciences…	2
Advances in Language and…	2
Annenberg Institute for…	2
Assessment and Accountability…	2
Assessment for Effective…	2
Child Abuse & Neglect: The…	2
Council of Chief State School…	2
Early Child Development and…	2
More ▼

McCrimmon, Adam W.	6
Goldschmidt, Pete	3
Ackerman, Debra J.	2
Delen, Erhan	2
Dickens, Rachel H.	2
Dorans, Neil J.	2
Dunne, Michael P.	2
Hamid, M. Obaidul	2
Heritage, Margaret	2
Herman, Joan L.	2
Hurford, David P.	2
Kaya, Fatih	2
Meisinger, Elizabeth B.	2
Mislevy, Robert J.	2
Nordstokke, David W.	2
Petscher, Yaacov	2
Pinder, Patrice Juliet	2
Prather, Edward E.	2
Runyan, Desmond K.	2
Salmani-Nodoushan, Mohammad…	2
Solano-Flores, Guillermo	2
Tarar, Jessica M.	2
Zolotor, Adam J.	2
Abdullah, Saifuddin Kumar	1
Ahmed, Md. Kawser	1
More ▼

ACT Assessment	3
Measures of Academic Progress	3
Raven Progressive Matrices	3
Autism Diagnostic Observation…	2
Battelle Developmental…	2
Bayley Scales of Infant…	2
Clinical Evaluation of…	2
Florida Comprehensive…	2
International English…	2
National Assessment of…	2
Program for International…	2
Test of English as a Foreign…	2
Vineland Adaptive Behavior…	2
Wechsler Adult Intelligence…	2
Wechsler Intelligence Scale…	2
Woodcock Johnson Tests of…	2
ACTFL Oral Proficiency…	1
Beck Anxiety Inventory	1
Beery Developmental Test of…	1
Block Design Test	1
Center for Epidemiologic…	1
Childhood Autism Rating Scale	1
Classroom Assessment Scoring…	1
Denver Developmental…	1
Developmental Indicators for…	1
More ▼