Showing 1 to 15 of 326 results
Peer reviewed
Direct link
Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024
Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to ensure a fair assessment. Different approaches exist for estimating this relationship, relying either on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…
Descriptors: Testing, Academic Ability, Time on Task, Correlation
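The approaches the abstract alludes to are model-based; purely as orientation, here is a minimal sketch (all data and parameter values are invented) of the naive between-person proxy, correlating total score with mean log response time, that such joint models are designed to improve upon.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 200 examinees x 30 items, with a built-in
# positive correlation between latent ability and latent speed.
n_persons, n_items = 200, 30
ability = rng.normal(0, 1, n_persons)
speed = 0.3 * ability + rng.normal(0, 1, n_persons)

# Responses: 1-PL accuracy model; response times: lognormal, faster
# examinees (higher speed) produce shorter times.
difficulty = rng.normal(0, 1, n_items)
p_correct = 1 / (1 + np.exp(-(ability[:, None] - difficulty[None, :])))
correct = rng.binomial(1, p_correct)
rt = np.exp(3 - speed[:, None] + rng.normal(0, 0.4, (n_persons, n_items)))

# Naive person-level proxies: total score for ability, negative mean
# log response time for speed (higher = faster).
score = correct.sum(axis=1)
speed_proxy = -np.log(rt).mean(axis=1)
print(f"naive speed-ability correlation: {np.corrcoef(score, speed_proxy)[0, 1]:.3f}")
```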
Peer reviewed
Direct link
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, etc.) of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Peer reviewed
PDF on ERIC
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis when developing measurement tools for assessing differences between and among study variables. Most of the studies that set out to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Peer reviewed
PDF on ERIC
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees who would receive the same classification decision if they were to retake the same or a parallel form of the exam without memory of the first attempt.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
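To make the retake thought experiment concrete (the authors' estimator works from a single administration, so this is only an illustration of the target quantity, with every parameter invented), the sketch below simulates two parallel forms and computes the share of examinees classified identically on both.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical setup: 10,000 examinees, two parallel 40-item forms,
# pass/fail cut score of 24.
n, k, cut = 10_000, 40, 24
theta = rng.normal(0, 1, n)

def form_score(theta, rng):
    # Each parallel form draws its own item difficulties and response noise.
    b = rng.normal(0, 1, k)
    p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))
    return rng.binomial(1, p).sum(axis=1)

pass_a = form_score(theta, rng) >= cut
pass_b = form_score(theta, rng) >= cut

# Decision consistency: proportion classified the same way on both forms.
print(f"simulated decision consistency: {np.mean(pass_a == pass_b):.3f}")
```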
Peer reviewed
Direct link
Li, Minzi; Zhang, Xian – Language Testing, 2021
This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…
Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports
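A pooled value such as r = 0.466 is conventionally obtained by averaging Fisher z-transformed correlations weighted by sample size; the sketch below shows that standard fixed-effect computation with invented study inputs (the meta-analysis itself may have used a random-effects model).

```python
import numpy as np

# Hypothetical study-level inputs: correlation r and sample size n per study.
rs = np.array([0.40, 0.52, 0.31, 0.48])
ns = np.array([120, 85, 240, 60])

# Fisher z-transform; each z has sampling variance 1 / (n - 3).
z = np.arctanh(rs)
w = ns - 3.0

# Inverse-variance (fixed-effect) pooling, back-transformed to r.
z_bar = np.sum(w * z) / np.sum(w)
se = np.sqrt(1.0 / np.sum(w))
lo, hi = np.tanh(z_bar - 1.96 * se), np.tanh(z_bar + 1.96 * se)
print(f"pooled r = {np.tanh(z_bar):.3f}, 95% CI [{lo:.3f}, {hi:.3f}]")
```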
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
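To make the scoring-method question concrete, here is a toy simulation, emphatically not the authors' VAM specification, that contrasts school-level "value added" proxies computed from mean (proportion-correct) scores versus crude IRT-style maximum-likelihood scores; every quantity is invented.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical data: 50 schools x 40 students answering 25 Rasch items.
n_sch, n_stu, n_it = 50, 40, 25
school_effect = rng.normal(0, 0.3, n_sch)
theta = school_effect[:, None] + rng.normal(0, 1, (n_sch, n_stu))
b = rng.normal(0, 1.2, n_it)
resp = rng.binomial(1, 1 / (1 + np.exp(-(theta[..., None] - b))))

# Scoring method 1: mean (proportion-correct) score.
mean_score = resp.mean(axis=2)

# Scoring method 2: crude Rasch maximum-likelihood theta via grid search.
grid = np.linspace(-4, 4, 161)
p_grid = 1 / (1 + np.exp(-(grid[:, None] - b)))          # (grid, item)
ll = resp @ np.log(p_grid).T + (1 - resp) @ np.log(1 - p_grid).T
theta_hat = grid[ll.argmax(axis=2)]

# "Value added" proxy: school means under each scoring method,
# compared by the rank (Spearman) correlation of the school estimates.
ranks = lambda x: x.argsort().argsort()
va_mean, va_irt = mean_score.mean(axis=1), theta_hat.mean(axis=1)
print(f"rank correlation of school VA proxies: "
      f"{np.corrcoef(ranks(va_mean), ranks(va_irt))[0, 1]:.3f}")
```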
Saenz, David Arron – Online Submission, 2023
There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability, as research indicates that several factors, including frequency and timing, play crucial roles in ensuring it. Additionally, a growing body of research indicates possible links in…
Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics
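Inter-rater reliability in such work is often summarized with Cohen's kappa, which corrects observed agreement for the agreement expected by chance; the from-scratch sketch below uses invented ratings.

```python
from collections import Counter

# Hypothetical ratings from two raters on the same 12 essays (rubric 1-4).
rater_a = [1, 2, 2, 3, 4, 4, 2, 3, 1, 2, 3, 4]
rater_b = [1, 2, 3, 3, 4, 3, 2, 3, 1, 2, 2, 4]
n = len(rater_a)

# Observed agreement: share of essays given identical scores.
p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

# Chance agreement: sum over categories of the raters' marginal proportions.
count_a, count_b = Counter(rater_a), Counter(rater_b)
p_e = sum((count_a[c] / n) * (count_b[c] / n) for c in set(rater_a) | set(rater_b))

kappa = (p_o - p_e) / (1 - p_e)
print(f"observed agreement {p_o:.3f}, Cohen's kappa {kappa:.3f}")
```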
Peer reviewed
Direct link
Tingting Li; Kevin Haudek; Joseph Krajcik – Journal of Science Education and Technology, 2025
Scientific modeling is a vital educational practice that helps students apply scientific knowledge to real-world phenomena. Despite advances in AI, challenges in accurately assessing such models persist, primarily due to the complexity of cognitive constructs and data imbalances in educational settings. This study addresses these challenges by…
Descriptors: Artificial Intelligence, Scientific Concepts, Models, Automation
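The abstract is cut off before describing the study's remedy, so the following is only a generic illustration of one standard response to class imbalance in automated scoring, inverse-frequency loss weighting, on synthetic data; nothing here is taken from the paper.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(3)

# Synthetic imbalanced scoring data: roughly 5% of "student models"
# reach the highest rubric level (class 1).
n, d = 2000, 20
X = rng.normal(size=(n, d))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(0, 1, n) > 2.3).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# class_weight="balanced" reweights the loss inversely to class frequency,
# so the rare class is not simply ignored by the classifier.
clf = LogisticRegression(class_weight="balanced", max_iter=1000).fit(X_tr, y_tr)
print(classification_report(y_te, clf.predict(X_te), digits=3))
```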
Peer reviewed
PDF on ERIC
Rosziati Ibrahim; Mizani Mohamad Madon; Zhiang Yue Lee; Piraviendran A/L Rajendran; Jahari Abdul Wahab; Faaizah Shahbodin – International Society for Technology, Education, and Science, 2023
This paper discusses the steps involved in project development for a mobile application, namely the Blood Bank Application, and in developing the converter for software testing. Project development is important for Computer Science students, as it teaches them the key steps in developing an application and testing the reliability of…
Descriptors: Program Administration, Educational Technology, Computer Software, Testing
Peer reviewed
PDF on ERIC
Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022
When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…
Descriptors: Item Response Theory, Test Construction, Scoring, Testing
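The sentence above is truncated mid-phrase; as a generic illustration of how sampled Time A papers and their Time B rescores might be compared (not necessarily the report's proposal), the sketch below computes exact agreement, adjacent agreement, and mean score shift on invented data.

```python
# Hypothetical rescoring data: original Time A scores and Time B rescores
# of the same sampled papers (0-4 rubric).
time_a = [3, 2, 4, 1, 3, 2, 0, 4, 3, 2, 1, 3]
time_b = [3, 2, 3, 1, 3, 3, 0, 4, 3, 2, 1, 2]
n = len(time_a)

exact = sum(a == b for a, b in zip(time_a, time_b)) / n
adjacent = sum(abs(a - b) <= 1 for a, b in zip(time_a, time_b)) / n
mean_shift = sum(b - a for a, b in zip(time_a, time_b)) / n  # drift direction

print(f"exact agreement {exact:.2f}, adjacent agreement {adjacent:.2f}, "
      f"mean score shift {mean_shift:+.2f}")
```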
Peer reviewed
Direct link
Hurford, David P.; Wines, Autumn – Australian Journal of Learning Difficulties, 2022
The purpose of the present study was to examine the potential that parents could effectively administer an online dyslexia evaluation tool (ODET) to their children. To this end, four groups consisting of parents and trained staff were compared. Sixty-three children (36 females and 27 males) participated. The children in each group were assessed…
Descriptors: Test Reliability, Computer Assisted Testing, Dyslexia, Screening Tests
Peer reviewed
PDF on ERIC
Mehmet Kanik – International Journal of Assessment Tools in Education, 2024
Interest in ChatGPT has surged, prompting people to explore its use in a range of tasks. However, before allowing it to replace humans, its capabilities should be investigated. As ChatGPT has potential for use in testing and assessment, this study aims to investigate the questions generated by ChatGPT by comparing them to those written by a course…
Descriptors: Artificial Intelligence, Testing, Multiple Choice Tests, Test Construction
Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024
This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…
Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction
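To see why crossover contamination matters, this hypothetical simulation (not the article's design or data) shows how the intent-to-treat contrast from individual-level randomization is diluted as more control students obtain the treatment anyway.

```python
import numpy as np

rng = np.random.default_rng(4)
n, true_effect = 10_000, 0.3  # assumed effect of 0.3 SD

def itt_estimate(crossover_rate):
    treat = rng.integers(0, 2, n).astype(bool)            # individual randomization
    crossed = ~treat & (rng.random(n) < crossover_rate)   # contaminated controls
    received = treat | crossed
    y = rng.normal(0, 1, n) + true_effect * received
    return y[treat].mean() - y[~treat].mean()             # intent-to-treat contrast

for rate in (0.0, 0.1, 0.3, 0.5):
    est = np.mean([itt_estimate(rate) for _ in range(200)])
    print(f"crossover {rate:.0%}: mean ITT estimate {est:.3f}")
```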
Peer reviewed
Direct link
Troy L. Cox; Gregory L. Thompson; Steven S. Stokes – Foreign Language Annals, 2025
This study investigated the differences between the ACTFL Oral Proficiency Interview (OPI) and the ACTFL Oral Proficiency Interview - Computer (OPIc) among Spanish learners at a U.S. university. Participants (N = 154) were randomly assigned to take both tests in a counterbalanced order to mitigate test order effects. Data were analyzed using an…
Descriptors: Oral Language, Language Proficiency, Interviews, Computer Uses in Education
Peer reviewed
Direct link
Elkhatat, Ahmed M. – International Journal for Educational Integrity, 2022
Examinations form part of the assessment processes that constitute the basis for benchmarking individual educational progress, and must consequently fulfill credibility, reliability, and transparency standards in order to promote learning outcomes and ensure academic integrity. A randomly selected question examination (RSQE) is considered to be an…
Descriptors: Integrity, Monte Carlo Methods, Credibility, Reliability
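The abstract truncates before the details, so purely as a sketch under assumed bank and form sizes, the following Monte Carlo estimate shows one quantity an RSQE analysis might care about: how many questions two independently generated forms share.

```python
import random

random.seed(5)

# Hypothetical item bank of 200 questions; each exam draws 40 at random.
BANK, EXAM_LEN, TRIALS = range(200), 40, 10_000

def shared_questions():
    a = set(random.sample(BANK, EXAM_LEN))
    b = set(random.sample(BANK, EXAM_LEN))
    return len(a & b)

# Expected overlap between two examinees' forms, one way to quantify an
# RSQE's resistance to answer copying (theoretical value: 40*40/200 = 8).
mean_overlap = sum(shared_questions() for _ in range(TRIALS)) / TRIALS
print(f"expected shared questions: {mean_overlap:.2f} of {EXAM_LEN}")
```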