ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	26

Descriptor

Comparative Analysis	57
Item Analysis	57
Test Validity	57
Test Items	27
Test Reliability	24
Test Construction	16
Foreign Countries	15
Difficulty Level	9
Scores	9
Correlation	8
Statistical Analysis	8
Criterion Referenced Tests	7
Higher Education	7
Multiple Choice Tests	7
Reading Tests	7
Scoring	7
Achievement Tests	6
Factor Analysis	6
Test Bias	6
College Students	5
Mathematics Tests	5
Measurement Techniques	5
Psychometrics	5
Rating Scales	5
Reading Comprehension	5
More ▼

Publication Type

Reports - Research	39
Journal Articles	29
Speeches/Meeting Papers	7
Reports - Evaluative	6
Tests/Questionnaires	3
Dissertations/Theses -…	2
Guides - General	1
Information Analyses	1
Opinion Papers	1

Education Level

Higher Education	11
Postsecondary Education	10
Secondary Education	4
Elementary Education	2
Elementary Secondary Education	1
Grade 4	1
Grade 6	1
Intermediate Grades	1
Middle Schools	1
Preschool Education	1

Audience

Location

Australia	2
Germany	2
Belgium	1
China	1
Europe	1
France	1
Indonesia	1
Iran	1
Japan	1
Luxembourg	1
New York	1
New York (New York)	1
Ohio	1
Russia	1
Spain	1
Switzerland	1
Texas	1
Turkey	1
Turkey (Istanbul)	1
United Kingdom	1
United Kingdom (Belfast)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Childrens Manifest Anxiety…	2
Armed Services Vocational…	1
Bender Gestalt Test	1
California Achievement Tests	1
College and University…	1
National Teacher Examinations	1
Piers Harris Childrens Self…	1
Program for International…	1
Raven Progressive Matrices	1
Sequential Tests of…	1
Trends in International…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 57 results Save | Export

Test Score Comparison Tables: How Well are They Serving Test Users?

Peer reviewed

Direct link

Ute Knoch; Jason Fan – Language Testing, 2024

While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…

Descriptors: Language Tests, English, Test Validity, Item Analysis

Validity of Multiple-Choice Digital Formative Assessment for Assessing Students' (Mis)Conceptions: Evidence from a Mixed-Methods Study in Algebra

Peer reviewed

Direct link

Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024

Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…

Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions

Reliability and Validity Evidence of Diagnostic Methods: Comparison of Diagnostic Classification Models and Item Response Theory-Based Methods

Direct link

Yoo Jeong Jang – ProQuest LLC, 2022

Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…

Descriptors: Classification, Accuracy, Item Response Theory, Correlation

How Administration Stakes and Settings Affect Student Behavior and Performance on a Biology Concept Assessment

Peer reviewed

Direct link

Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023

Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…

Descriptors: Biology, Science Tests, High Stakes Tests, Scores

Treatments of Differential Item Functioning: A Comparison of Four Methods

Peer reviewed

Direct link

Liu, Xiaowen; Jane Rogers, H. – Educational and Psychological Measurement, 2022

Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…

Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

The Pattern of Test-Taking Effort across Items in Cognitive Ability Test: A Latent Class Analysis

Peer reviewed
PDF on ERIC

Download full text

Akhtar, Hanif – International Association for Development of the Information Society, 2022

When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…

Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes

Developing the Diagnostic Test of Misconceptions of Fractions

Peer reviewed
PDF on ERIC

Download full text

Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023

This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…

Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

The Role of Expert Judgement in Language Test Validation

Peer reviewed
PDF on ERIC

Download full text

Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022

The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…

Descriptors: Specialists, Language Tests, Test Validity, College Faculty

Development of the BioCalculus Assessment (BCA)

Peer reviewed

Direct link

Taylor, Robin T.; Bishop, Pamela R.; Lenhart, Suzanne; Gross, Louis J.; Sturner, Kelly – CBE - Life Sciences Education, 2020

We describe the development and initial validity assessment of the 20-item BioCalculus Assessment (BCA), with the objective of comparing undergraduate life science students' understanding of calculus concepts in different courses with alternative emphases (with and without focus on biological applications). The development process of the BCA…

Descriptors: Test Construction, Mathematics Tests, Calculus, Test Validity

Does MTV Really Do a Good Job of Evaluating Professors? An Empirical Test of the Internet Site Ratemyprofessors.com

Peer reviewed

Direct link

Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016

Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Direct link

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Misconceptions about the Naglieri Nonverbal Ability Test: A Commentary of Concerns and Disagreements

Peer reviewed

Direct link

Naglieri, Jack A.; Ford, Donna Y. – Roeper Review, 2015

Black and Hispanic students are undeniably underidentified as gifted and underrepresented in gifted education. The underrepresentation of the two largest groups of "minority" students is long-standing, dating several decades, and is a serious area of contention. Most debates focus on the efficacy of traditional intelligence tests with…

Descriptors: Misconceptions, Nonverbal Ability, Ability, Ability Identification

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	4
Journal of Educational…	4
CBE - Life Sciences Education	2
Journal of Consulting and…	2
Language Testing	2
ProQuest LLC	2
Assessment & Evaluation in…	1
Developmental Psychology	1
Early Education and…	1
Edinburgh Working Papers in…	1
Educational Assessment	1
Hispanic Journal of…	1
International Association for…	1
Journal of Chemical Education	1
Journal of Education for…	1
Journal of Teacher Education	1
Journal of Vocational Behavior	1
Language Education &…	1
Language, Speech, and Hearing…	1
Online Submission	1
Practitioner Research in…	1
Research Quarterly for…	1
Research in Developmental…	1
Roeper Review	1
School Science and Mathematics	1
More ▼

Haladyna, Tom	3
Roid, Gale	2
Afflerbach, Peter	1
Akhtar, Hanif	1
Aleyna Altan	1
Argulewicz, Ed N.	1
Bishop, Pamela R.	1
Blair, Bernadette	1
Bowes, Neal	1
Broonen, Jean-Paul	1
Brown, James Dean	1
Bryant, N. Dale	1
Bärbel Barzel	1
Cantrell, Pamela	1
Churchman, David	1
Claessens, Amy	1
Coniam, David	1
Couch, Brian A.	1
Crambert, Albert C.	1
Crehan, Kevin D.	1
Daniel Thurm	1
David Bell	1
Dillon, Amanda	1
Douglass, Frazier M., IV	1
More ▼