ERIC - Search Results

Publication Date

In 2025	4
Since 2024	14
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	38

Descriptor

Comparative Testing	203
Test Reliability	203
Test Validity	95
Higher Education	47
Test Construction	47
Foreign Countries	31
College Students	28
Test Format	28
Intelligence Tests	22
Test Items	22
Psychometrics	20
Adults	19
Correlation	19
Multiple Choice Tests	19
Computer Assisted Testing	18
Scores	18
Test Interpretation	18
Evaluation Methods	17
Factor Structure	16
Achievement Tests	15
Comparative Analysis	15
Standardized Tests	15
Testing Problems	15
Elementary Secondary Education	14
Factor Analysis	14
More ▼

Publication Type

Reports - Research	141
Journal Articles	93
Speeches/Meeting Papers	45
Reports - Evaluative	24
Tests/Questionnaires	4
Dissertations/Theses -…	3
Information Analyses	3
Reports - Descriptive	2
Book/Product Reviews	1
Books	1
Collected Works - Proceedings	1
Collected Works - Serials	1
Dissertations/Theses -…	1
Opinion Papers	1
Reference Materials -…	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	11
Elementary Education	5
Elementary Secondary Education	4
Secondary Education	4
Early Childhood Education	2
Grade 2	2
Grade 4	2
High Schools	2
Grade 10	1
Grade 7	1
Intermediate Grades	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Researchers	9
Practitioners	3
Teachers	2
Counselors	1

Location

United States	5
Australia	4
Canada	4
China	4
Ireland	2
Israel	2
Singapore	2
United Kingdom	2
United Kingdom (England)	2
Alabama	1
Argentina	1
Austria	1
Finland	1
France	1
Georgia (Atlanta)	1
Germany	1
Hong Kong	1
Idaho	1
Illinois (Chicago)	1
India	1
Indonesia	1
Japan	1
Kenya	1
Maryland	1
Netherlands	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
No Child Left Behind Act 2001	1
Race to the Top	1

What Works Clearinghouse Rating

Showing 1 to 15 of 203 results Save | Export

A Practical Comparison of Decision Consistency Estimates

Peer reviewed
PDF on ERIC

Download full text

Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making

A Practical Guide to Item Bank Calibration with Multiple Matrix Sampling

Peer reviewed
PDF on ERIC

Download full text

Eren Can Aybek; Serkan Arikan; Günes Ertas – International Journal of Assessment Tools in Education, 2024

When it is required to estimate item parameters of a large item bank, Multiple Matrix Sampling (MMS) design provides an efficient way while minimizing the test burden on students. The current study exemplifies how to calibrate a large item pool using MMS design for various purposes, such as developing a CAT administration. The purpose of the…

Descriptors: Elementary School Mathematics, Elementary School Students, Grade 4, Item Banks

Neutrosophic Estimators for Estimating the Population Mean in Survey Sampling

Peer reviewed

Direct link

Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024

In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biassed results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…

Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement

Linking Errors Introduced by Rapid Guessing Responses When Employing Multigroup Concurrent IRT Scaling

Direct link

Jiayi Deng – ProQuest LLC, 2024

Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…

Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement

How Long Should a High Stakes Test Be?

Download full text

Tom Benton – Research Matters, 2024

Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…

Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Direct link

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Direct link

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Download full text

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Agreement between Body Composition Estimates Using a 2D Imaging System across Different Body Positions and Days

Peer reviewed

Direct link

Casey J. Metoyer; Katherine Sullivan; Lee J. Winchester; Mark T. Richardson; Michael R. Esco; Michael V. Fedewa – Measurement in Physical Education and Exercise Science, 2025

Relative adiposity (%Fat) was measured using a smartphone-based application in a convenience sample of adults aged 20-52 years (n = 32, 68.7% female, 84.3% White/Caucasian, 26.7 ± 3.5 kg/m2) across different body positions (Anterior versus Posterior) on consecutive days (Day 1 versus Day 2). A reference photo was obtained from the posterior view…

Descriptors: Adults, Body Composition, Handheld Devices, Computer Assisted Instruction

Investigating Students' Expectations and Engagement in General and Organic Chemistry Laboratory Courses

Peer reviewed

Direct link

Elizabeth B. Vaughan; Saraswathi Tummuru; Jack Barbera – Chemistry Education Research and Practice, 2025

Students' expectations for their laboratory coursework are theorized to have an impact on their learning experiences and behaviors, such as engagement. Before students' expectations and engagement can be explored in different types of undergraduate chemistry laboratory courses, appropriate measures of these constructs must be identified, and…

Descriptors: Undergraduate Students, Organic Chemistry, Chemistry, Science Instruction

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Predicting Student Success in a Magnet School Setting through Intelligence and Non-Cognitive Factors

Direct link

John Jeffrey McCann Jr. – ProQuest LLC, 2024

Magnet schools have been a main tool or innovation in urban education settings in the United States, originating in the early 1970's and expanding into most large urban districts today (Blank, 1989). While some magnet schools do not rely on a specific criterion to determine entry, many do. This study focuses on such a setting where students must…

Descriptors: Intelligence Tests, Magnet Schools, Urban Schools, Screening Tests

The Application of Cognitive Task Analysis and Cognitive Load Methods in the Process of Learning Algorithms

Direct link

Razieh Fathi – ProQuest LLC, 2021

This dissertation describes an experiment to investigate how learners with different levels of background in computer science learn core concepts of computer science, in particular, algorithms. We designed a study to focus on cognitive task analysis for eliciting the empirical mental elements of learning two graph algorithms. Cognitive workload…

Descriptors: Undergraduate Students, Computer Science Education, Algorithms, Cognitive Development

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 14

Educational and Psychological…	13
Journal of Clinical Psychology	7
Perceptual and Motor Skills	6
Psychology in the Schools	5
Psychological Assessment	4
Applied Measurement in…	3
Journal of Educational…	3
Journal of Educational…	3
ProQuest LLC	3
Psychological Reports	3
Advances in Health Sciences…	2
Applied Psychological…	2
Evaluation and the Health…	2
Journal of Consulting and…	2
Measurement:…	2
Multivariate Behavioral…	2
Psychol Rep	2
Psychological Test Bulletin	2
Advances in Physiology…	1
Alberta Journal of…	1
Annenberg Institute for…	1
Assessment & Evaluation in…	1
Behavior Research Methods,…	1
British Journal of…	1
Chemistry Education Research…	1
More ▼

Bracken, Bruce A.	3
Gallas, Edwin J.	3
Smith, Douglas K.	3
Trevisan, Michael S.	3
Anderson, Paul S.	2
Breland, Hunter M.	2
Costantino, Giuseppe	2
Green, Kathy	2
Hyers, Albert D.	2
Karma, Kai	2
Marsh, Herbert W.	2
Naglieri, Jack A.	2
Pfeiffer, Steven I.	2
Schroeder, David H.	2
Thompson, Bruce	2
Vispoel, Walter P.	2
Aberman, Hugh M.	1
Alcock, Lara	1
Allison, Donald E.	1
Allison, Howard K., II	1
Alterman, Arthur I.	1
Alwis, W. A. M.	1
Amanda A. Wolkowitz	1
Anderson, David O.	1
More ▼

Wechsler Intelligence Scale…	11
Peabody Picture Vocabulary…	4
Wechsler Adult Intelligence…	4
Kaufman Assessment Battery…	3
Comprehensive Tests of Basic…	2
Computer Anxiety Scale	2
Illinois Test of…	2
McCarthy Scales of Childrens…	2
Minnesota Multiphasic…	2
National Assessment of…	2
Raven Progressive Matrices	2
Stanford Achievement Tests	2
Test of Standard Written…	2
Armed Forces Qualification…	1
Armed Services Vocational…	1
Autism Diagnostic Observation…	1
Beck Depression Inventory	1
Behavior Assessment System…	1
Bem Sex Role Inventory	1
Boehm Test of Basic Concepts	1
Bracken Basic Concept Scale	1
California Achievement Tests	1
Child Abuse Potential…	1
Childhood Autism Rating Scale	1
Collegiate Assessment of…	1
More ▼