ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	16
Since 2016 (last 10 years)	65
Since 2006 (last 20 years)	97

Descriptor

Correlation	120
Test Items	120
Test Reliability	120
Test Validity	66
Test Construction	43
Foreign Countries	42
Factor Analysis	38
Scores	33
Statistical Analysis	33
Psychometrics	30
Difficulty Level	27
Item Analysis	23
Item Response Theory	20
College Students	19
Factor Structure	17
Measures (Individuals)	16
Undergraduate Students	16
Comparative Analysis	14
Goodness of Fit	14
Test Bias	14
Construct Validity	13
Language Tests	12
Scoring	12
Student Attitudes	12
English (Second Language)	11
More ▼

Publication Type

Reports - Research	95
Journal Articles	81
Tests/Questionnaires	14
Reports - Evaluative	10
Speeches/Meeting Papers	7
Dissertations/Theses -…	6
Numerical/Quantitative Data	4
Reports - Descriptive	3
Guides - General	1
Guides - Non-Classroom	1
Multilingual/Bilingual…	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	44
Postsecondary Education	31
Elementary Education	9
Secondary Education	9
Middle Schools	5
Early Childhood Education	4
Elementary Secondary Education	4
Junior High Schools	4
Grade 7	3
Grade 8	3
High Schools	3
Kindergarten	3
Primary Education	3
Adult Education	2
Grade 1	2
Grade 2	2
Grade 6	2
Intermediate Grades	2
Two Year Colleges	2
Grade 3	1
Grade 4	1
Grade 5	1
More ▼

Audience

Researchers	2
Practitioners	1
Teachers	1

Location

Turkey	13
Canada	3
Florida	3
Germany	3
California	2
Illinois	2
New York	2
United Kingdom (London)	2
Arizona	1
Australia	1
Chile	1
China	1
Colombia	1
District of Columbia	1
Greece	1
India	1
Indonesia	1
Iowa	1
Iran	1
Japan	1
Kansas	1
Maryland	1
Mexico	1
Nebraska	1
Nebraska (Lincoln)	1
More ▼

Laws, Policies, & Programs

United Nations Convention on…

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 120 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

The Role of Item Distributions on Reliability Estimation: The Case of Cronbach's Coefficient Alpha

Peer reviewed

Direct link

Olvera Astivia, Oscar Lorenzo; Kroc, Edward; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020

Simulations concerning the distributional assumptions of coefficient alpha are contradictory. To provide a more principled theoretical framework, this article relies on the Fréchet-Hoeffding bounds, in order to showcase that the distribution of the items play a role on the estimation of correlations and covariances. More specifically, these bounds…

Descriptors: Test Items, Test Reliability, Computation, Correlation

Comparison of Cronbach's Alpha and McDonald's Omega for Ordinal Data: Are They Different?

Peer reviewed
PDF on ERIC

Download full text

Fatih Orcan – International Journal of Assessment Tools in Education, 2023

Among all, Cronbach's Alpha and McDonald's Omega are commonly used for reliability estimations. The alpha uses inter-item correlations while omega is based on a factor analysis result. This study uses simulated ordinal data sets to test whether the alpha and omega produce different estimates. Their performances were compared according to the…

Descriptors: Statistical Analysis, Monte Carlo Methods, Correlation, Factor Analysis

Development and Initial Validation of Digital Age Teaching Scale (DATS) to Assess Application of ISTE Standards for Educators in K-12 Education Classrooms

Peer reviewed

Direct link

Vucaj, Indrit – Journal of Research on Technology in Education, 2022

This study presents the methodological and procedural development process of the Digital Age Teaching Scale (DATS), a summative assessment tool designed to measure application of the ISTE Standards for Educators in K-12 classrooms. The theoretical framework of the ISTE Standards for Educators informed the development of DATS, and an 8-step process…

Descriptors: Elementary Secondary Education, Standards, Test Construction, Test Items

Modeling Local Item Dependence in Cloze Tests with the Rasch Model: Applying a New Strategy

Peer reviewed
PDF on ERIC

Download full text

Barno S. Abdullaeva; Diyorjon Abdullaev; Nurislom I. Khursanov; Khurshida B. Kadirova; Laylo Djuraeva – International Journal of Language Testing, 2024

Cloze tests are commonly used in language testing as a quick measure of overall language ability or reading comprehension. A problem for the analysis of cloze tests with item response theory models is that cloze test items are locally dependent. This leads to the violation of the conditional or local independence assumption of IRT models. In this…

Descriptors: Cloze Procedure, Language Tests, Test Items, Correlation

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Thanks Coefficient Alpha, We Still Need You!

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019

This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…

Descriptors: Test Validity, Test Reliability, Test Items, Correlation

Preliminary Findings to Support the Internal Consistency and Factor Structure of the Ferrari-Lynch-Vogel Listening Test (FLVLT)

Peer reviewed

Direct link

Ferrari-Bridgers, Franca – International Journal of Listening, 2023

While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…

Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability

Metaverse/Meta-Education Belief Scale

Peer reviewed
PDF on ERIC

Download full text

Erol, Ahmet; Yurdakal, Ibrahim Halil; Tekin Karagöz, Ceren – Malaysian Online Journal of Educational Technology, 2023

The "metaverse," which bridges augmented and virtual reality as mixed reality and includes technological phenomena such as artificial intelligence, continues to be an agenda topic. It is foreseen that the concept in question will accelerate the changes in education and teaching activities, as in many other fields. In this research, a…

Descriptors: Computer Simulation, Artificial Intelligence, Likert Scales, Preservice Teachers

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

A Baseline for Multiple-Choice Testing in the University Classroom

Peer reviewed

Direct link

Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021

There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…

Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests

Refinement and Psychometric Evaluation of the Executive Skills Questionnaire--Revised

Peer reviewed

Direct link

Strait, Julia Englund; Dawson, Peg; Walther, Christine A. P.; Strait, Gerald Gill; Barton, Amy K.; Brunson McClain, Maryellen – Contemporary School Psychology, 2020

Executive functioning (EF) skills are vital for academic success. Along with the recent explosion of interventions targeting these skills comes the need for affordable, efficient, and ecologically valid measures for planning and tailoring interventions and monitoring outcomes. The current study describes the refinement and initial psychometric…

Descriptors: Executive Function, Questionnaires, Rating Scales, Test Items

Exploration of Student Cognitive Mathematics Ability Diagnostic Instruments: Validity, Reliability, and Item Characteristics

Peer reviewed
PDF on ERIC

Download full text

Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023

Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…

Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Educational and Psychological…	9
ProQuest LLC	6
Online Submission	5
Eurasian Journal of…	4
Grantee Submission	4
ETS Research Report Series	3
Journal of Educational…	2
Language Testing	2
ACT, Inc.	1
American Journal on Mental…	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
Autism: The International…	1
Behavioral Research and…	1
Business and Professional…	1
CBE - Life Sciences Education	1
Canadian Journal of School…	1
Career Development and…	1
College Board	1
College Student Journal	1
Contemporary Educational…	1
Contemporary School Psychology	1
Early Education and…	1
More ▼

Liu, Ou Lydia	4
Farina, Kristy	3
LaVenia, Mark	3
Schoen, Robert C.	3
Champagne, Zachary M.	2
Dikmenli, Yurdal	2
Mao, Liyang	2
Metsämuuronen, Jari	2
Sijtsma, Klaas	2
Xu, Jun	2
Adamu, Gishua Garba	1
Aedo-Saravia, Jaime	1
Aktas, Elif	1
Al Khasawneh, Mohanad	1
Aldhalaan, Hesham	1
Aldosari, Mohammed	1
Almeda, Mia	1
Alonzo, Julie	1
Alqadoumi, Tala	1
Alsaleh, Asma	1
Alshaban, Fouad	1
Alshammari, Hawraa	1
Altun, Halis	1
Aman, Michael G.	1
More ▼

SAT (College Admission Test)	5
ACT Assessment	3
Rosenberg Self Esteem Scale	2
Stanford Achievement Tests	2
ACT Interest Inventory	1
Beck Depression Inventory	1
Behavior Assessment System…	1
Clinical Evaluation of…	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Early Childhood Longitudinal…	1
Graduate Record Examinations	1
Marlowe Crowne Social…	1
Minnesota Multiphasic…	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1
Strengths and Difficulties…	1
Teacher Rating Scale	1
Teaching and Learning…	1
Test of English as a Foreign…	1
Trends in International…	1
Wechsler Intelligence Scale…	1
More ▼