Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated, and the deflation may be profound: 0.40–0.60 units of reliability, or 46–71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Metsämuuronen, Jari – International Journal of Educational Methodology, 2021
Although Goodman-Kruskal gamma (G) is used relatively rarely, it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…
Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation
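As a concrete illustration of the coefficient discussed in this abstract, Goodman-Kruskal gamma can be computed from concordant and discordant pairs. The sketch below uses hypothetical item and score vectors, not data from the article:

```python
from itertools import combinations

def goodman_kruskal_gamma(x, y):
    """Gamma = (C - D) / (C + D), where C and D count concordant and
    discordant pairs; pairs tied on either variable are ignored."""
    c = d = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        s = (x1 - x2) * (y1 - y2)
        if s > 0:
            c += 1
        elif s < 0:
            d += 1
    return (c - d) / (c + d)

# Hypothetical dichotomous item and total scores for 8 test-takers:
item = [0, 0, 1, 0, 1, 1, 0, 1]
score = [3, 5, 9, 4, 8, 6, 7, 10]
print(goodman_kruskal_gamma(item, score))  # → 0.875
```

Because all pairs tied on the dichotomous item drop out of both counts, gamma is not attenuated by the item's restricted 0/1 range the way a Pearson correlation is, which is one reason it appeals as an item-score association estimator.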
McGuire, Michael J. – International Journal for the Scholarship of Teaching and Learning, 2023
College students in a lower-division psychology course made metacognitive judgments by predicting and postdicting performance for true-false, multiple-choice, and fill-in-the-blank question sets on each of three exams. This study investigated which question format would result in the most accurate metacognitive judgments. Extending Koriat's (1997)…
Descriptors: Metacognition, Multiple Choice Tests, Accuracy, Test Format
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
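The classical result behind such analyses can be sketched numerically. The function below implements the standard classical-test-theory expression for the reliability of a difference score; the input values are illustrative assumptions, not figures from the article:

```python
def difference_score_reliability(sd_pre, sd_post, rel_pre, rel_post, r_pre_post):
    """Classical-test-theory reliability of D = posttest - pretest:
    ratio of true-score variance to observed variance of the difference."""
    cov = sd_pre * sd_post * r_pre_post
    true_var = sd_pre**2 * rel_pre + sd_post**2 * rel_post - 2 * cov
    obs_var = sd_pre**2 + sd_post**2 - 2 * cov
    return true_var / obs_var

# Equal SDs of 10, reliabilities of .85, pretest-posttest correlation of .70:
print(difference_score_reliability(10, 10, 0.85, 0.85, 0.70))  # → 0.5
```

Even with respectable test reliabilities, a strong pretest-posttest correlation leaves the difference score markedly less reliable than either test alone, which is why its reliability deserves separate scrutiny.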
Jessica Röhner; Philipp Thoss; Liad Uziel – Educational and Psychological Measurement, 2024
According to faking models, personality variables and faking are related. Most prominently, people's tendency to try to make an appropriate impression (impression management; IM) and their tendency to adjust the impression they make (self-monitoring; SM) have been suggested to be associated with faking. Nevertheless, empirical findings connecting…
Descriptors: Metacognition, Deception, Personality Traits, Scores
Ahmet Yildirim; Nizamettin Koç – International Journal of Assessment Tools in Education, 2024
The present research aims to examine whether the questions in the Program for the International Student Assessment (PISA) 2009 reading literacy instrument display differential item functioning (DIF) among the Turkish, French, and American samples based on univariate and multivariate matching techniques before and after the total score, which is…
Descriptors: Test Items, Item Analysis, Correlation, Error of Measurement
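One common DIF procedure built on matching by total score (not necessarily the one used in this study) is the Mantel-Haenszel common odds ratio. The sketch below assumes one 2x2 table of correct/incorrect counts per matched score stratum, with made-up counts:

```python
def mantel_haenszel_or(tables):
    """Mantel-Haenszel common odds ratio. Each table is (a, b, c, d):
    a/b = reference-group correct/incorrect counts,
    c/d = focal-group correct/incorrect counts,
    one table per matched total-score stratum."""
    num = den = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        num += a * d / n
        den += b * c / n
    return num / den

# Hypothetical counts for one item in two score strata:
tables = [(3, 1, 2, 2), (4, 1, 3, 2)]
odds = mantel_haenszel_or(tables)
print(round(odds, 3))  # → 2.818
```

A common odds ratio well above 1 suggests the item favors the reference group even after examinees are matched on total score, which is the operational meaning of DIF in this framework.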
Marli Crabtree; Kenneth L. Thompson; Ellen M. Robertson – HAPS Educator, 2024
Research has suggested that changing one's answer on multiple-choice examinations is more likely to lead to positive academic outcomes. This study aimed to further understand the relationship between changing answer selections and item attributes, student performance, and time within a population of 158 first-year medical students enrolled in a…
Descriptors: Anatomy, Science Tests, Medical Students, Medical Education
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
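The two estimators named in this abstract are easy to compute side by side. The response matrix below is hypothetical; note how the item-test correlation ("Rit") exceeds the item-rest correlation ("Rir") because the item's own score is part of the total it is correlated with:

```python
def pearson(x, y):
    """Plain Pearson product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# Hypothetical 0/1 response matrix: rows are test-takers, columns are items.
responses = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
totals = [sum(row) for row in responses]
g = 0  # index of the item under study
item = [row[g] for row in responses]
rit = pearson(item, totals)                                  # item-test
rir = pearson(item, [t - i for t, i in zip(totals, item)])   # item-rest
print(round(rit, 3), round(rir, 3))  # → 0.877 0.739
```

Both values still understate discrimination power for the reasons the article analyzes; the sketch only shows the mechanical gap between the two classical estimators.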
Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025
Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…
Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage-comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019
This note discusses the merits of coefficient alpha and the conditions under which they hold, in light of recent critical publications that overlook significant research findings from the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…
Descriptors: Test Validity, Test Reliability, Test Items, Correlation
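For reference, coefficient alpha in its usual variance form, sketched on made-up dichotomous data (population variances are used, a common convention):

```python
import statistics

def cronbach_alpha(item_scores):
    """Coefficient alpha: k/(k-1) * (1 - sum of item variances / total variance).
    item_scores holds one score vector per item, all over the same persons."""
    k = len(item_scores)
    totals = [sum(person) for person in zip(*item_scores)]
    item_var_sum = sum(statistics.pvariance(v) for v in item_scores)
    return k / (k - 1) * (1 - item_var_sum / statistics.pvariance(totals))

# Hypothetical 0/1 responses: 3 items scored for 4 persons.
items = [
    [1, 0, 1, 1],
    [1, 0, 0, 1],
    [1, 1, 0, 1],
]
print(cronbach_alpha(items))  # → 0.5625
```

Whether this coefficient is a dependable reliability estimate hinges on the conditions (e.g., essential tau-equivalence, uncorrelated errors) that the note above revisits.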
Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019
This note highlights and illustrates the links between item response theory and classical test theory in the context of polytomous items. An item response modeling procedure is discussed that can be used for point and interval estimation of the individual true score on any item in a measuring instrument or item set following the popular and widely…
Descriptors: Correlation, Item Response Theory, Test Items, Scores
Stephanie B. Moore – ProQuest LLC, 2024
This three-manuscript dissertation attempts to answer the question: "How does students' English language proficiency (ELP) inform the availability, structure, and use of English language accommodations and intervention to support the academic achievement of English learner (EL) students?" The question is addressed using three independent…
Descriptors: English Language Learners, Language Proficiency, English (Second Language), Second Language Learning
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests