ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	46

Descriptor

Evaluation Methods	106
Test Theory	106
Student Evaluation	29
Psychometrics	27
Test Reliability	27
Foreign Countries	23
Test Validity	23
Measurement Techniques	20
Testing	19
Comparative Analysis	15
Item Response Theory	15
Test Construction	15
Test Items	14
Models	13
Scores	13
Educational Assessment	12
Higher Education	12
Test Interpretation	12
Testing Problems	12
Educational Testing	11
Statistical Analysis	11
Definitions	10
Educational Research	10
Classification	9
Equated Scores	9
More ▼

Publication Type

Journal Articles	81
Reports - Research	39
Reports - Evaluative	24
Opinion Papers	18
Reports - Descriptive	11
Speeches/Meeting Papers	10
Information Analyses	9
Guides - Non-Classroom	5
Books	3
Dissertations/Theses -…	2
Guides - General	2
Reports - General	2
Tests/Questionnaires	2
Book/Product Reviews	1
Collected Works - General	1
Collected Works - Proceedings	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	13
Elementary Secondary Education	10
Postsecondary Education	6
Middle Schools	3
Secondary Education	3
Adult Education	2
High Schools	2
Junior High Schools	2
Elementary Education	1
Grade 6	1
Intermediate Grades	1
More ▼

Audience

Practitioners	8
Teachers	4
Researchers	3
Administrators	1
Policymakers	1

Location

United Kingdom	5
United Kingdom (England)	5
United Kingdom (Wales)	4
United States	4
Canada	3
Australia	2
Netherlands	2
Sweden	2
Turkey	2
United Kingdom (Northern…	2
Chile	1
Egypt	1
Finland (Helsinki)	1
Illinois	1
Japan	1
Michigan	1
Oregon	1
Singapore	1
United Kingdom (Great Britain)	1
Utah	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

SAT (College Admission Test)	2
Advanced Placement…	1
Cornell Critical Thinking Test	1
General Educational…	1
Graduate Record Examinations	1
National Assessment of…	1
Piers Harris Childrens Self…	1
Student Descriptive…	1
Tennessee Self Concept Scale	1
Watson Glaser Critical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 106 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

The Tall Order of Teaching Measurement Reliability: Introducing Classical Test Theory through Observations of Human Height

Peer reviewed

Direct link

Richards, Adam S. – Communication Teacher, 2021

Course: Communication Research Methods. Objectives: This activity provides students with an experiential introduction to measurement theory and the methods for assessing measurement reliability. First, multiple measurements of a person's height are interpreted according to classical test theory. Second, the measurement of human height is used as…

Descriptors: Body Height, Measurement, Communication Research, Test Theory

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Direct link

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

Validation of a Coupled, Multiple Response Assessment for Upper-Division Thermal Physics

Peer reviewed

Direct link

Rainey, Katherine D.; Vignal, Michael; Wilcox, Bethany R. – Physical Review Physics Education Research, 2022

Currently there are no assessment instruments available for upper-division thermal physics, though several introductory assessments are currently available. Notably missing from these introductory assessment are items targeting statistical mechanics. This leaves a gap in the content that can be assessed by upper-division thermal physics faculty.…

Descriptors: Physics, Science Instruction, Thermodynamics, College Science

Examination of Common Exams Held by Measurement and Assessment Centers: Many Facet Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021

This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018 which is one of the common exams carried out by The Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…

Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items

Commentary on Baird, J., Andrich, D., Hopfenbeck, T. N. and Stobart, G., "Assessment and Learning: Fields Apart"

Peer reviewed

Direct link

Scharaschkin, Alex – Assessment in Education: Principles, Policy & Practice, 2017

This issue's featured article, "Assessment and Learning: Fields Apart" (Baird, Andrich, Hopfenbeck, and Stobart 2017) raises issues that are of basic importance for the disciplines of assessment and teaching and learning theory. In this commentary, Alex Scharaschkin restricts his remarks to a few areas. He considers the idea of a…

Descriptors: Educational Assessment, Learning Theories, Test Theory, Psychometrics

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Problem Solving Learning Environments and Assessment: A Knowledge Space Theory Approach

Peer reviewed

Direct link

Reimann, Peter; Kickmeier-Rust, Michael; Albert, Dietrich – Computers & Education, 2013

This paper explores the relation between problem solving learning environments (PSLEs) and assessment concepts. The general framework of evidence-centered assessment design is used to describe PSLEs in terms of assessment concepts, and to identify similarities between the process of assessment design and of PSLE design. We use a recently developed…

Descriptors: Teaching Methods, Psychometrics, Problem Solving, Test Theory

Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities

Peer reviewed
PDF on ERIC

Download full text

Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015

For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…

Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Direct link

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Measurement Properties of the Motivation for Youth Treatment Scale with a Residential Group Home Population

Peer reviewed

Direct link

Lambert, Matthew C.; Hurley, Kristin Duppong; Tomlinson, M. Michele Athay; Stevens, Amy L. – Child & Youth Care Forum, 2013

Background: A client's motivation to receive services is significantly related to seeking services, remaining in services, and improved outcomes. The Motivation for Youth Treatment Scale (MYTS) is one of the few brief measures used to assess motivation for mental health treatment. Objective: To investigate if the psychometric properties of the…

Descriptors: Motivation, Mental Health, Health Services, Access to Health Care

A Psychometric Evaluation of the Digital Logic Concept Inventory

Peer reviewed

Direct link

Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C. – Computer Science Education, 2014

Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric…

Descriptors: Psychometrics, Concept Formation, Measures (Individuals), Teaching Methods

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

A Psychometric Analysis of the Chemical Concepts Inventory

Peer reviewed

Direct link

Barbera, Jack – Journal of Chemical Education, 2013

The Chemical Concepts Inventory (CCI) is a multiple-choice instrument designed to assess the alternate conceptions of students in high school or first-semester college chemistry. The instrument was published in 2002 along with an analysis of its data from a test population. This study supports the initial analysis and expands on the psychometric…

Descriptors: Science Instruction, Secondary School Science, High Schools, College Science

Using IRT Trait Estimates versus Summated Scores in Predicting Outcomes

Peer reviewed

Direct link

Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012

It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…

Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Measurement:…	9
Educational and Psychological…	4
Journal of Educational…	4
Alberta Journal of…	3
Annual Review of Applied…	3
Educational Research and…	3
Applied Psychological…	2
Assessment in Education:…	2
Educational Measurement:…	2
History and Social Science…	2
Journal of Chemical Education	2
Language, Speech, and Hearing…	2
ProQuest LLC	2
Advances in Physiology…	1
American Psychologist	1
Anatomical Sciences Education	1
Applied Measurement in…	1
Asia Pacific Journal of…	1
Assessment & Evaluation in…	1
Assessment in Education…	1
Child & Youth Care Forum	1
Communication Monographs	1
Communication Teacher	1
Computer Science Education	1
Computers & Education	1
More ▼

Mislevy, Robert J.	3
Braun, Henry I.	2
Williams, Richard H.	2
Zimmerman, Donald W.	2
van der Linden, Wim J.	2
Aksu, Gökhan	1
Albert, Dietrich	1
Allen, Nancy L.	1
Andreou, Pantelis	1
Audette, Jennifer Gail	1
Bachor, Dan G.	1
Baird, Jo-Anne	1
Barbera, Jack	1
Barnard, Jane	1
Beguin, A. A.	1
Beichner, Robert	1
Bhaskar, R.	1
Black, Beth	1
Bloom, Benjamin S.	1
Boldt, R. F.	1
Bos, Wilfried	1
Bramley, Tom	1
Breithaupt, Krista	1
Brennan, Robert T.	1
More ▼