ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	11

Descriptor

Evaluation Methods	24
Test Theory	24
Psychometrics	7
Test Reliability	6
Testing	5
Foreign Countries	4
Measurement Techniques	4
Scores	4
Test Validity	4
Academic Achievement	3
Educational Research	3
Error of Measurement	3
Evaluation Research	3
Item Response Theory	3
Measurement	3
Models	3
Test Format	3
Adaptive Testing	2
Comparative Analysis	2
Comparative Testing	2
Computer Assisted Testing	2
Context Effect	2
Correlation	2
Diagnostic Tests	2
Educational Assessment	2
More ▼

Source

Applied Psychological…	2
Educational and Psychological…	2
Journal of Educational…	2
Applied Measurement in…	1
Asia Pacific Journal of…	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
Educational Measurement:…	1
Educational Research and…	1
European Journal of…	1
High Ability Studies	1
Language, Speech, and Hearing…	1
Measurement:…	1
Multivariate Behavioral…	1
Online Submission	1
Remedial and Special…	1
Review of Educational Research	1
More ▼

Publication Type

Reports - Evaluative	24
Journal Articles	20
Opinion Papers	3
Speeches/Meeting Papers	3
Books	1
Collected Works - General	1

Education Level

Higher Education	4
Elementary Secondary Education	2
Adult Education	1
Junior High Schools	1
Middle Schools	1

Audience

Practitioners	2
Researchers	1

Location

Finland (Helsinki)	1
Singapore	1
Turkey	1
United Kingdom	1
Utah	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Direct link

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

Commentary on Baird, J., Andrich, D., Hopfenbeck, T. N. and Stobart, G., "Assessment and Learning: Fields Apart"

Peer reviewed

Direct link

Scharaschkin, Alex – Assessment in Education: Principles, Policy & Practice, 2017

This issue's featured article, "Assessment and Learning: Fields Apart" (Baird, Andrich, Hopfenbeck, and Stobart 2017) raises issues that are of basic importance for the disciplines of assessment and teaching and learning theory. In this commentary, Alex Scharaschkin restricts his remarks to a few areas. He considers the idea of a…

Descriptors: Educational Assessment, Learning Theories, Test Theory, Psychometrics

The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

Peer reviewed

Direct link

Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…

Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance

The Paradoxical Attenuation Effect in Tests Based on Classical Test Theory: Mathematical Background and Practical Implications for the Measurement of High Abilities

Peer reviewed

Direct link

Ziegler, Albert; Ziegler, Albert – High Ability Studies, 2009

The aim of this paper is to demonstrate the dramatic consequences the application of cut-off points can have in the practice of identifying gifted individuals. The paradoxical attenuation effect describes the frequent situation in which measurements of the gifts and talents individuals possess are lower than their true values. However, in…

Descriptors: Gifted, Academic Achievement, Test Theory, Measurement

Diagnostic Classification Models: Which One Should I Use?

Peer reviewed

Direct link

Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009

Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…

Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Direct link

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

Evaluating Alignment between Curriculum, Assessment, and Instruction

Peer reviewed

Direct link

Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009

The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…

Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development

Detecting Differential Speededness in Multistage Testing

Peer reviewed

Direct link

van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007

A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…

Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

The Telephone Interview Probe: A Novel Measure of Treatment Response in Children with Attention Deficit Hyperactivity Disorder

Peer reviewed

Direct link

Corkum, Penny; Andreou, Pantelis; Schachar, Russell; Tannock, Rosemary; Cunningham, Charles – Educational and Psychological Measurement, 2007

With increasing interest in studies evaluating treatment outcome in children with attention deficit hyperactivity disorder (ADHD), there is a need for treatment-sensitive instruments that are feasible, yield valid and reliable scores, and measure outcome in a "time-locked" and "situation- and symptom-specific" manner. These instruments are needed…

Descriptors: Attention Deficit Disorders, Children, Evaluation Methods, Generalizability Theory

Implications of Item Response Theory for the Measurement Practitioner.

Peer reviewed

Loyd, Brenda H. – Applied Measurement in Education, 1988

The impact of item response theory (IRT) on the measurement practitioner is discussed, with a review of potential benefits. The complexity of IRT theory and procedures and the lack of robustness of IRT procedures to violation of assumptions must be recognized for the measurement practitioner to realize its advantages. (SLD)

Descriptors: Educational Researchers, Evaluation Methods, Evaluators, Latent Trait Theory

The [Alpha] and the [Omega] of Congeneric Test Theory: An Extension of Reliability and Internal Consistency to Heterogeneous Tests

Peer reviewed

Direct link

Lucke, Joseph F. – Applied Psychological Measurement, 2005

Psychometric theory focuses primarily on tests that are homogeneous, measuring only one attribute of a psychosocial entity. However, the complexity of psychosocial behavior often requires tests that are heterogeneous, measuring more than one attribute. In this presentation, reliability and internal consistency are extended to heterogeneous tests…

Descriptors: Psychometrics, Item Response Theory, Test Reliability, Psychological Studies

Relation between Science Teachers' Assessment Tools and Students Cognitive Development

Download full text

Ozsevgec, Tuncay; Cepni, Salih – Online Submission, 2006

In order to determine students' achievement, science teachers have to develop their own assessment tools. This study attempts to find out the relationship between the teachers' assessment tools and students' cognitive development according to the teachers' teaching experiences. Six open-ended survey questions were developed and delivered to 59…

Descriptors: Foreign Countries, Correlation, Science Teachers, Evaluation Methods

Rumors Regarding the Death of the Equipercentile Assumption May Have Been Greatly Exaggerated.

Peer reviewed

Tallmadge, G. Kasten – Journal of Educational Measurement, 1985

Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)

Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement

Recontextualizing Mental Measurement.

Peer reviewed

Goldstein, Harvey – Educational Measurement: Issues and Practice, 1994

This article examines how psychometric models based on certain assumptions have come to be used counterproductively by many practitioners in ways that limit the kinds of conclusions that can be made. The general problem of the context's influence on performance is discussed, and some implications are drawn. (SLD)

Descriptors: Context Effect, Educational Research, Evaluation Methods, Measurement Techniques

A Critical Examination of the "Reliability" and "Abnormality" Approaches to the Evaluation of Subtest Score Differences.

Peer reviewed

Cahan, Sorel – Educational and Psychological Measurement, 1989

Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…

Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models

Previous Page | Next Page »

Pages: 1 | 2

van der Linden, Wim J.	2
Andreou, Pantelis	1
Bos, Wilfried	1
Breithaupt, Krista	1
Cahan, Sorel	1
Cepni, Salih	1
Chuah, Siang Chee	1
Corkum, Penny	1
Cunningham, Charles	1
Fenna, Doug S.	1
Goldstein, Harvey	1
Goy, Martin	1
Haley, Kathleen	1
Jiao, Hong	1
Loyd, Brenda H.	1
Lucke, Joseph F.	1
Martone, Andrea	1
Medoff, Deborah R.	1
O'Grady, Kevin E.	1
Ozsevgec, Tuncay	1
Rantanen, Pekka	1
Schachar, Russell	1
Scharaschkin, Alex	1
Ser Ming Mark Lee	1
More ▼