ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	16

Descriptor

Evaluation Methods	23
Foreign Countries	23
Test Theory	23
Comparative Analysis	7
Measurement Techniques	7
Psychometrics	7
Testing Problems	7
Educational Assessment	6
Equated Scores	6
High Stakes Tests	6
Educational Testing	5
Student Evaluation	5
Test Interpretation	5
Test Reliability	5
Test Validity	5
Classification	4
Definitions	4
Educational Policy	4
Evaluation Criteria	4
Predictive Measurement	4
Questionnaires	4
Test Use	4
Difficulty Level	3
Interrater Reliability	3
Program Evaluation	3
More ▼

Source

Measurement:…	4
Assessment in Education:…	2
Alberta Journal of…	1
Asia Pacific Journal of…	1
Assessment & Evaluation in…	1
Assessment in Education…	1
Communication Monographs	1
Educational Research	1
Educational Research and…	1
International Journal of…	1
International Journal of…	1
Language Assessment Quarterly	1
Language Testing	1
Online Submission	1
Research Papers in Education	1
Topics in Early Childhood…	1
More ▼

Publication Type

Journal Articles	20
Reports - Research	11
Opinion Papers	5
Reports - Evaluative	4
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	6
Higher Education	2
Middle Schools	2
Adult Education	1
Elementary Education	1
Grade 6	1
Intermediate Grades	1
Junior High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

United Kingdom	5
United Kingdom (England)	5
United Kingdom (Wales)	4
Canada	3
United States	3
Netherlands	2
Sweden	2
Turkey	2
United Kingdom (Northern…	2
Australia	1
Chile	1
Egypt	1
Finland (Helsinki)	1
Japan	1
Oregon	1
Singapore	1
United Kingdom (Great Britain)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Direct link

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

Examination of Common Exams Held by Measurement and Assessment Centers: Many Facet Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021

This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018 which is one of the common exams carried out by The Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…

Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items

Commentary on Baird, J., Andrich, D., Hopfenbeck, T. N. and Stobart, G., "Assessment and Learning: Fields Apart"

Peer reviewed

Direct link

Scharaschkin, Alex – Assessment in Education: Principles, Policy & Practice, 2017

This issue's featured article, "Assessment and Learning: Fields Apart" (Baird, Andrich, Hopfenbeck, and Stobart 2017) raises issues that are of basic importance for the disciplines of assessment and teaching and learning theory. In this commentary, Alex Scharaschkin restricts his remarks to a few areas. He considers the idea of a…

Descriptors: Educational Assessment, Learning Theories, Test Theory, Psychometrics

The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

Peer reviewed

Direct link

Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…

Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance

Design, Development and Validation of a Model of Problem Solving for Egyptian Science Classes

Peer reviewed

Direct link

Shahat, Mohamed A.; Ohle, Annika; Treagust, David F.; Fischer, Hans E. – International Journal of Science and Mathematics Education, 2013

Educators and policymakers envision the future of education in Egypt as enabling learners to acquire scientific inquiry and problem-solving skills. In this article, we describe the validation of a model for problem solving and the design of instruments for evaluating new teaching methods in Egyptian science classes. The instruments were based on…

Descriptors: Foreign Countries, Questionnaires, Problem Solving, Science Instruction

The Effect of Mode of Response on a Semidirect Test of Oral Proficiency

Peer reviewed

Direct link

Kiddle, Thom; Kormos, Judit – Language Assessment Quarterly, 2011

This article reports on a study conducted with 42 participants from a Chilean university, which aimed to determine the effect of mode of response on test performance and test-taker perception of test features by comparing a semidirect online version and a direct face-to-face version of a speaking test. Candidate performances on both test versions…

Descriptors: Student Attitudes, Test Theory, Foreign Countries, Evaluation Methods

The Reliability of Results from National Tests, Public Examinations, and Vocational Qualifications in England

Peer reviewed

Direct link

He, Qingping; Opposs, Dennis – Educational Research and Evaluation, 2012

National tests, public examinations, and vocational qualifications in England are used for a variety of purposes, including the certification of individual learners in different subject areas and the accountability of individual professionals and institutions. However, there has been ongoing debate about the reliability and validity of their…

Descriptors: Qualifications, Evidence, National Competency Tests, Foreign Countries

A Psychometric Study of the Infant and Toddler Intervals of the Social Emotional Assessment Measure

Peer reviewed

Direct link

Squires, Jane K.; Waddell, Misti L.; Clifford, Jantina R.; Funk, Kristin; Hoselton, Robert M.; Chen, Ching-I – Topics in Early Childhood Special Education, 2013

Psychometric and utility studies on Social Emotional Assessment Measure (SEAM), an innovative tool for assessing and monitoring social-emotional and behavioral development in infants and toddlers with disabilities, were conducted. The Infant and Toddler SEAM intervals were the study focus, using mixed methods, including item response theory…

Descriptors: Psychometrics, Evaluation Methods, Social Development, Emotional Development

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Investigating a Judgemental Rank-Ordering Method for Maintaining Standards in UK Examinations

Peer reviewed

Direct link

Black, Beth; Bramley, Tom – Research Papers in Education, 2008

A new judgemental method of equating raw scores on two tests, based on rank-ordering scripts from both tests, has been developed by Bramley. The rank-ordering method has potential application as a judgemental standard-maintaining mechanism, because given a mark on one test (e.g. the A grade boundary mark), the equivalent mark (i.e. at the same…

Descriptors: Foreign Countries, Equated Scores, Test Theory, Evaluative Thinking

Determining Validity in National Curriculum Assessments

Peer reviewed

Direct link

Stobart, Gordon – Educational Research, 2009

Background: Validity is a central concern in any assessment, though this has often not been made explicit in the UK assessment context. This article applies current validity theorising, largely derived from American formulations, to national curriculum assessments in England. Purpose: The aim is to consider validity arguments in relation to the…

Descriptors: National Curriculum, Foreign Countries, Elementary Secondary Education, Educational Policy

Previous Page | Next Page »

Pages: 1 | 2

Baird, Jo-Anne	1
Barnard, Jane	1
Beguin, A. A.	1
Black, Beth	1
Bramley, Tom	1
Carlman, Nancy	1
Cepni, Salih	1
Chen, Ching-I	1
Clifford, Jantina R.	1
Cresswell, Mike	1
Dassa, Clement	1
Demirtas Tolaman, Tugba	1
Fischer, Hans E.	1
Funk, Kristin	1
Gur Erdogan, Duygu	1
Hamilton, David	1
He, Qingping	1
Hoselton, Robert M.	1
Kaya Uyanik, Gulden	1
Kiddle, Thom	1
Kormos, Judit	1
Leung, Constant	1
Newton, Paul E.	1
Ohle, Annika	1
Opposs, Dennis	1
More ▼