Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012
It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…
Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory
Kiddle, Thom; Kormos, Judit – Language Assessment Quarterly, 2011
This article reports on a study conducted with 42 participants from a Chilean university, which aimed to determine the effect of mode of response on test performance and test-taker perception of test features by comparing a semidirect online version and a direct face-to-face version of a speaking test. Candidate performances on both test versions…
Descriptors: Student Attitudes, Test Theory, Foreign Countries, Evaluation Methods
Audette, Jennifer Gail – ProQuest LLC, 2011
Purpose: International service-learning (ISL) is popular in higher education, and many physical therapy educational programs are adding ISL opportunities to their curricula because doing so aligns with student interest and the increasingly global nature of the profession. The faculty leading these experiences have not been studied. Nearly all…
Descriptors: Group Membership, Higher Education, Teaching Styles, Teacher Characteristics
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Herman, Geoffrey Lindsay – ProQuest LLC, 2011
Instructors in electrical and computer engineering and in computer science have developed innovative methods to teach digital logic circuits. These methods attempt to increase student learning, satisfaction, and retention. Although there are readily accessible and accepted means for measuring satisfaction and retention, there are no widely…
Descriptors: Grounded Theory, Delphi Technique, Concept Formation, Misconceptions

O'Grady, Kevin E.; Medoff, Deborah R. – Multivariate Behavioral Research, 1991
A procedure for evaluating a variety of rater reliability models is presented. A multivariate linear model is used to describe and assess a set of ratings. Parameters are represented in terms of a factor analytic model, and maximum likelihood methods test the model parameters. Illustrative examples are presented. (SLD)
Descriptors: Comparative Analysis, Correlation, Equations (Mathematics), Estimation (Mathematics)
Takala, Sauli – 1998
This paper discusses recent developments in language testing. It begins with a review of the traditional criteria that are applied to all measurement and outlines recent emphases that derive from the expanding range of stakeholders. Drawing on Alderson's seminal work, criteria are presented for evaluating communicative language tests. Developments…
Descriptors: Alternative Assessment, Communicative Competence (Languages), Comparative Analysis, Evaluation Criteria

Hills, John R.; And Others – Journal of Educational Measurement, 1988
Five methods of equating minimum-competency tests were compared using the Florida Statewide Student Assessment Test, Part II, for 1984 and 1986. Four of five methods yielded essentially comparable results for the highest scoring 84% of the students. Different lengths of anchor items were compared, using the concurrent item response theory equating…
Descriptors: Comparative Analysis, Equated Scores, Evaluation Methods, Graduation Requirements
Carlman, Nancy – 1985
A study examined whether Canadian twelfth grade students' papers would rate differently when they were written in different modes and whether there are significant differences between global (modified holistic) scores and rhetorical effectiveness (modified primary trait) scores for the same papers. Fifty students wrote on two transactional topics…
Descriptors: Comparative Analysis, Discourse Modes, Evaluation Methods, Foreign Countries
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level