NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 72 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sainan Xu; Jing Lu; Jiwei Zhang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational…
Descriptors: Educational Assessment, Bayesian Statistics, Statistical Inference, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
White, John – London Review of Education, 2013
It is time to replace the examination regime at 16 and 18 by something more appropriate. The coalition government has been solidifying its place by its Baccalaureate reforms at both ages, but this is a move in quite the wrong direction. Whatever the wider purposes that the examination system may serve, its core aim is to find out how well students…
Descriptors: Student Evaluation, Evaluation Methods, Educational Testing, Testing Programs
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Schutz, Dick – Education Policy Analysis Archives, 2013
The commentary (1) uses the U. S. National Assessment of Educational Progress (NAEP) as a prototype for examining standardized reading achievement tests at the item level, and (2) sketches an alternative based on an initiative underway in the United Kingdom.
Descriptors: Educational Testing, Educational Change, Achievement Tests, Reading Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Marshall, Jeffery H.; Chinna, Ung; Hok, Ung Ngo; Tinon, Souer; Veasna, Meung; Nissay, Put – Educational Assessment, Evaluation and Accountability, 2012
The global spread of national assessment testing activities, and the growing pressure to move beyond basic measures of participation in educational monitoring, means that student achievement measures are likely to become increasingly relevant indicators of systemic progress in the developing world. Using data from the CESSP project in Cambodia,…
Descriptors: Foreign Countries, Academic Achievement, Developing Nations, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Bakker, Steven – Educational Measurement: Issues and Practice, 2012
A particular trait of the educational system under socialist reign was accountability at the input side--appropriate facilities, centrally decided curriculum, approved text-books, and uniformly trained teachers--but no control on the output. It was simply assumed that it met the agreed standards, which was, in turn, proven by the statistics…
Descriptors: Accountability, Social Problems, Ethics, Foreign Students
Peer reviewed Peer reviewed
Direct linkDirect link
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Peer reviewed Peer reviewed
Direct linkDirect link
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Bryant, Darren A.; Carless, David R. – Educational Research for Policy and Practice, 2010
The literature suggests that peer assessment contributes to the development of student learning and promotes ownership of assessment processes. These claims emerge from research conducted primarily in Western contexts. This exploratory paper reports on the perspectives that a class of Hong Kong primary school students and their teachers have on…
Descriptors: Feedback (Response), Peer Evaluation, Foreign Countries, Language Proficiency
Peer reviewed Peer reviewed
Direct linkDirect link
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
Descriptors: Word Problems (Mathematics), Probability, Automation, College Students
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5