ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	29

Descriptor

Educational Testing	57
Evaluation Methods	57
Measurement Techniques	57
Educational Assessment	27
Measurement	17
Student Evaluation	17
Comparative Analysis	13
Evaluation Criteria	13
Psychometrics	13
Classification	12
Test Use	11
Criterion Referenced Tests	10
Models	10
Test Construction	10
Academic Achievement	9
Evaluation Problems	9
Testing Problems	9
Equated Scores	8
Standardized Tests	8
Standards	8
Test Interpretation	8
Accountability	7
Definitions	7
Educational Policy	7
Elementary Secondary Education	7
More ▼

Publication Type

Journal Articles	30
Opinion Papers	15
Reports - Evaluative	11
Reports - Descriptive	7
Information Analyses	4
Reports - Research	4
Speeches/Meeting Papers	3
Books	2
Dissertations/Theses -…	2
Guides - Non-Classroom	2
Historical Materials	2
Guides - Classroom - Teacher	1
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Elementary Secondary Education	18
Higher Education	5
Postsecondary Education	3
Elementary Education	2
Early Childhood Education	1
Secondary Education	1

Audience

Practitioners	3
Policymakers	1
Teachers	1

Location

United Kingdom	4
United States	4
United Kingdom (England)	3
United Kingdom (Wales)	2
Australia	1
California	1
Colombia	1
Florida	1
Indiana	1
Kansas	1
Thailand	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Elementary and Secondary…	1
Individuals with Disabilities…	1

Assessments and Surveys

SAT (College Admission Test)	4
Advanced Placement…	3
ACT Assessment	2
College Level Examination…	2
National Assessment of…	2
Collegiate Assessment of…	1
Defining Issues Test	1
Minnesota Multiphasic…	1
National Assessment of Adult…	1
Nelson Denny Reading Tests	1
Stanford Achievement Tests	1
Strong Campbell Interest…	1
Test of English as a Foreign…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards with or without Reservations

Showing 1 to 15 of 57 results Save | Export

A Case of Inconsistent Equatings: How the Man with Four Watches Decides What Time It Is

Peer reviewed

Direct link

Livingston, Samuel A.; Antal, Judit – Applied Measurement in Education, 2010

A simultaneous equating of four new test forms to each other and to one previous form was accomplished through a complex design incorporating seven separate equating links. Each new form was linked to the reference form by four different paths, and each path produced a different score conversion. The procedure used to resolve these inconsistencies…

Descriptors: Measurement Techniques, Measurement, Educational Assessment, Educational Testing

A Comparison of Computer-Based Classification Testing Approaches Using Mixed-Format Tests with the Generalized Partial Credit Model

Direct link

Kim, Jiseon – ProQuest LLC, 2010

Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…

Descriptors: Test Length, Computer Assisted Testing, Classification, Probability

The Long-Term Impacts of Teachers: Teacher Value-Added and Student Outcomes in Adulthood. NBER Working Paper No. 17699

Direct link

Raj Chetty; John N. Friedman; Jonah E. Rockoff – National Bureau of Economic Research, 2011

Are teachers' impacts on students' test scores ("value-added") a good measure of their quality? This question has sparked debate largely because of disagreement about (1) whether value-added (VA) provides unbiased estimates of teachers' impacts on student achievement and (2) whether high-VA teachers improve students' long-term outcomes.…

Descriptors: Academic Achievement, Scores, Teacher Effectiveness, Outcomes of Education

Comparability of Paper-and-Pencil and Computer-Based Cognitive and Non-Cognitive Measures in a Low-Stakes Testing Environment

Direct link

Rowan, Barbara E. – ProQuest LLC, 2010

Computerized versions of paper-and-pencil tests (PPT) have emerged over the past few decades, and some practitioners are using both formats concurrently. But computerizing a PPT may not yield equivalent scores across the two administration modes. Comparability studies are required to determine if the scores are equivalent before treating them as…

Descriptors: Computer Assisted Testing, Factor Structure, Program Effectiveness, Scores

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Evaluating the Rank-Ordering Method for Standard Maintaining

Peer reviewed

Direct link

Bramley, Tom; Gill, Tim – Research Papers in Education, 2010

The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…

Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries

Evidentiary Reasoning in Diagnostic Classification Models

Peer reviewed

Direct link

Levy, Roy – Measurement: Interdisciplinary Research and Perspectives, 2009

In "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art," Rupp and Templin (2008) undertake the ambitious task of providing a thorough portrait of the current state of diagnostic classification models (DCM). In this commentary, the author applauds Rupp and Templin for their…

Descriptors: Classification, Models, Evidence, Measurement

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

Contrasting Conceptions of Comparability

Peer reviewed

Direct link

Newton, Paul E. – Research Papers in Education, 2010

Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…

Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Direct link

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

Value-Added Measures of Education Performance: Clearing Away the Smoke and Mirrors. Policy Brief 10-4

Direct link

Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010

In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…

Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

Basic Skills Assessment

Peer reviewed

Direct link

Yin, Alexander C.; Volkwein, J. Fredericks – New Directions for Institutional Research, 2010

After surveying 1,827 students in their final year at eighty randomly selected two-year and four-year public and private institutions, American Institutes for Research (2006) reported that approximately 30 percent of students in two-year institutions and nearly 20 percent of students in four-year institutions have only basic quantitative…

Descriptors: Standardized Tests, Basic Skills, College Admission, Educational Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Measurement:…	12
Applied Measurement in…	3
New Directions for…	2
ProQuest LLC	2
Research Papers in Education	2
American Journal of Education	1
Applied Psychological…	1
Art Education	1
Educational Research and…	1
Educational Theory	1
Evaluation in Education:…	1
Journal for Research in…	1
Journal of Science and…	1
National Accessible Reading…	1
National Bureau of Economic…	1
National Center for Analysis…	1
National Center on…	1
National Center on…	1
Office of Education, United…	1
Policy Analysis for…	1
Policy Futures in Education	1
Psychology of Women Quarterly	1
Today's Education	1
Topics in Early Childhood…	1
United States Bureau of…	1
More ▼

Harris, Douglas N.	2
Newton, Paul E.	2
Volkwein, J. Fredericks	2
Yin, Alexander C.	2
Abedi, Jamal	1
Albus, Debra	1
Antal, Judit	1
Bagnato, Stephen J.	1
Baird, Jo-Anne	1
Baird, Leonard L., Ed.	1
Ban, Jae-Chun	1
Bechger, Timo	1
Bloom, Benjamin S.	1
Bos, Wilfried	1
Brain, George B.	1
Bramley, Tom	1
COMSTOCK, GEORGE	1
Carstensen, Claus H.	1
Cook, Linda L.	1
Crane, Robert	1
Cresswell, Mike	1
Crisp, Raymond D.	1
Dillon, Deborah R.	1
Edith A.	1
More ▼