Showing all 12 results
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
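For context on the equating designs this abstract compares: under a nonequivalent-groups-with-anchor-test (NEAT) design, one common approach is chained linear equating through the anchor. A minimal sketch in Python, illustrative only; the function names and the choice of linear (rather than equipercentile) equating are assumptions, not the authors' procedure:

```python
import numpy as np

def linear_link(x, mu_from, sd_from, mu_to, sd_to):
    """Linear linking: match the means and SDs of two score scales."""
    return mu_to + (sd_to / sd_from) * (x - mu_from)

def chained_linear_equate(x, form_x, anchor_1, form_y, anchor_2):
    """Chained linear equating for a NEAT design (illustrative sketch).

    form_x, anchor_1: group 1's scores on new form X and the anchor V.
    form_y, anchor_2: group 2's scores on old form Y and the anchor V.
    """
    # Step 1: link form X to the anchor scale using group 1.
    v = linear_link(x, np.mean(form_x), np.std(form_x, ddof=1),
                       np.mean(anchor_1), np.std(anchor_1, ddof=1))
    # Step 2: link the anchor scale to form Y using group 2.
    return linear_link(v, np.mean(anchor_2), np.std(anchor_2, ddof=1),
                          np.mean(form_y), np.std(form_y, ddof=1))
```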
Xu, Zeyu; Nichols, Austin – National Center for Analysis of Longitudinal Data in Education Research, 2010
The gold standard in making causal inference on program effects is a randomized trial. Most randomization designs in education randomize classrooms or schools rather than individual students. Such "clustered randomization" designs have one principal drawback: They tend to have limited statistical power or precision. This study aims to…
Descriptors: Test Format, Reading Tests, Norm Referenced Tests, Research Design
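For context on the power limitation noted above: the precision loss from randomizing clusters rather than individuals is commonly summarized by the design effect, DEFF = 1 + (m - 1) x ICC, where m is cluster size and ICC the intraclass correlation. A minimal sketch with hypothetical numbers, not the study's data or code:

```python
import math
from scipy.stats import norm

def design_effect(cluster_size, icc):
    """Variance inflation from randomizing clusters instead of students."""
    return 1 + (cluster_size - 1) * icc

def mde(n_clusters, cluster_size, icc, sd=1.0, alpha=0.05, power=0.8):
    """Minimum detectable effect for a two-arm cluster-randomized trial
    (equal allocation, normal approximation)."""
    n_per_arm = (n_clusters / 2) * cluster_size
    se = sd * math.sqrt((2 / n_per_arm) * design_effect(cluster_size, icc))
    return (norm.ppf(1 - alpha / 2) + norm.ppf(power)) * se

# Example: 40 schools of 25 students each, ICC = 0.15 (hypothetical values)
# gives a minimum detectable effect of about 0.38 SD.
print(round(mde(40, 25, 0.15), 3))
```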
Peer reviewed
Craig, Pippa; Gordon, Jill; Clarke, Rufus; Oldmeadow, Wendy – Assessment & Evaluation in Higher Education, 2009
This study aimed to provide evidence to guide decisions on the type and timing of assessments in a graduate medical programme, by identifying whether students from particular degree backgrounds face greater difficulty in satisfying the current assessment requirements. We examined the performance rank of students in three types of assessments and…
Descriptors: Student Evaluation, Medical Education, Student Characteristics, Correlation
Zheng, Ying; Cheng, Liying; Klinger, Don A. – TESL Canada Journal, 2007
Large-scale testing in English affects second-language students not only strongly but also differently from first-language learners. The research literature reports that confounding factors in such testing, such as varying test formats, may differentially affect the performance of students from diverse backgrounds. An investigation of…
Descriptors: Reading Comprehension, Reading Tests, Test Format, Educational Testing
Peer reviewed
Saunders, Phillip – Journal of Economic Education, 1991
Discusses the content and cognitive specification of the third edition of the Test of Understanding in College Economics. Presents examples of the construction and sampling criteria employed in the latest and previous versions of the test. Explains that the test emphasizes recognition and understanding of basic terms, concepts, and principles with…
Descriptors: Economics Education, Educational Testing, Higher Education, Student Evaluation
Peer reviewed
Deville, Craig; O'Neill, Thomas; Wright, Benjamin D.; Woodcock, Richard W.; Munoz-Sandoval, Ana; Gershon, Richard C.; Bergstrom, Betty – Popular Measurement, 1998
Articles in this special section consider (1) flow in test taking (Craig Deville); (2) testwiseness (Thomas O'Neill); (3) test length (Benjamin Wright); (4) cross-language test equating (Richard W. Woodcock and Ana Munoz-Sandoval); (5) computer-assisted testing and testwiseness (Richard Gershon and Betty Bergstrom); and (6) Web-enhanced testing…
Descriptors: Computer Assisted Testing, Educational Testing, Equated Scores, Measurement Techniques
Peer reviewed
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Wainer, Howard; Thissen, David – 1994
When an examination consists, in whole or in part, of constructed-response test items, it is common practice to allow the examinee to choose a subset of the constructed-response questions from a larger pool. It is sometimes argued that, if choice were not allowed, the limitations on domain coverage forced by the small number of items might unfairly…
Descriptors: Constructed Response, Difficulty Level, Educational Testing, Equated Scores
Terwilliger, James S. – 1991
This paper clarifies important distinctions between item writing and item scoring and considers the implications of these distinctions for test-construction guidelines in teacher training. The terminology used to describe and classify paper-and-pencil test questions frequently confuses two distinct features of questions:…
Descriptors: Classroom Techniques, Educational Testing, Higher Education, Measurement Techniques
Gulliksen, Harold – 1985
This article presents the perspective that the quality of small-scale, teacher-made classroom tests has not improved and may have declined in recent years. This decline may stem from teachers coming to believe that the kinds of objective items used in national standardized tests are the only item types appropriate for classroom use.…
Descriptors: Adults, Classroom Techniques, Educational Testing, Educational Trends
Peer reviewed
Curren, Randall R. – Theory and Research in Education, 2004
This article addresses the capacity of high stakes tests to measure the most significant kinds of learning. It begins by examining a set of philosophical arguments pertaining to construct validity and alleged conceptual obstacles to attributing specific knowledge and skills to learners. The arguments invoke philosophical doctrines of holism and…
Descriptors: Test Items, Educational Testing, Construct Validity, High Stakes Tests
Peer reviewed
Ketterlin-Geller, Leanne R. – Journal of Technology, Learning, and Assessment, 2005
Universal design for assessment (UDA) is intended to increase participation of students with disabilities and English-language learners in general education assessments by addressing student needs through customized testing platforms. Computer-based testing provides an optimal format for creating individually tailored tests. However, although a…
Descriptors: Student Needs, Disabilities, Grade 3, Second Language Learning