Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Educational Measurement:… | 32 |
Author
Publication Type
Journal Articles | 32 |
Reports - Evaluative | 32 |
Information Analyses | 2 |
Opinion Papers | 2 |
Speeches/Meeting Papers | 2 |
Education Level
Elementary Education | 1 |
Audience
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 3 |
California Achievement Tests | 1 |
Program for International… | 1 |
Progress in International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Sireci, Stephen G.; Suarez-Alvarez, Javier – Educational Measurement: Issues and Practice, 2022
The COVID-19 pandemic negatively affected the quality of data from educational testing programs. These data were previously used for many important purposes ranging from placing students in instructional programs to school accountability. In this article, we draw from the research design literature to point out the limitations inherent in…
Descriptors: Decision Making, Data Use, COVID-19, Pandemics
Arffman, Inga – Educational Measurement: Issues and Practice, 2013
The article reviews research and findings on problems and issues faced when translating international academic achievement tests. The purpose is to draw attention to the problems, to help to develop the procedures followed when translating the tests, and to provide suggestions for further research. The problems concentrate on the following: the…
Descriptors: Achievement Tests, Translation, Testing Problems, Test Construction
Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008
Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…
Descriptors: Test Items, Disabilities, Test Construction, Testing Programs
Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008
Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…
Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement

Hills, John R. – Educational Measurement: Issues and Practice, 1989
Test bias detection methods based on item response theory (IRT) are reviewed. Five such methods are commonly used: (1) equality of item parameters; (2) area between item characteristic curves; (3) sums of squares; (4) pseudo-IRT; and (5) one-parameter-IRT. A table compares these and six newer or less tested methods. (SLD)
Descriptors: Item Analysis, Test Bias, Test Items, Testing Programs

Madaus, George F. – Educational Measurement: Issues and Practice, 1992
The need for an independent mechanism that regulates, or audits, the testing enterprise is discussed along with a critique of current mechanisms for challenging a high-stakes test or its use and the need for independent auditing of the commercial test industry. Models for an auditing mechanism are reviewed. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Evaluation Methods, Higher Education

Phelps, Richard P. – Educational Measurement: Issues and Practice, 2000
Compiled information from 31 countries to study trends in large-scale testing. Shows a clear trend toward adding, not dropping, testing programs. Twenty-seven countries show a net increase in testing, while only three show a decrease. Fifty-nine testing programs have been added; only four have been dropped. (SLD)
Descriptors: Educational Trends, Foreign Countries, International Education, International Studies
Bandalos, Deborah L. – Educational Measurement: Issues and Practice, 2004
Recent implementation of Nebraska's Standards-based Teacher-led Assessment and Reporting System (STARS) introduced a unique opportunity to examine the benefits and drawbacks of a teacher-led state assessment system. STARS is unique among state assessment systems in that statewide tests are replaced by locally developed assessments designed by…
Descriptors: Federal Legislation, Testing Programs, State Standards, Academic Standards

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

Koretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program are in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment

Kingsbury, G. Gage – Educational Measurement: Issues and Practice, 1990
Application of computer-assisted testing (CAT) to measurement in the Portland (Oregon) Public Schools is described. Focus is on MicroCAT's developmental subsystem, which allows creation of items and test design specifications. Changes in CAT procedures allowing MicroCAT to function in existing testing systems and uses/limitations of MicroCAT…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Software Evaluation, Elementary Secondary Education

Baker, Frank B. – Educational Measurement: Issues and Practice, 1990
Four articles on computer-assisted testing in schools at local and state levels are reviewed. Trends in the application of and limitations/advantages of microcomputerized testing packages are described. (SLD)
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Elementary Secondary Education, Microcomputers

Green, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores

Educational Measurement: Issues and Practice, 1992
Eight criteria developed by the National Forum on Assessment (Washington, DC) as guidelines for evaluating existing and proposed assessment systems at any level are listed. Criteria call for clear definitions and fair assessments to assist educators and policymakers in improving instruction. Continuous review and improvement are recommended. (SLD)
Descriptors: Decision Making, Definitions, Educational Assessment, Educational Improvement

Buckendahl, Chad W.; Impara, James C.; Plake, Barbara S. – Educational Measurement: Issues and Practice, 2002
Proposed an accountably model that addresses the challenges of allowing school districts to choose the specific strategies they use to measure student performance and evaluated this model using data from multiple sources for all school districts in Florida. Findings identify three strategies that would be useful for this type of accountability…
Descriptors: Academic Achievement, Accountability, Comparative Analysis, Educational Assessment