Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 15 |
Descriptor
Measurement Techniques | 123 |
Testing Programs | 123 |
Elementary Secondary Education | 33 |
Educational Assessment | 32 |
Student Evaluation | 30 |
Evaluation Methods | 29 |
State Programs | 26 |
Test Construction | 25 |
Testing Problems | 24 |
Academic Achievement | 23 |
Test Validity | 21 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 8 |
Grade 4 | 4 |
Grade 6 | 4 |
Grade 8 | 4 |
Grade 3 | 3 |
Grade 5 | 3 |
Grade 7 | 3 |
Higher Education | 3 |
Elementary Education | 2 |
Adult Education | 1 |
Early Childhood Education | 1 |
More ▼ |
Location
Arizona | 5 |
Illinois (Chicago) | 5 |
Canada | 3 |
Georgia | 2 |
United Kingdom (Great Britain) | 2 |
United States | 2 |
Arizona (Mesa) | 1 |
California | 1 |
Connecticut | 1 |
Florida | 1 |
Indiana | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016
Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…
Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation
Schilder, Diane; Dahlin, Melissa – Center on Enhancing Early Learning Outcomes, 2014
In February 2014, a state department of education contacted the Center on Enhancing Early Learning Outcomes (CEELO) for support in informing the rebranding of their kindergarten readiness assessment instrument. This state's department of education would like to develop a plan for increasing the use of the early childhood assessment system among…
Descriptors: Kindergarten, School Readiness, Preschool Evaluation, State Departments of Education
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
In this technical report, we document the results of a cross-validation study designed to identify optimal cut-scores for the use of the easyCBM[R] mathematics test in the state of Washington. A large sample, randomly split into two groups of roughly equal size, was used for this study. Students' performance classification on the Washington state…
Descriptors: Testing Programs, Mathematics Tests, Prediction, Measurement Techniques
Park, Bitnara Jasmine; Irvin, P. Shawn; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical report presents results from a cross-validation study designed to identify optimal cut scores when using easyCBM[R] reading tests in Oregon. The cross-validation study analyzes data from the 2009-2010 academic year for easyCBM[R] reading measures. A sample of approximately 2,000 students per grade, randomly split into two groups of…
Descriptors: Testing Programs, Reading Tests, Prediction, Measurement Techniques
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008
Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…
Descriptors: Test Items, Disabilities, Test Construction, Testing Programs
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009
In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…
Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics
Russell, Michael; Kavanaugh, Maureen – IAP - Information Age Publishing, Inc., 2011
The importance of student assessment, particularly for summative purposes, has increased greatly over the past thirty years. At the same time, emphasis on including all students in assessment programs has also increased. Assessment programs, whether they are large-scale, district-based, or teacher developed, have traditionally attempted to assess…
Descriptors: Testing Accommodations, Testing Programs, Educational Assessment, Adaptive Testing
Napper, Lucy E.; Branson, Catherine M.; Fisher, Dennis G.; Reynolds, Grace L.; Wood, Michelle M. – Journal of Drug Education, 2008
This study examined the validity of a single-item measure of HIV risk stage of change that HIV prevention contractors were required to collect by the California State Office of AIDS. The single-item measure was compared to the more conventional University of Rhode Island Change Assessment (URICA). Participants were members of Los Angeles…
Descriptors: Testing Programs, Sexually Transmitted Diseases, Test Validity, Acquired Immunodeficiency Syndrome (AIDS)

Weinberger, Jo Ann – 1968
Due to the problems inherent in using a norm-reference test such as the Iowa Test of Basic Skills (ITBS) to determine pupil achievement, a comparison of ITBS with the Individually Prescribed Instruction (IPI) continuum and placement tests was undertaken. This yielded the following conclusions: Of the 136 items on test 1-A Arithmetic Concepts, 14…
Descriptors: Comparative Analysis, Measurement Techniques, Standardized Tests, Testing Programs
Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008
Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…
Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement