Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016
Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…
Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory
Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008
Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…
Descriptors: Test Items, Disabilities, Test Construction, Testing Programs
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009
In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…
Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics
Napper, Lucy E.; Branson, Catherine M.; Fisher, Dennis G.; Reynolds, Grace L.; Wood, Michelle M. – Journal of Drug Education, 2008
This study examined the validity of a single-item measure of HIV risk stage of change that HIV prevention contractors were required to collect by the California State Office of AIDS. The single-item measure was compared to the more conventional University of Rhode Island Change Assessment (URICA). Participants were members of Los Angeles…
Descriptors: Testing Programs, Sexually Transmitted Diseases, Test Validity, Acquired Immunodeficiency Syndrome (AIDS)
Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008
Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…
Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement

Pipes, M. J. – Physics Education, 1981
Reviews various issues related to developing a single system of examining at 16+ to replace the General Certificate of Education and the Certificate of Secondary Education, including current assessment techniques and activities of groups involved in developing this common examination system. (SK)
Descriptors: Measurement Techniques, Science Education, Secondary Education, Secondary School Science

Glas, Cees A. W. – International Journal of Testing, 2002
"Test Scoring" provides insight into psychometric procedures as used by a professional testing company or in large-scale projects. The book contains an overview of standard test theory, a discussion of factor analytic theory, and an exploration of special applications and problems. (SLD)
Descriptors: Educational Testing, Factor Analysis, Measurement Techniques, Psychometrics

Rubin, Rebecca B. – Communication Education, 1984
Presents a short summary report of the results of a national survey, commissioned by the SCA Committee on Assessment and Testing, to determine the range and degree of assessment occurring in colleges and universities and to evaluate the methods used to assess the communication skills of college students. (PD)
Descriptors: College Students, Communication Skills, Educational Assessment, Evaluation Methods

Kelley, Paul R.; Schumacher, Charles F. – Evaluation and the Health Professions, 1984
The National Board of Medical Examiners uses the Rasch model to calibrate test items, maintain item banks, equate scores, and monitor the consistency of examiner item response patterns. The model is also being used in the study of patient management problems examinations, standard-setting, and computer-based examinations. (Author/BS)
Descriptors: Item Analysis, Item Banks, Latent Trait Theory, Mathematical Models
Alexander, Cordelia R. – New Directions for Testing and Measurement, 1983
This case study of testing in the Dallas Independent School District illuminates ways in which the testing program can support a district-wide instructional improvement process to provide a comprehensive and accurate measurement system for teaching. (Author/PN)
Descriptors: Case Studies, Elementary Secondary Education, Inservice Teacher Education, Instructional Improvement

Bennett, Garry – Educational Studies, 1989
Discusses the problems associated with the assessment of art and design. Identifies criterion-referenced assessment practices within the British General Certificate of Secondary Education (GCSE) program as central to the development of a system of examination which is more fair to participants. (KO)
Descriptors: Academic Achievement, Art Education, Criterion Referenced Tests, Design