ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	9

Descriptor

Measurement Techniques	24
Testing Programs	24
Educational Assessment	9
Evaluation Methods	9
Student Evaluation	9
Academic Achievement	6
Elementary Secondary Education	6
Test Use	6
Psychometrics	5
Scoring	5
Test Validity	5
Performance Based Assessment	4
School Districts	4
Accountability	3
Educational Testing	3
Evaluation Research	3
Foreign Countries	3
Item Response Theory	3
Measures (Individuals)	3
Test Construction	3
Test Items	3
Test Reliability	3
Testing	3
Testing Problems	3
Achievement Tests	2
More ▼

Source

Educational Measurement:…	7
Applied Measurement in…	1
Assessing Writing	1
Assessment in Education:…	1
Communication Education	1
ETS Research Report Series	1
Educational Studies	1
Educational and Psychological…	1
Evaluation and the Health…	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of College Admission	1
Journal of Drug Education	1
Journal of Educational…	1
New Directions for Testing…	1
Physics Education	1
Principal Leadership	1
Yearbook of the National…	1
More ▼

Publication Type

Journal Articles	24
Reports - Evaluative	9
Reports - Descriptive	8
Opinion Papers	3
Reports - Research	3
Book/Product Reviews	2
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	4
Elementary Education	2
Higher Education	2
Adult Education	1
Grade 10	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Postsecondary Education	1
More ▼

Audience

Policymakers	1
Teachers	1

Location

United Kingdom (Great Britain)	2
California	1
Florida	1
Minnesota	1
Nebraska	1
New Zealand	1
Singapore	1
United States	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Graduate Record Examinations	1
National Assessment of…	1
Praxis Series	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Application of Best Linear Prediction and Penalized Best Linear Prediction to ETS Tests. Research Report. ETS RR-20-08

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2020

Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].

Descriptors: Prediction, Scores, Tests, Testing Programs

Bringing Consequences and Side Effects of Testing and Assessment to the Foreground

Peer reviewed

Direct link

Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016

Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…

Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation

Design of a Computer-Adaptive Test to Measure English Literacy and Numeracy in the Singapore Workforce: Considerations, Benefits, and Implications

Peer reviewed

Direct link

Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011

A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…

Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Universal Design and Multimethod Approaches to Item Review

Peer reviewed

Direct link

Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008

Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…

Descriptors: Test Items, Disabilities, Test Construction, Testing Programs

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

Construct Equivalence across Grades in a Vertical Scale for a K-12 Large-Scale Reading Assessment

Peer reviewed

Direct link

Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009

In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…

Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics

Assessing the Validity of a Single-Item HIV Risk Stage-of-Change Measure

Peer reviewed

Direct link

Napper, Lucy E.; Branson, Catherine M.; Fisher, Dennis G.; Reynolds, Grace L.; Wood, Michelle M. – Journal of Drug Education, 2008

This study examined the validity of a single-item measure of HIV risk stage of change that HIV prevention contractors were required to collect by the California State Office of AIDS. The single-item measure was compared to the more conventional University of Rhode Island Change Assessment (URICA). Participants were members of Los Angeles…

Descriptors: Testing Programs, Sexually Transmitted Diseases, Test Validity, Acquired Immunodeficiency Syndrome (AIDS)

Determining Sufficient Measurement Opportunities when Using Multiple Cut Scores

Peer reviewed

Direct link

Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008

Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…

Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement

A Single System of Examining at 16+.

Peer reviewed

Pipes, M. J. – Physics Education, 1981

Reviews various issues related to developing a single system of examining at 16+ to replace the General Certificate of Education and the Certificate of Secondary Education, including current assessment techniques and activities of groups involved in developing this common examination system. (SK)

Descriptors: Measurement Techniques, Science Education, Secondary Education, Secondary School Science

Educational Measurement in the Psychometric School of Frederick Lord and Darryl Bock.

Peer reviewed

Glas, Cees A. W. – International Journal of Testing, 2002

"Test Scoring" provides insight into psychometric procedures as used by a professional testing company or in large-scale projects. The book contains an overview of standard test theory, a discussion of factor analytic theory, and an exploration of special applications and problems. (SLD)

Descriptors: Educational Testing, Factor Analysis, Measurement Techniques, Psychometrics

Communication Assessment Instruments and Procedures in Higher Education.

Peer reviewed

Rubin, Rebecca B. – Communication Education, 1984

Presents a short summary report of the results of a national survey, commissioned by the SCA Committee on Assessment and Testing, to determine the range and degree of assessment occurring in colleges and universities and to evaluate the methods used to assess the communication skills of college students. (PD)

Descriptors: College Students, Communication Skills, Educational Assessment, Evaluation Methods

The Rasch Model: Its Use by the National Board of Medical Examiners.

Peer reviewed

Kelley, Paul R.; Schumacher, Charles F. – Evaluation and the Health Professions, 1984

The National Board of Medical Examiners uses the Rasch model to calibrate test items, maintain item banks, equate scores, and monitor the consistency of examiner item response patterns. The model is also being used in the study of patient management problems examinations, standard-setting, and computer-based examinations. (Author/BS)

Descriptors: Item Analysis, Item Banks, Latent Trait Theory, Mathematical Models

A Case Study: Testing in the Dallas Independent School District.

Alexander, Cordelia R. – New Directions for Testing and Measurement, 1983

This case study of testing in the Dallas Independent School District illuminates ways in which the testing program can support a district-wide instructional improvement process to provide a comprehensive and accurate measurement system for teaching. (Author/PN)

Descriptors: Case Studies, Elementary Secondary Education, Inservice Teacher Education, Instructional Improvement

Words in My Eyes--The Assessment of Art and Design in GCSE.

Peer reviewed

Bennett, Garry – Educational Studies, 1989

Discusses the problems associated with the assessment of art and design. Identifies criterion-referenced assessment practices within the British General Certificate of Secondary Education (GCSE) program as central to the development of a system of examination which is more fair to participants. (KO)

Descriptors: Academic Achievement, Art Education, Criterion Referenced Tests, Design

Previous Page | Next Page »

Pages: 1 | 2

Buckendahl, Chad W.	2
Plake, Barbara S.	2
Ackermann, Richard	1
Alexander, Cordelia R.	1
Bach, James V.	1
Bennett, Garry	1
Bottsford-Miller, Nicole A.	1
Branson, Catherine M.	1
Brown, Gavin T. L.	1
Eguez, Jane	1
Ercikan, Kadriye	1
Fisher, Dennis G.	1
Fitzpatrick, Anne R.	1
Foord, Kathleen A.	1
Ganguli, Debalina	1
Glas, Cees A. W.	1
Glasswell, Kath	1
Haberman, Shelby J.	1
Haertel, Edward H.	1
Harland, Don	1
Herman, Joan L.	1
Hubley, Anita M.	1
Impara, James C.	1
Ito, Kyoko	1
Jacobsen, Jared	1
More ▼