ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Educational Testing	7
Error of Measurement	7
Test Reliability	7
Correlation	2
Evaluation Methods	2
Mathematical Models	2
Measurement Techniques	2
Simulation	2
Statistical Bias	2
Test Items	2
Achievement Tests	1
Adaptive Testing	1
Computer Assisted Testing	1
Computer Programs	1
Creative Thinking	1
Creativity Tests	1
Divergent Thinking	1
Educational Assessment	1
Educational Policy	1
Educational Researchers	1
Effect Size	1
Elementary Secondary Education	1
Evaluation	1
Evaluation Problems	1
High Stakes Tests	1
More ▼

Source

American Educational Research…	1
Educational Leadership	1
Journal of Experimental…	1
Journal of Teacher Education	1
ProQuest LLC	1

Author

Demetrulias, Diana Mayer	1
Feldt, Leonard S.	1
Gilmer, Jerry S.	1
Papay, John P.	1
Patience, Wayne M.	1
Popham, W. James	1
Reckase, Mark D.	1
Topczewski, Anna Marie	1
Williams, Richard H.	1
Zimmerman, Donald W.	1

Publication Type

Journal Articles	4
Reports - Research	3
Reports - Descriptive	2
Dissertations/Theses -…	1
Opinion Papers	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 3	1
Grade 4	1
Grade 5	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Stanford Achievement Tests

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

Direct link

Topczewski, Anna Marie – ProQuest LLC, 2013

Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

Descriptors: Item Response Theory, Scaling, Scores, Student Development

Unraveling Reliability

Peer reviewed

Direct link

Popham, W. James – Educational Leadership, 2009

If a person were to ask an educator to identify the two most important attributes of an education test, the response most certainly would be "validity and reliability." These two tightly wedded concepts have become icons in the field of education assessment. As far as validity is concerned, the term doesn't refer to the accuracy of a test. Rather,…

Descriptors: Educational Testing, Educational Assessment, Student Evaluation, Test Reliability

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

On the Virtues and Vices of the Standard Error of Measurement.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984

This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)

Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing

The Standard Errors of the Feldt-Gilmer Congeneric Reliability Coefficients: Iowa Testing Programs Occasional Papers. Number 31.

PDF pending restoration

Gilmer, Jerry S.; Feldt, Leonard S. – 1982

The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…

Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models

Is There Life Outside of Standard Deviations? Measuring Creativity in a Tests and Measurements Classroom.

Peer reviewed

Demetrulias, Diana Mayer – Journal of Teacher Education, 1980

Even in an educational tests and measurements classroom it is possible to stimulate student creativity and divergent thinking. Examples of projects and test constructions are given to provide food for thought for the creative teacher. (JN)

Descriptors: Creative Thinking, Creativity Tests, Divergent Thinking, Educational Testing

Operational Characteristics of a Rasch Model Tailored Testing Procedure when Program Parameters and Item Pool Attributes are Varied.

Download full text

Patience, Wayne M.; Reckase, Mark D. – 1979

Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing