ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	9

Descriptor

Educational Testing	27
Statistical Analysis	27
Test Reliability	27
Test Construction	13
Academic Achievement	8
Multiple Choice Tests	8
Scores	6
Test Interpretation	6
Test Validity	6
Achievement Tests	5
Item Analysis	5
Reading Tests	5
Testing Programs	5
Correlation	4
Curriculum Based Assessment	4
Evaluation Methods	4
Evaluation Research	4
Grade 5	4
Item Response Theory	4
Mathematical Models	4
Measurement Techniques	4
Reading Comprehension	4
Screening Tests	4
Standardized Tests	4
Teacher Effectiveness	4
More ▼

Source

Behavioral Research and…	4
Regional Educational…	2
Alberta Journal of…	1
Cogent Education	1
International Journal of…	1
ProQuest LLC	1

Publication Type

Reports - Research	12
Reports - Evaluative	5
Numerical/Quantitative Data	4
Journal Articles	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Information Analyses	1
Reference Materials -…	1
Reports - Descriptive	1

Education Level

Elementary Education	6
Elementary Secondary Education	6
High Schools	2
Middle Schools	2
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Higher Education	1

Audience

Researchers

Location

California	1
California (Stanford)	1
Canada	1
Colorado (Denver)	1
Ghana	1
Michigan	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

ACT Assessment	2
Dynamic Indicators of Basic…	2
Iowa Tests of Basic Skills	2
Preliminary Scholastic…	2
Stanford Achievement Tests	2

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Using Reliability and Item Analysis to Evaluate a Teacher-Developed Test in Educational Measurement and Evaluation

Peer reviewed

Direct link

Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017

Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…

Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

Download full text

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

Download full text

Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 6, Grade 3, Curriculum Based Assessment, Educational Testing

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States are increasingly interested in including measures of student achievement growth, or "value- added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot …

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. Summary. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of course tests. The evidence suggests that these methods can reliably distinguish…

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

The Impact of Student Ability and Method for Varying the Position of Correct Answers in Classroom Multiple-Choice Tests

Direct link

Joseph, Dane Christian – ProQuest LLC, 2010

Multiple-choice item-writing guideline research is in its infancy. Haladyna (2004) calls for a science of item-writing guideline research. The purpose of this study is to respond to such a call. The purpose of this study was to examine the impact of student ability and method for varying the location of correct answers in classroom multiple-choice…

Descriptors: Evidence, Test Format, Guessing (Tests), Program Effectiveness

An Index of Efficiency for Fixed-Length Mastery Tests.

Download full text

Harris, Chester W. – 1972

The efficiency of mastery tests of fixed length which sorts students into two categories is analyzed. For the sort of the students, an index, suggested by Fisher's linear discriminant function for two groups, is provided. (DB)

Descriptors: Educational Testing, Models, Statistical Analysis, Student Distribution

A Glossary of Measurement Terms Used in Title I Evaluation.

Download full text

Fortna, Richard O. – 1981

Measurement terms used in Title I evaluation are contained in this glossary. Several types of measurement techniques are identified and defined. Other measurement terms which are defined include those relating to validity, reliability, statistical analysis, test interpretation, and program effectiveness. (DWH)

Descriptors: Educational Testing, Evaluation Methods, Glossaries, Program Evaluation

An Investigation of the Accuracy of Alternative Methods of True Score Estimation in High-Stakes Mixed-Format Examinations.

Peer reviewed

Klinger, Don A.; Rogers, W. Todd – Alberta Journal of Educational Research, 2003

The estimation accuracy of procedures based on classical test score theory and item response theory (generalized partial credit model) were compared for examinations consisting of multiple-choice and extended-response items. Analysis of British Columbia Scholarship Examination results found an error rate of about 10 percent for both methods, with…

Descriptors: Academic Achievement, Educational Testing, Foreign Countries, High Stakes Tests

An Alternate Procedure to Obtain Ability Estimates in Latent Trait Models.

Houser, Ronald L.; And Others – 1983

This report describes a procedure that promises to improve the stability, accuracy, and efficiency of the employment of latent trait models and an application of the procedure to the Rasch model. Data were collected from the Portland Public Schools Level Tests administered to 25,740 students. Since each of the 173 items (chosen from the total…

Descriptors: Academic Achievement, Educational Testing, Item Banks, Latent Trait Theory

Technical Report of Selected Aspects of the 1969-70 Michigan Educational Assessment Program.

Download full text

Michigan State Dept. of Education, Lansing. – 1971

This report describes the development of the 1969-70 Michigan Educational Assessment measures used in assessing the levels and distribution of educational performance for Michigan's districts, schools, and pupils. The report has four sections. The first section contains a brief description of the 1969-70 assessment program, including a statement…

Descriptors: Achievement Tests, Attitude Measures, Educational Testing, Measurement Instruments

The Standard Errors of the Feldt-Gilmer Congeneric Reliability Coefficients: Iowa Testing Programs Occasional Papers. Number 31.

PDF pending restoration

Gilmer, Jerry S.; Feldt, Leonard S. – 1982

The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…

Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models

Previous Page | Next Page »

Pages: 1 | 2

Alonzo, Julie	4
Irvin, P. Shawn	4
Lai, Cheng-Fei	4
Park, Bitnara Jasmine	4
Tindal, Gerald	4
Booker, Kevin	2
Bruch, Julie	2
Gill, Brian	2
ANDRADE, MANUEL	1
Arhin, Ato Kwamina	1
Ekstrom, Ruth B.	1
Elias, Patricia J.	1
Feldt, Leonard S.	1
Fortna, Richard O.	1
Gilmer, Jerry S.	1
Harris, Chester W.	1
Hooper, Frank H.	1
Hopkins, Kenneth D.	1
Houser, Ronald L.	1
Joseph, Dane Christian	1
Klinger, Don A.	1
Lado, Robert	1
McDonald, Frederick J.	1
More ▼