ERIC - Search Results

Showing 1 to 15 of 24 results

How Did Spain Perform in PISA 2018? New Estimates of Children's PISA Reading Scores

Peer reviewed

John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024

International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Evaluating Measurement Invariance of Students' Practices Regarding Online Information Questionnaire in PISA 2022: A Comparative Study Using MGCFA and Alignment Method

Peer reviewed

Esra Sözer Boz – Education and Information Technologies, 2025

International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…

Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students

The Effect of Multiple-Choice Test Items' Difficulty Degree on the Reliability Coefficient and the Standard Error of Measurement Depending on the Item Response Theory (IRT)

Peer reviewed

Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021

This study aims at identifying the effect of multiple-choice test items' difficulty degree on the reliability coefficient and the standard error of measurement depending on the item response theory IRT. To achieve the objectives of the study, (WinGen3) software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…

Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement

From OLS to Multilevel Multidimensional Mixture IRT: A Model Refinement Approach to Investigating Patterns of Relationships in PISA 2012 Data

Gulsah Gurkan – ProQuest LLC, 2021

Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…

Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

Reporting Error and Reliability to Test-Takers: An International Review

Peer reviewed

Bradshaw, Jenny; Wheater, Rebecca – Research Papers in Education, 2013

This review examined a range of approaches internationally to the reporting of assessment results for individual students, with a particular focus on how results are represented, the level of detail reported and the steps taken to quantify, report and explain error and uncertainty in the results' reports or certificates given to students in a…

Descriptors: Test Reliability, Error of Measurement, High Stakes Tests, Foreign Countries

New York State Testing Program 2016: English Language Arts and Mathematics Grades 3-8. Technical Report

New York State Education Department, 2016

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

New York State Testing Program 2014: English Language Arts and Mathematics Grades 3-8. Technical Report

New York State Education Department, 2014

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

The Application of Strength of Association Statistics to the Item Analysis of an In-Training Examination in Diagnostic Radiology.

Peer reviewed

Diamond, James J.; McCormick, Janet – Evaluation and the Health Professions, 1986

Using item responses from an in-training examination in diagnostic radiology, the application of a strength of association statistic to the general problem of item analysis is illustrated. Criteria for item selection, general issues of reliability, and error of measurement are discussed. (Author/LMO)

Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Graduate Medical Education

Some Relationships between the Binomial Error Model and Classical Test Theory.

Peer reviewed

Feldt, Leonard S. – Educational and Psychological Measurement, 1984

The binomial error model includes form-to-form difficulty differences as error variance and leads to Kuder-Richardson formula 21 as an estimate of reliability. If the form-to-form component is removed from the estimate of error variance, the binomial model leads to KR 20 as the reliability estimate. (Author/BW)

Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Mathematical Formulas
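
The two reliability estimates named in the abstract above can be compared numerically. The following is a minimal sketch using the standard textbook forms of KR-20 and KR-21, not code or data from Feldt's article. The function names, the use of population variance for total scores, and the small response matrix are all assumptions made for illustration.

```python
from statistics import fmean, pvariance


def kr20(responses):
    """KR-20 for a persons-by-items matrix of 0/1 item scores."""
    k = len(responses[0])                      # number of items
    totals = [sum(row) for row in responses]   # total score per person
    var_total = pvariance(totals)              # population variance (assumed convention)
    # Sum of item variances p_i * (1 - p_i), where p_i is the proportion correct.
    item_var = 0.0
    for j in range(k):
        p = fmean([row[j] for row in responses])
        item_var += p * (1 - p)
    return k / (k - 1) * (1 - item_var / var_total)


def kr21(responses):
    """KR-21: same as KR-20 but treats every item as equally difficult."""
    k = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = fmean(totals)
    var_total = pvariance(totals)
    return k / (k - 1) * (1 - mean_total * (k - mean_total) / (k * var_total))


# Hypothetical data: 5 examinees, 4 items of clearly unequal difficulty.
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

print("KR-20:", kr20(scores))
print("KR-21:", kr21(scores))
```

Because KR-21 replaces each item's variance with the variance implied by the mean item difficulty, it cannot exceed KR-20 on the same data, and the two agree only when all items are equally difficult. That gap corresponds to the difficulty-related component the abstract describes as being counted as error.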

The Reliability of the K-ABC for Hispanic and White Children: A Comparison by Year.

Hernandez, Arthur E.; Willson, Victor – 1984

Scores of two groups of White and Hispanic children at 11 age levels from 2.5 years to 12.5 years were assessed. The scores were drawn from the Kaufman Assessment Battery for Children (K-ABC), an individually administered assessment battery designed to measure intelligence and achievement and intended for minority group assessment. Reliability…

Descriptors: Achievement Tests, Elementary Education, Error of Measurement, Hispanic Americans

A Case in Support of Using Locally Developed Non-Normed Tests for Title I Program Evaluation.

Christie, Samuel G.; Conniff, William A. – 1981

Stockton Unified School District's successful strategy of using a locally developed, non-normed achievement test to implement the norm referenced model of the Title I Evaluation and Reporting System (Model A2) is described. Documented are the procedures involved in the development of a curriculum guide and test items, and the administration of…

Descriptors: Achievement Tests, Elementary Secondary Education, Error of Measurement, Norm Referenced Tests

Achievement Test Items--Methods of Study. CSE Monograph Series in Evaluation, 6.

Harris, Chester W.; And Others – 1977

The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…

Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies
