Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 17 |
Descriptor
Error of Measurement | 18 |
Testing | 18 |
Test Construction | 8 |
Test Reliability | 8 |
Language Tests | 7 |
Scores | 7 |
Scoring | 7 |
Testing Programs | 7 |
English | 6 |
Grade 3 | 6 |
Item Response Theory | 6 |
More ▼ |
Source
Author
Publication Type
Reports - Descriptive | 18 |
Journal Articles | 10 |
Numerical/Quantitative Data | 6 |
Guides - General | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Education Level
Elementary Education | 6 |
Grade 3 | 6 |
Early Childhood Education | 5 |
Grade 4 | 5 |
Grade 5 | 5 |
Grade 6 | 5 |
Grade 7 | 5 |
Grade 8 | 5 |
Intermediate Grades | 5 |
Junior High Schools | 5 |
Middle Schools | 5 |
More ▼ |
Audience
Policymakers | 1 |
Practitioners | 1 |
Researchers | 1 |
Teachers | 1 |
Location
New York | 5 |
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
What Works Clearinghouse Rating
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Kane, Michael – Journal of Educational Measurement, 2011
Errors don't exist in our data, but they serve a vital function. Reality is complicated, but our models need to be simple in order to be manageable. We assume that attributes are invariant over some conditions of observation, and once we do that we need some way of accounting for the variability in observed scores over these conditions of…
Descriptors: Error of Measurement, Scores, Test Interpretation, Testing
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Solano-Flores, Guillermo; Li, Min – Educational Research and Evaluation, 2013
We discuss generalizability (G) theory and the fair and valid assessment of linguistic minorities, especially emergent bilinguals. G theory allows examination of the relationship between score variation and language variation (e.g., variation of proficiency across languages, language modes, and social contexts). Studies examining score variation…
Descriptors: Measurement, Testing, Language Proficiency, Test Construction
New York State Education Department, 2016
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
New York State Education Department, 2015
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Sijtsma, Klaas – Psychometrika, 2009
This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…
Descriptors: Measurement, Error of Measurement, Scores, Computation
New York State Education Department, 2014
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Bollen, Kenneth A.; Davis, Walter R. – Structural Equation Modeling: A Multidisciplinary Journal, 2009
We discuss the identification, estimation, and testing of structural equation models that have causal indicators. We first provide 2 rules of identification that are particularly helpful in models with causal indicators--the 2C emitted paths rule and the exogenous X rule. We demonstrate how these rules can help us distinguish identified from…
Descriptors: Structural Equation Models, Testing, Identification, Statistical Significance
Brockmann, Frank – Council of Chief State School Officers, 2011
State testing programs today are more extensive than ever, and their results are required to serve more purposes and high-stakes decisions than one might have imagined. Assessment results are used to hold schools, districts, and states accountable for student performance and to help guide a multitude of important decisions. This report describes…
Descriptors: Accuracy, Measurement, Testing, Expertise
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
A Generally Robust Approach for Testing Hypotheses and Setting Confidence Intervals for Effect Sizes
Keselman, H. J.; Algina, James; Lix, Lisa M.; Wilcox, Rand R.; Deering, Kathleen N. – Psychological Methods, 2008
Standard least squares analysis of variance methods suffer from poor power under arbitrarily small departures from normality and fail to control the probability of a Type I error when standard assumptions are violated. This article describes a framework for robust estimation and testing that uses trimmed means with an approximate degrees of…
Descriptors: Intervals, Testing, Least Squares Statistics, Effect Size
Barkaoui, Khaled – Canadian Modern Language Review, 2007
Essay tests are widely used to assess ESL/EFL learners' writing abilities for instructional, administrative, and research purposes. Relevant literature was searched to identify 70 empirical studies on ESL/EFL essay tests. The majority of these studies examined task, essay, and rater effects on essay rating and scores. Less attention has been given…
Descriptors: Essay Tests, Language Tests, English (Second Language), Second Language Learning
Previous Page | Next Page ยป
Pages: 1 | 2