Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 21 |
Descriptor
Source
Author
| Cai, Li | 2 |
| Hansen, Mark | 2 |
| Li, Yuan H. | 2 |
| Li, Zhen | 2 |
| Monroe, Scott | 2 |
| Abbott, Martin L. | 1 |
| Albano, Anthony D. | 1 |
| Alonzo, Alicia C. | 1 |
| Anderson, Beverly L. | 1 |
| Anderson, Michael | 1 |
| Ariel, Adelaide | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 64 |
| Journal Articles | 27 |
| Speeches/Meeting Papers | 13 |
| Information Analyses | 3 |
| Tests/Questionnaires | 2 |
| Reference Materials -… | 1 |
| Reports - Evaluative | 1 |
Education Level
Audience
| Researchers | 2 |
Location
| Washington | 3 |
| Georgia | 2 |
| Virginia | 2 |
| Arizona | 1 |
| Arizona (Mesa) | 1 |
| Azerbaijan | 1 |
| Canada | 1 |
| China (Shanghai) | 1 |
| Finland | 1 |
| Florida | 1 |
| Greece | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 2 |
| Comprehensive Employment and… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020
Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…
Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making
Tindal, Gerald; Nese, Joseph F. T.; Stevens, Joseph J. – Educational Assessment, 2017
For the past decade, the accountability model associated with No Child Left Behind (NCLB) emphasized proficiency on end of year tests; with Every Student Succeeds Act (ESSA) the emphasis on proficiency within statewide testing programs, though now integrated with other measures of student learning, nevertheless remains a primary metric for…
Descriptors: Testing Programs, Middle School Students, Models, State Standards
Goldhaber, Dan; Cowan, James; Theobald, Roddy – Journal of Teacher Education, 2017
We use longitudinal data from Washington State to provide estimates of the extent to which performance on the edTPA, a performance-based, subject-specific assessment of teacher candidates, is predictive of the likelihood of employment in the teacher workforce and value-added measures of teacher effectiveness. While edTPA scores are highly…
Descriptors: Predictive Validity, Preservice Teachers, Preservice Teacher Education, Longitudinal Studies
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
It is a well-known problem in testing the fit of models to multinomial data that the full underlying contingency table will inevitably be sparse for tests of reasonable length and for realistic sample sizes. Under such conditions, full-information test statistics such as Pearson's X[superscript 2] and the likelihood ratio statistic G[superscript…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Simui, Francis; Chibale, Henry; Namangala, Boniface – Open Praxis, 2017
This paper focuses on the management of distance education examination in a lowly resourced North-Eastern region of Zambia. The study applies Hermeneutic Phenomenology approach to generate and make sense of the data. It is the lived experiences of 2 invigilators and 66 students purposively selected that the study draws its insights from. Meaning…
Descriptors: Distance Education, Phenomenology, Testing Programs, Testing
Chen, Qian – International Journal of Science and Mathematics Education, 2014
In this study, the Trends in International Mathematics and Science Study 2007 data were used to build mathematics achievement models of fourth graders in two East Asian school systems: Hong Kong and Singapore. In each school system, eight variables at student level and nine variables at school/class level were incorporated to build an achievement…
Descriptors: Foreign Countries, Mathematics Achievement, Grade 4, Mathematics Tests
Alonzo, Alicia C.; Ke, Li – Measurement: Interdisciplinary Research and Perspectives, 2016
A new vision of science learning described in the "Next Generation Science Standards"--particularly the science and engineering practices and their integration with content--pose significant challenges for large-scale assessment. This article explores what might be learned from advances in large-scale science assessment and…
Descriptors: Science Achievement, Science Tests, Group Testing, Accountability
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016
Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Sabatini, John; O'Reilly, Tenaha; Deane, Paul – ETS Research Report Series, 2013
This report describes the foundation and rationale for a framework designed to measure reading literacy. The aim of the effort is to build an assessment system that reflects current theoretical conceptions of reading and is developmentally sensitive across a prekindergarten to 12th grade student range. The assessment framework is intended to…
Descriptors: Reading Tests, Literacy, Models, Testing Programs
Liem, Gregory Arief D.; Martin, Andrew J.; Anderson, Michael; Gibson, Robyn; Sudmalis, David – Journal of Educational Psychology, 2014
Drawing on the Programme for International Student Assessment 2003 data set comprising over 190,000 15-year-old students in 25 countries, the current study sought to examine the role of arts-related information and communication technology (ICT) use in students' problem-solving skill and science and mathematics achievement. Structural equation…
Descriptors: Problem Solving, Science Achievement, Mathematics Achievement, Computer Use
Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014
In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Program of International Student Achievement…
Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences
Conti, Maria; LaMance, Rachel; Miller-Cochran, Susan – Composition Forum, 2017
To address the needs and interests of primary stakeholders in a writing program, this article presents a model of "grassroots" assessment that involves instructors from all ranks as well as students in the development, facilitation, and interpretation of assessment results. The authors describe two assessment plans that measured student…
Descriptors: Writing Improvement, Needs Assessment, Stakeholders, Student Needs
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement

Peer reviewed
Direct link
