Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Tindal, Gerald; Nese, Joseph F. T.; Stevens, Joseph J. – Educational Assessment, 2017
For the past decade, the accountability model associated with No Child Left Behind (NCLB) emphasized proficiency on end of year tests; with Every Student Succeeds Act (ESSA) the emphasis on proficiency within statewide testing programs, though now integrated with other measures of student learning, nevertheless remains a primary metric for…
Descriptors: Testing Programs, Middle School Students, Models, State Standards
Jiang, Feng; McComas, William F. – International Journal of Science Education, 2015
Gauging the effectiveness of specific teaching strategies remains a major topic of interest in science education. Inquiry teaching among others has been supported by extensive research and recommended by the National Science Education Standards. However, most of the empirical evidence in support was collected in research settings rather than in…
Descriptors: Inquiry, Active Learning, Science Instruction, Science Achievement
Benítez, Isabel; Padilla, José-Luis – Journal of Mixed Methods Research, 2014
Differential item functioning (DIF) can undermine the validity of cross-lingual comparisons. Although many efficient statistics for detecting DIF are available, few general findings explain DIF results. The objective of the article was to study DIF sources by using a mixed method design. The design involves a quantitative phase…
Descriptors: Foreign Countries, Mixed Methods Research, Test Bias, Cross Cultural Studies
Creagh, Sue – English Teaching: Practice and Critique, 2014
The Australian field of English as a Second Language (ESL) teaching is globally respected for its research and practice achievements over a period of some 30 years. However, this essential field of pedagogy is being diluted in the current Australian reform agenda which is firmly founded on a traditional vision of English as first language, and…
Descriptors: Foreign Countries, Standardized Tests, English (Second Language), Second Language Learning
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Unamma, Anthony Odera – Open Praxis, 2013
This research work was aimed at determining the degree of community members' interference in the conduct of university distance learning examination in South Eastern Nigeria. It was also aimed at finding out the factors responsible for the community members' interference, the ways by which interference is effected, the consequences and the…
Descriptors: Foreign Countries, Distance Education, Community Involvement, Testing Problems
McBee, Matthew T.; Peters, Scott J.; Waterman, Craig – Gifted Child Quarterly, 2014
Best practice in gifted and talented identification procedures involves making decisions on the basis of multiple measures. However, very little research has investigated the impact of different methods of combining multiple measures. This article examines the consequences of the conjunctive ("and"), disjunctive/complementary…
Descriptors: Best Practices, Ability Identification, Academically Gifted, Correlation
Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011
Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…
Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs
Edwards, Aretha Hargrove – ProQuest LLC, 2012
The purpose of this study was to examine Mississippi Delta area public high school counselors' and principals' perception of the impact of SATP2 testing on counselors' services to students in order to determine whether or not testing responsibilities have an adverse effect on counselors' delivery of services to students. This study was similar to…
Descriptors: Public Schools, High Schools, School Counselors, Principals
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
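As a rough illustration of the weighted average described in the abstract above, the following sketch blends the identity function with a linear (mean and standard deviation matching) equating function. The names `linear_equate` and `synthetic_link`, the choice of linear equating as the traditional method, the convention that `weight` applies to the equated score, and the sample scores are all assumptions made for this example; none of them come from the study itself.

```python
from statistics import mean, pstdev


def linear_equate(x, new_scores, ref_scores):
    """Map score x from the new form onto the reference form scale by
    matching the mean and standard deviation of the two score samples.
    (Assumed stand-in for "a traditional equating method".)"""
    slope = pstdev(ref_scores) / pstdev(new_scores)
    return mean(ref_scores) + slope * (x - mean(new_scores))


def synthetic_link(x, equated, weight):
    """Weighted average of an equated score and the identity (x itself).
    weight = 0 returns the identity; weight = 1 returns the equated score.
    (Which term carries the weight is an assumption of this sketch.)"""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * equated + (1.0 - weight) * x


# Hypothetical small samples from two test forms.
new_form = [10, 12, 14, 16, 18]
ref_form = [12, 15, 18, 21, 24]

raw = 16
equated = linear_equate(raw, new_form, ref_form)
blended = synthetic_link(raw, equated, weight=0.5)
print(equated, blended)
```

With a small equating sample, a weight closer to 0 leans on the identity and so limits the influence of a noisy equating estimate, while a weight closer to 1 trusts the equating function more.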
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Elementary School Students
Daniel, Tracy Demetrie – ProQuest LLC, 2012
Determining if the investment in educational technology will improve student achievement is complicated and multifarious. The purpose of this study was to evaluate the influence of teacher technology integration on student achievement as measured by the Mississippi Subject Area Testing Program (SATP) and to explore the relationship between…
Descriptors: Academic Achievement, High Stakes Tests, Educational Technology, Self Efficacy
Creagh, Sue – TESOL in Context, 2014
Teachers are now experiencing the age of quantitative test-driven assessment, in which there is little weight accorded to teacher-based judgement about student progress. In the Australian context, the NAPLaN test has become a driving force in school and teacher accountability. The language of NAPLaN is one of bands and numerical scores and…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Student Evaluation