NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)1
Since 2006 (last 20 years)19
What Works Clearinghouse Rating
Showing 1 to 15 of 72 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan – Language Testing, 2017
Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…
Descriptors: Language Tests, Equated Scores, Testing Programs, Comparative Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Yang, Ji Seung; Cai, Li – Journal of Educational and Behavioral Statistics, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yang, Ji Seung; Cai, Li – Grantee Submission, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM; Cai, 2008, 2010a, 2010b). Results indicate that the MH-RM algorithm can…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Yang, Ji Seung; Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013
The main purpose of this study is to improve estimation efficiency in obtaining full-information maximum likelihood (FIML) estimates of contextual effects in the framework of a nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM; Cai, 2008, 2010a, 2010b). Results indicate that the MH-RM…
Descriptors: Context Effect, Computation, Hierarchical Linear Modeling, Mathematics
Peer reviewed Peer reviewed
Direct linkDirect link
Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014
In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Program of International Student Achievement…
Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences
Rutkowski, David; Wild, Justin; Rutkowski, Leslie – Center for Evaluation and Education Policy, Indiana University, 2013
Are U.S. and, in particular, Hoosier students competitive and ready to succeed in an ever-changing and increasingly global economic landscape? This question is frequently considered by K-12 education stakeholders at all levels, including national, state, and local officials. One of the central ways in which education systems can compare themselves…
Descriptors: Mathematics Achievement, Science Achievement, International Programs, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Hardy, Ian – Journal of Education Policy, 2014
This paper explores how the strong policy push to improve students' results on national literacy and numeracy tests -- the National Assessment Program, Literacy and Numeracy (NAPLAN) -- in the Australian state of Queensland influenced schooling practices, including teachers' learning. The paper argues the focus upon improved test scores on NAPLAN…
Descriptors: Literacy, Numeracy, Foreign Countries, Standardized Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Thurlow, Martha L.; Albus, Debra A.; Lazarus, Sheryl S. – National Center on Educational Outcomes, 2015
Graduation requirements and diploma options for students with disabilities who participate in the general assessment has been a topic of interest for many years. The recent push for all students, including those with disabilities, to leave school ready for college and career has heightened the importance of understanding what states are requiring…
Descriptors: Synthesis, Disabilities, Educational Policy, Graduation Requirements
Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015
This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…
Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
von Davier, Alina A. – ETS Research Report Series, 2012
Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
McLaughlin, Joseph W.; Skaggs, Gary; Patterson, Margaret Becker – GED Testing Service, 2009
GED testing candidates have many options available to them to prepare for the GED Test, including adult education classes, practice tests, and self-study. This study focused on candidates who voluntarily took the GED Test and could choose freely among preparation activities. We examined GED Test preparation activities and created eight mutually…
Descriptors: Community Colleges, Testing, Public School Adult Education, Profiles
Di Giacomo, F. Tony; Fishbein, Bethany G.; Buckley, Vanessa W. – College Board, 2013
Many articles and reports have reviewed, researched, and commented on international assessments from the perspective of exploring what is relevant for the United States' education systems. Researchers make claims about whether the top-performing systems have transferable practices or policies that could be applied to the United States. However,…
Descriptors: Comparative Testing, International Assessment, Relevance (Education), Testing Programs
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5