Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 17 |
Descriptor
Statistical Analysis | 19 |
Equated Scores | 6 |
Item Response Theory | 6 |
Test Items | 6 |
Computation | 5 |
Data Analysis | 3 |
Data Collection | 3 |
Difficulty Level | 3 |
English (Second Language) | 3 |
Language Tests | 3 |
Mathematics Tests | 3 |
More ▼ |
Source
Educational Testing Service | 19 |
Author
Sinharay, Sandip | 5 |
Haberman, Shelby J. | 3 |
Jia, Yue | 2 |
Moses, Tim | 2 |
Rijmen, Frank | 2 |
Xu, Xueli | 2 |
Yan, Duanli | 2 |
Almond, Russell | 1 |
Brownstein, Beth | 1 |
Curley, Edward | 1 |
Deng, Weiling | 1 |
More ▼ |
Publication Type
Reports - Research | 12 |
Reports - Evaluative | 4 |
Numerical/Quantitative Data | 3 |
Reports - Descriptive | 2 |
Guides - Classroom - Learner | 1 |
Information Analyses | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 4 |
Elementary Education | 3 |
Higher Education | 3 |
Grade 4 | 2 |
Intermediate Grades | 2 |
Grade 10 | 1 |
Grade 11 | 1 |
Grade 12 | 1 |
Grade 8 | 1 |
Grade 9 | 1 |
High Schools | 1 |
More ▼ |
Audience
Location
Kentucky | 1 |
Texas | 1 |
Washington | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Livingston, Samuel A. – Educational Testing Service, 2014
This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…
Descriptors: Equated Scores, Scoring, Self Evaluation (Individuals), Scores
Use of Continuous Exponential Families to Link Forms via Anchor Tests. Research Report. ETS RR-11-11
Haberman, Shelby J.; Yan, Duanli – Educational Testing Service, 2011
Continuous exponential families are applied to linking test forms via an internal anchor. This application combines work on continuous exponential families for single-group designs and work on continuous exponential families for equivalent-group designs. Results are compared to those for kernel and equipercentile equating in the case of chained…
Descriptors: Equated Scores, Statistical Analysis, Language Tests, Mathematics Tests
Sinharay, Sandip; Haberman, Shelby J.; Jia, Helena – Educational Testing Service, 2011
Standard 3.9 of the "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council for Measurement in Education, 1999) demands evidence of model fit when an item response theory (IRT) model is used to make inferences from a data set. We applied two recently…
Descriptors: Item Response Theory, Goodness of Fit, Statistical Analysis, Language Tests
Jia, Yue; Stokes, Lynne; Harris, Ian; Wang, Yan – Educational Testing Service, 2011
Estimation of parameters of random effects models from samples collected via complex multistage designs is considered. One way to reduce estimation bias due to unequal probabilities of selection is to incorporate sampling weights. Many researchers have been proposed various weighting methods (Korn, & Graubard, 2003; Pfeffermann, Skinner,…
Descriptors: Computation, Statistical Bias, Sampling, Statistical Analysis
Hsieh, Chueh-an; Xu, Xueli; von Davier, Matthias – Educational Testing Service, 2010
This paper presents an application of a jackknifing approach to variance estimation of ability inferences for groups of students, using a multidimensional discrete model for item response data. The data utilized to demonstrate the approach come from the National Assessment of Educational Progress (NAEP). In contrast to the operational approach…
Descriptors: National Competency Tests, Reading Tests, Grade 4, Computation
Sinharay, Sandip; Haberman, Shelby – Educational Testing Service, 2011
Recently, the literature has seen increasing interest in subscores for their potential diagnostic values; for example, one study suggested the report of weighted averages of a subscore and the total score, whereas others showed, for various operational and simulated data sets, that weighted averages, as compared to subscores, lead to more accurate…
Descriptors: Equated Scores, Weighted Scores, Tests, Statistical Analysis
Walker, Michael E.; Kim, Sooyeon – Educational Testing Service, 2010
This study examined the use of an all multiple-choice (MC) anchor for linking mixed format tests containing both MC and constructed-response (CR) items, in a nonequivalent groups design. An MC-only anchor could effectively link two such test forms if either (a) the MC and CR portions of the test measured the same construct, so that the MC anchor…
Descriptors: Equated Scores, Test Format, Multiple Choice Tests, Statistical Analysis
Xu, Xueli; Jia, Yue – Educational Testing Service, 2011
Estimation of item response model parameters and ability distribution parameters has been, and will remain, an important topic in the educational testing field. Much research has been dedicated to addressing this task. Some studies have focused on item parameter estimation when the latent ability was assumed to follow a normal distribution,…
Descriptors: Test Items, Statistical Analysis, Computation, Item Response Theory
Haberman, Shelby J.; Sinharay, Sandip; Lee, Yi-Hsuan – Educational Testing Service, 2011
Providing information to test takers and test score users about the abilities of test takers at different score levels has been a persistent problem in educational and psychological measurement (Carroll, 1993). Scale anchoring (Beaton & Allen, 1992), a technique that describes what students at different points on a score scale know and can do,…
Descriptors: Statistical Analysis, Scores, Regression (Statistics), Item Response Theory
Ling, Guangming; Rijmen, Frank – Educational Testing Service, 2011
The factorial structure of the Time Management (TM) scale of the Student 360: Insight Program (S360) was evaluated based on a national sample. A general procedure with a variety of methods was introduced and implemented, including the computation of descriptive statistics, exploratory factor analysis (EFA), and confirmatory factor analysis (CFA).…
Descriptors: Time Management, Measures (Individuals), Statistical Analysis, Factor Analysis
Frankel, Lois; Brownstein, Beth – Educational Testing Service, 2016
The work described in this report is the second phase of a project to provide easy-to-use tools for authoring and rendering secondaryschool algebra-levelmath expressions insynthesized speech that is useful for studentswithblindnessor lowvision.This report describes the development and results of the second feedback study performed for our project,…
Descriptors: Visual Impairments, Blindness, Assistive Technology, Feedback (Response)
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Educational Testing Service, 2010
In the equating literature, a recurring concern is that equating functions that utilize a single anchor to account for examinee groups' nonequivalence are biased when the groups are extremely different and/or when the anchor only weakly measures what the tests measure. Several proposals have been made to address this equating bias by incorporating…
Descriptors: Equated Scores, Data Collection, Statistical Analysis, Differences
Rijmen, Frank – Educational Testing Service, 2010
As is the case for any statistical model, a multidimensional latent growth model comes with certain requirements with respect to the data collection design. In order to measure growth, repeated measurements of the same set of individuals are required. Furthermore, the data collection design should be specified such that no individual is given the…
Descriptors: Tests, Statistical Analysis, Models, Measurement
Educational Testing Service, 2010
This document describes the breadth of the research that the ETS (Educational Testing Service) Research & Development division is conducting in 2010. This portfolio will be updated in early 2011 to reflect changes to existing projects and new projects that were added after this document was completed. The research described in this portfolio falls…
Descriptors: Portfolios (Background Materials), Testing Programs, Educational Testing, Private Agencies
Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Descriptors: Test Bias, Statistical Analysis, Computation, Scores
Previous Page | Next Page ยป
Pages: 1 | 2