Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Liu, Jinghua; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2014
Maintaining score interchangeability and scale consistency is crucial for any testing program that administers multiple forms across years. The use of a multiple linking design, which involves equating a new form to multiple old forms and averaging the conversions, has been proposed to control scale drift. However, the use of multiple linking…
Descriptors: Comparative Analysis, Reliability, Test Construction, Equated Scores
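The "averaging the conversions" step in the multiple linking design described above can be illustrated with a minimal Python sketch. It assumes each link has already produced a raw-to-scale conversion table of the same length; the function name `average_conversions` and the numbers are hypothetical and are not taken from the report.

```python
def average_conversions(conversions):
    """Average several raw-to-scale conversion tables point by point.

    `conversions` is a list of tables; table k holds the scaled score that
    link k assigns to each raw score 0..K on the new form (assumed layout).
    """
    if not conversions:
        raise ValueError("at least one conversion table is required")
    n_points = len(conversions[0])
    if any(len(table) != n_points for table in conversions):
        raise ValueError("all conversion tables must cover the same raw scores")
    n_links = len(conversions)
    # Unweighted mean across links at each raw score point.
    return [sum(table[i] for table in conversions) / n_links
            for i in range(n_points)]


# Hypothetical conversions of the same new form through two old forms.
via_old_form_a = [200, 210, 220]
via_old_form_b = [202, 214, 222]
averaged = average_conversions([via_old_form_a, via_old_form_b])
```

An unweighted mean is only one possible choice; an operational program might weight the links differently.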
LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan – Language Testing, 2017
Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…
Descriptors: Language Tests, Equated Scores, Testing Programs, Comparative Analysis
Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016
Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…
Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference
Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014
The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an SAT® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…
Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis
Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012
This study examines the stability of the SAT Reasoning Test™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…
Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores
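For readers unfamiliar with equipercentile equating, the general idea can be sketched in Python under simplifying assumptions: integer raw scores, mid-percentile ranks, and linear interpolation between adjacent scores. This is a toy illustration, not the procedure used in the study; the function names and frequencies are hypothetical, and operational equating typically adds presmoothing and more careful handling of the score range ends.

```python
def percentile_ranks(freqs):
    """Mid-percentile rank (0-100) for each integer score 0..K.

    freqs[s] is the number of examinees with raw score s.
    """
    total = sum(freqs)
    ranks, below = [], 0
    for f in freqs:
        # Count everyone below the score plus half of those at the score.
        ranks.append(100.0 * (below + f / 2) / total)
        below += f
    return ranks


def equipercentile(freq_x, freq_y):
    """Map each form X score to the form Y score with the same percentile rank."""
    pr_x = percentile_ranks(freq_x)
    pr_y = percentile_ranks(freq_y)
    top = len(pr_y) - 1
    equated = []
    for p in pr_x:
        if p <= pr_y[0]:          # below the lowest Y rank: clamp to 0
            equated.append(0.0)
        elif p >= pr_y[top]:      # above the highest Y rank: clamp to K
            equated.append(float(top))
        else:
            # First Y score whose rank reaches p, then interpolate linearly.
            j = next(k for k in range(1, top + 1) if pr_y[k] >= p)
            lo, hi = pr_y[j - 1], pr_y[j]
            equated.append(j - 1 + (p - lo) / (hi - lo))
    return equated


# Hypothetical score frequencies for two five-point forms.
same_form = equipercentile([1, 2, 3, 2, 1], [1, 2, 3, 2, 1])
```

When both forms have the same score distribution, the mapping reduces to the identity, which is a quick sanity check for any implementation.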
Liu, Jinghua; Curley, Edward; Low, Albert – ETS Research Report Series, 2009
This study examines the stability of the SAT® scale from 1994 to 2001. A 1994 form and a 2001 form were readministered in a 2005 SAT administration, and the 1994 form was equated to the 2001 form. The new conversion was compared to the old conversion. Both the verbal and math sections exhibit a similar degree of scale drift, but in opposite…
Descriptors: College Entrance Examinations, Scaling, Verbal Tests, Mathematics Tests
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
Liu, Jinghua; Zhu, Xiaowen – ETS Research Report Series, 2008
The purpose of this paper is to explore methods to approximate population invariance without conducting multiple linkings for subpopulations. Under the single group or equivalent groups design, no linking needs to be performed for the parallel-linear system linking functions. The unequated raw score information can be used as an approximation. For…
Descriptors: Raw Scores, Test Format, Comparative Analysis, Test Construction
Haberman, Shelby J.; Guo, Hongwen; Liu, Jinghua; Dorans, Neil J. – ETS Research Report Series, 2008
This study uses historical data to explore the consistency of SAT® I: Reasoning Test score conversions and to examine trends in scaled score means. During the period from April 1995 to December 2003, both Verbal (V) and Math (M) means display substantial seasonality, and a slight increasing trend for both is observed. SAT Math means increase more…
Descriptors: College Entrance Examinations, Thinking Skills, Logical Thinking, Scaling
Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007
This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…
Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods
ACT, Inc., 2005
One of the most challenging issues a state must resolve in designing a statewide standards and college readiness assessment is that of how student scores should be reported. The ACT is an effective and reliable measure of student readiness for college and work, but in some cases states may wish to augment the ACT with tests of their own design. In…
Descriptors: Academic Achievement, Raw Scores, Achievement Rating, School Readiness
Dorans, Neil J. – College Entrance Examination Board, 2000
Distinctions were made between three classes of statistical linkage: equivalence, concordance, and prediction. These distinctions were based on rational content considerations and empirical statistical relationships. A large database involving SAT I and ACT scores was used to determine which type of linkage was best suited for different scores and…
Descriptors: Statistical Analysis, Prediction, Scores, Standardized Tests
Schneider, Dianne; Dorans, Neil – College Entrance Examination Board, 1999
This paper describes how results on the ACT and SAT I can be compared through statistical linking procedures.
Descriptors: Individual Differences, Student Characteristics, Comparative Analysis, Scores