Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Haberman, Shelby J.; Sinharay, Sandip – Educational Testing Service, 2011
Subscores are reported for several operational assessments. Haberman (2008) suggested a method based on classical test theory to determine if the true subscore is predicted better by the corresponding subscore or the total score. Researchers are often interested in learning how different subgroups perform on subtests. Stricker (1993) and…
Descriptors: True Scores, Test Theory, Prediction, Group Membership
Santelices, Maria Veronica; Ugarte, Juan Jose; Flotts, Paulina; Radovic, Darinka; Kyllonen, Patrick – Educational Testing Service, 2011
This paper presents the development and initial validation of new measures of critical thinking and noncognitive attributes that were designed to supplement existing standardized tests used in the admissions system for higher education in Chile. The importance of various facets of this process, including the establishment of technical rigor and…
Descriptors: Foreign Countries, College Entrance Examinations, Test Construction, Test Validity
Dorans, Neil J. – Educational Testing Service, 2010
Santelices and Wilson (2010) claimed to have addressed technical criticisms of Freedle (2003) presented in Dorans (2004a) and elsewhere. Santelices and Wilson's abstract claimed that their study confirmed that SAT[R] verbal items do function differently for African American and White subgroups. In this commentary, I demonstrate that the…
Descriptors: College Entrance Examinations, Verbal Tests, Test Bias, Test Items
Jia, Yue; Stokes, Lynne; Harris, Ian; Wang, Yan – Educational Testing Service, 2011
Estimation of parameters of random effects models from samples collected via complex multistage designs is considered. One way to reduce estimation bias due to unequal probabilities of selection is to incorporate sampling weights. Many researchers have proposed various weighting methods (Korn & Graubard, 2003; Pfeffermann, Skinner,…
Descriptors: Computation, Statistical Bias, Sampling, Statistical Analysis
Moses, Tim – Educational Testing Service, 2011
The purpose of this study was to consider the relationships of prediction, measurement, and scaling invariance when these invariances were simultaneously evaluated in psychometric test data. An approach was developed to evaluate prediction, measurement, and scaling invariance based on linear and nonlinear prediction, measurement, and scaling…
Descriptors: Prediction, Measurement, Scaling, Tests
Powers, Donald E.; Kim, Hae-Jin; Yu, Feng; Weng, Vincent Z.; VanWinkle, Waverely – Educational Testing Service, 2009
To facilitate the interpretation of test scores from the new TOEIC[R] (Test of English for International Communications[TM]) speaking and writing tests as measures of English-language proficiency, we administered a self-assessment inventory to TOEIC examinees in Japan and Korea, to gather their perceptions of their ability to perform a variety of…
Descriptors: English for Special Purposes, Language Tests, Writing Tests, Speech Tests
Steinberg, Jonathan; Cline, Frederick; Sawaki, Yasuyo – Educational Testing Service, 2011
This study examined the scores on a state standards-based Grade 5 Science assessment obtained by a group of students without learning disabilities who took the standard form of the test and by three groups of students with learning disabilities: one taking the standard form of the test without accommodations or modifications, a second taking the…
Descriptors: Learning Disabilities, State Standards, Educational Improvement, Science Tests
Young, John W.; Cline, Fred – Educational Testing Service, 2009
"High Schools That Work" (HSTW) is a school improvement initiative that was inaugurated by the Southern Regional Education Board (SREB) in 1987. The main purpose of this concurrent validity study is to evaluate one or more measures by investigating their relationship to other commonly used and established measures given at or about the…
Descriptors: Validity, Educational Improvement, Improvement Programs, High Schools
Ling, Guangming; Rijmen, Frank – Educational Testing Service, 2011
The factorial structure of the Time Management (TM) scale of the Student 360: Insight Program (S360) was evaluated based on a national sample. A general procedure with a variety of methods was introduced and implemented, including the computation of descriptive statistics, exploratory factor analysis (EFA), and confirmatory factor analysis (CFA).…
Descriptors: Time Management, Measures (Individuals), Statistical Analysis, Factor Analysis
Sinharay, Sandip – Educational Testing Service, 2010
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008) suggested a method based on classical test theory to determine whether subscores have added value over total scores. This paper provides a literature review and reports when subscores were found to have added value for…
Descriptors: Scores, Correlation, Reliability, Item Response Theory
Rijmen, Frank – Educational Testing Service, 2009
Three multidimensional item response theory (IRT) models for testlet-based tests are described. In the bifactor model (Gibbons & Hedeker, 1992), each item measures a general dimension in addition to a testlet-specific dimension. The testlet model (Bradlow, Wainer, & Wang, 1999) is a bifactor model in which the loadings on the specific dimensions…
Descriptors: Item Response Theory, Models, Graphs, Comparative Analysis
Deane, Paul – Educational Testing Service, 2011
This paper presents a socio-cognitive framework for connecting writing pedagogy and writing assessment with modern social and cognitive theories of writing. It focuses on providing a general framework that highlights the connections between writing competency and other literacy skills; identifies key connections between literacy instruction,…
Descriptors: Writing (Composition), Writing Evaluation, Writing Tests, Cognitive Ability
Stricker, Lawrence J.; Attali, Yigal – Educational Testing Service, 2010
The principal aims of this study, a conceptual replication of an earlier investigation of the TOEFL[R] computer-based test, or TOEFL CBT, in Buenos Aires, Cairo, and Frankfurt, were to assess test takers' reported acceptance of the TOEFL Internet-based test, or TOEFL iBT[TM], and its associations with possible determinants of this acceptance and…
Descriptors: Computer Attitudes, Questionnaires, Comparative Analysis, Foreign Countries