Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 13 |
Descriptor
Scoring | 65 |
Testing Programs | 65 |
State Programs | 33 |
Higher Education | 18 |
Writing Evaluation | 18 |
Essay Tests | 16 |
Scores | 14 |
Test Construction | 14 |
Elementary Secondary Education | 12 |
Test Reliability | 12 |
Educational Testing | 11 |
Publication Type
Reports - Research | 65 |
Speeches/Meeting Papers | 24 |
Journal Articles | 17 |
Numerical/Quantitative Data | 6 |
Reports - Descriptive | 6 |
Tests/Questionnaires | 5 |
Information Analyses | 1 |
Opinion Papers | 1 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 7 |
Elementary Secondary Education | 5 |
Postsecondary Education | 5 |
Secondary Education | 3 |
Elementary Education | 1 |
Grade 4 | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Audience
Researchers | 7 |
Policymakers | 1 |
Practitioners | 1 |
Teachers | 1 |
Location
California | 7 |
Alabama | 1 |
Connecticut | 1 |
Denmark | 1 |
France | 1 |
Georgia | 1 |
Greece | 1 |
India | 1 |
Japan | 1 |
Nevada | 1 |
New Zealand | 1 |
Laws, Policies, & Programs
Comprehensive Education… | 1 |
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Akour, Mutasem; Sabah, Saed; Hammouri, Hind – Journal of Psychoeducational Assessment, 2015
The purpose of this study was to apply two types of Differential Item Functioning (DIF), net and global DIF, as well as the framework of Differential Step Functioning (DSF) to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…
Descriptors: Test Bias, Science Tests, Test Items, Scoring
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014
Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…
Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy
Mullis, Ina V. S., Ed.; Martin, Michael O., Ed. – International Association for the Evaluation of Educational Achievement, 2014
It is critical for countries to ensure that capable secondary school students receive further preparation in advanced mathematics and science, so that they are ready to enter challenging university-level studies that prepare them for careers in science, technology, engineering, and mathematics (STEM) fields. This group of students will become the…
Descriptors: Mathematics Tests, Science Tests, Educational Assessment, Secondary School Students
Gafoor, K. Abdul; Farooque, T. K. Umer – Online Submission, 2014
In view of the strengthening momentum of efforts to reform examinations in higher education in India, and in Kerala in particular, and holding that teacher education is in a privileged position to initiate examination reforms in higher education by virtue of its link with both school education and higher education, this paper focuses attention…
Descriptors: Preservice Teacher Education, Secondary School Curriculum, Testing Programs, Testing
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
Karkee, Thakur; Lewis, Daniel M.; Hoskens, Machteld; Yao, Lihua; Haug, Carolyn – 2003
Two methods to establish a common scale across grades within a content area using a common item design (separate and concurrent) have previously been studied under simulated conditions. Separate estimation is accomplished through separate calibration and grade-by-grade chained linking. Concurrent calibration established the vertical scale in a…
Descriptors: Estimation (Mathematics), Mathematics Tests, Scaling, Scoring
Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David M. – ETS Research Report Series, 2008
This report presents the results of a research and development effort for SpeechRater℠ Version 1.0 (v1.0), an automated scoring system for the spontaneous speech of English language learners used operationally in the Test of English as a Foreign Language™ (TOEFL®) Practice Online assessment (TPO). The report includes a summary of the validity…
Descriptors: Speech, Scoring, Scoring Rubrics, Scoring Formulas
Lampe, Richard E. – 1984
This study examines the accuracy of the self-scoring efforts of 306 eighth-graders on the Kuder General Interest Survey (GIS), and suggests possible methods to improve self-scoring accuracy. The GIS is widely used to assist junior high school students with their educational and vocational planning. After the administration of the test by English…
Descriptors: Interest Inventories, Junior High Schools, Profiles, Scoring
Shannon, Gregory A. – 1983
Rescoring of Center for Occupational and Professional Assessment objective-referenced tests is decided largely by content experts selected by client organizations. A few of the test items, statistically flagged for review, are not rescored. Some of this incongruence could be due to the use of the biserial correlation (r-biserial) as an…
Descriptors: Adults, Criterion Referenced Tests, Item Analysis, Occupational Tests