Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 23 |
Descriptor
Scoring | 62 |
Weighted Scores | 62 |
Test Validity | 21 |
Test Reliability | 20 |
Correlation | 17 |
Scoring Formulas | 13 |
Test Items | 13 |
Item Analysis | 12 |
Multiple Choice Tests | 11 |
Statistical Analysis | 11 |
Computer Assisted Testing | 10 |
More ▼ |
Source
Author
Attali, Yigal | 5 |
Bridgeman, Brent | 3 |
Echternacht, Gary | 3 |
Downey, Ronald G. | 2 |
Haladyna, Thomas M. | 2 |
Jackson, Rex | 2 |
Reilly, Richard R. | 2 |
Sinharay, Sandip | 2 |
Ahlgren, Andrew | 1 |
Alderton, David L. | 1 |
Arieli-Attali, Meirav | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 11 |
Postsecondary Education | 6 |
Secondary Education | 3 |
Elementary Education | 2 |
Elementary Secondary Education | 2 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Researchers | 2 |
Policymakers | 1 |
Location
Finland | 3 |
Australia | 2 |
Belgium | 2 |
Canada | 2 |
France | 2 |
Germany | 2 |
Ireland | 2 |
Italy | 2 |
Netherlands | 2 |
New Zealand | 2 |
Sweden | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Steven L. Wise; G. Gage Kingsbury; Meredith L. Langi – Applied Measurement in Education, 2023
Recent research has provided evidence that performance change during a student's test event can indicate the presence of test-taking disengagement. Meaningful performance change implies that some portions of the test event reflect assumed maximum performance better than others and, because disengagement tends to diminish performance,…
Descriptors: Tests, Weighted Scores, Test Wiseness, Scoring
Ramsay, James; Wiberg, Marie; Li, Juan – Journal of Educational and Behavioral Statistics, 2020
Ramsay and Wiberg used a new version of item response theory that represents test performance over nonnegative closed intervals such as [0, 100] or [0, n] and demonstrated that optimal scoring of binary test data yielded substantial improvements in point-wise root-mean-squared error and bias over number right or sum scoring. We extend these…
Descriptors: Scoring, Weighted Scores, Item Response Theory, Intervals
Soh, Kaycheng – Journal of Higher Education Policy and Management, 2017
World university rankings use the weight-and-sum approach to process data. Although this seems to pass the common sense test, it has statistical problems. In recent years, seven such problems have been uncovered: spurious precision, weight discrepancies, assumed mutual compensation, indictor redundancy, inter-system discrepancy, negligence of…
Descriptors: Reputation, Colleges, Evaluation Methods, Institutional Characteristics
Wasis; Kumaidi; Bastari; Mundilarto; Wintarti, Atik – Eurasian Journal of Educational Research, 2018
Purpose: This is a developmental research study that aims to develop a model of polytomous scoring based-on weighting for multiple correct items in the subject of physics. Weighting was analytically applied based on question complexity and imposed penalties on wrong answers. Research Methods: Within the development model, Fenrich's development…
Descriptors: Physics, Science Education, Scoring, Secondary School Students
Breyer, F. Jay; Rupp, André A.; Bridgeman, Brent – ETS Research Report Series, 2017
In this research report, we present an empirical argument for the use of a contributory scoring approach for the 2-essay writing assessment of the analytical writing section of the "GRE"® test in which human and machine scores are combined for score creation at the task and section levels. The approach was designed to replace a currently…
Descriptors: College Entrance Examinations, Scoring, Essay Tests, Writing Evaluation
Arieli-Attali, Meirav – ProQuest LLC, 2016
This dissertation investigated the feasibility of self-adapted testing (SAT) as a formative assessment tool with the focus on learning. Under two different orientation goals--to excel on a test (performance goal) or to learn from the test (learning goal)--I examined the effect of different scoring rules provided as interactive feedback, on test…
Descriptors: Adaptive Testing, Formative Evaluation, Feedback (Response), Scoring
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Chen, Jing; Zhang, Mo; Bejar, Isaac I. – ETS Research Report Series, 2017
Automated essay scoring (AES) generally computes essay scores as a function of macrofeatures derived from a set of microfeatures extracted from the text using natural language processing (NLP). In the "e-rater"® automated scoring engine, developed at "Educational Testing Service" (ETS) for the automated scoring of essays, each…
Descriptors: Computer Assisted Testing, Scoring, Automation, Essay Tests
Köhler, Hannah, Ed.; Weber, Sabine, Ed.; Brese, Falk, Ed.; Schulz, Wolfram, Ed.; Carstens, Ralph, Ed. – International Association for the Evaluation of Educational Achievement, 2018
The IEA's International Civic and Citizenship Education Study (ICCS) investigates the ways in which young people are prepared to undertake their roles as citizens in a range of countries in the second decade of the 21st century. ICCS 2016 is the second cycle of a study initiated in 2009. The ICCS 2016 user guide describes the content and format of…
Descriptors: Guides, Citizenship Education, Citizen Participation, Citizenship Responsibility
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Reform Support Network, 2012
This publication offers SEAs and LEAs steps for evaluating a Request for Proposal (RFP) that allows input from a wide range of participants and specific subject matter experts, permits flexibility and weighted scoring in appropriate areas and provides an objective and defensible process for determining the vendor finalist.
Descriptors: Program Proposals, Program Evaluation, Evaluation Methods, Weighted Scores
Powers, Donald E.; Escoffery, David S.; Duchnowski, Matthew P. – Applied Measurement in Education, 2015
By far, the most frequently used method of validating (the interpretation and use of) automated essay scores has been to compare them with scores awarded by human raters. Although this practice is questionable, human-machine agreement is still often regarded as the "gold standard." Our objective was to refine this model and apply it to…
Descriptors: Essays, Test Scoring Machines, Program Validation, Criterion Referenced Tests
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…
Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)
Greenberg, Kathleen Puglisi – Teaching of Psychology, 2012
The scoring instrument described in this article is based on a deconstruction of the seven sections of an American Psychological Association (APA)-style empirical research report into a set of learning outcomes divided into content-, expression-, and format-related categories. A double-weighting scheme used to score the report yields a final grade…
Descriptors: Scoring, Research Reports, Grading, Outcome Measures
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these tasks, this study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…
Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar