ERIC - Search Results

Showing 1 to 15 of 49 results

Temporal Aggregation for the Synthetic Control Method

Peer reviewed

Liyang Sun; Eli Ben-Michael; Avi Feller – Grantee Submission, 2024

The synthetic control method (SCM) is a popular approach for estimating the impact of a treatment on a single unit with panel data. Two challenges arise with higher frequency data (e.g., monthly versus yearly): (1) achieving excellent pre-treatment fit is typically more challenging; and (2) overfitting to noise is more likely. Aggregating data…

Descriptors: Evaluation Methods, Comparative Analysis, Computation, Data Analysis

Towards Representation Learning for Weighting Problems in Design-Based Causal Inference

Peer reviewed

Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024

Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…

Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines

An Investigation into Weighting Problem in Norm-Referenced Grading System

Peer reviewed

Öztürk Gübes, Nese – Eurasian Journal of Educational Research, 2021

Purpose: In grading, one of the most common errors is made in combining two or more different test scores. This study aimed to investigate the agreement of grades calculated by weighting raw scores and standard scores. Research Methods: In this simulation study, data were simulated for midterm and final measurements. Nine conditions [3 (class…

Descriptors: Grading, Raw Scores, Weighted Scores, Norm Referenced Tests

Do Country Stereotypes Exist in PISA? A Clustering Approach for Large, Sparse, and Weighted Data

Saarela, Mirka; Kärkkäinen, Tommi – International Educational Data Mining Society, 2015

Certain stereotypes can be associated with people from different countries. For example, the Italians are expected to be emotional, the Germans functional, and the Chinese hard-working. In this study, we cluster all 15-year-old students representing the 68 different nations and territories that participated in the latest Programme for…

Descriptors: Weighted Scores, Stereotypes, Standardized Tests, Student Characteristics

Development of Visualization of Learning Outcomes Using Curriculum Mapping

Ikuta, Takashi; Gotoh, Yasushi – International Association for Development of the Information Society, 2012

Niigata University has started to develop the Niigata University Bachelor Assessment System (NBAS). The objective is to have groups of teachers belonging to educational programs discuss whether visualized learning outcomes are comprehensible. Discussions based on teachers' subjective judgments showed in general that visualized learning outcomes…

Descriptors: Foreign Countries, College Students, Visualization, Outcomes of Education

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Obtaining Rubric Weights for Assessments by More than One Lecturer Using a Pairwise Learning Model

Quevedo, J. R.; Montanes, E. – International Working Group on Educational Data Mining, 2009

Specifying the criteria of a rubric to assess an activity, establishing the different quality levels of proficiency of development and defining weights for every criterion is not as easy as one a priori might think. Besides, the complexity of these tasks increases when they involve more than one lecturer. Reaching an agreement about the criteria…

Descriptors: Data Analysis, Scoring Rubrics, Evaluation Criteria, Automation

Developing Form Assembly Specifications for Exams with Multiple Choice and Constructed Response Items: Balancing Reliability and Validity Concerns

Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010

The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…

Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity

The Reliability and Validity of Weighted Composite Scores.

Kane, Michael; Case, Susan – 2003

The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…

Descriptors: Reliability, Simulation, Validity, Weighted Scores

Validating Alternative Modes of Scoring for Coloured Progressive Matrices.

Razel, Micha; Eylon, Bat-Sheva – 1987

Conventional scoring of the Coloured Progressive Matrices (CPM) was compared with three methods of multiple weight scoring. The methods include: (1) theoretical weighting in which the weights were based on a theory of cognitive processing; (2) judged weighting in which the weights were given by a group of nine adult expert judges; and (3)…

Descriptors: Intelligence Tests, Measurement Techniques, Scoring, Test Validity

Mathematical Modelling with 9-Year-Olds

English, Lyn D.; Watters, James J. – International Group for the Psychology of Mathematics Education, 2005

This paper reports on the mathematical modelling of four classes of 4th-grade children as they worked on a modelling problem involving the selection of an Australian swimming team for the 2004 Olympics. The problem was implemented during the second year of the children's participation in a 3-year longitudinal program of modelling experiences…

Descriptors: Mathematical Models, Grade 4, Longitudinal Studies, Qualitative Research

Construction of Criterion Weights for the Selection of Tasks for Training in the United States Army Infantry School.

Tyler, Edward C. – 1981

By using the paired comparison methodology, it was possible to establish evaluative priorities, providing criterion weights which reflected the thoughts and feelings of an advisory committee consisting of experts in the task selection process. For the U.S. Army Infantry School, the weights can be used to stress higher weighted criterion results in…

Descriptors: Advisory Committees, Criteria, Curriculum Development, Job Analysis

Estimating Reliability of Factor Analytic Results.

Thompson, Bruce; Frankiewicz, Ronald G. – 1980

A procedure for estimating reliability in a factor analytic context, when reliability of the extracted factors is not an emphasis, is identified. The procedure is an extension of Dressel's work and might be applied in attitude measurement. It assesses how homogeneous the weighted original item responses are, when they are scored for pattern…

Descriptors: Attitude Measures, Error of Measurement, Factor Analysis, Measures (Individuals)

Use of Observed, True, and Scale Variability in Combining Students' Scores in Grading.

Thayer, Jerome D. – 1991

Combining student scores to form subtotals and finally a total score to determine a grade is discussed. The composite score reached by combining measures or subtotals is only valid when the scores are combined so that the actual weight of each measure or subtotal in the total score is the same as the intended weight. Three types of variability…

Descriptors: Academic Achievement, Elementary Secondary Education, Grading, Mathematical Models
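
Illustrative note (not part of the ERIC record): the abstract above says a composite is valid only when each measure's actual weight matches its intended weight. The sketch below shows one common way to see the gap. It treats a component's "actual weight" as its share of the composite's variance, which is an assumption here and not necessarily the definition used in the paper. The scores, the function names, and the 50/50 nominal weights are all hypothetical.

```python
from statistics import mean, pstdev


def covariance(xs, ys):
    """Population covariance of two equal-length score lists."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def effective_weights(components, nominal_weights):
    """Share of composite variance contributed by each component.

    Assumed measure: w_i * cov(X_i, C) / var(C), where C is the
    weighted sum of the components. The shares always sum to 1.
    """
    composite = [
        sum(w * score for w, score in zip(nominal_weights, row))
        for row in zip(*components)
    ]
    var_c = covariance(composite, composite)
    return [
        w * covariance(x, composite) / var_c
        for w, x in zip(nominal_weights, components)
    ]


def standardize(xs):
    """Convert raw scores to z-scores (mean 0, standard deviation 1)."""
    m, sd = mean(xs), pstdev(xs)
    return [(x - m) / sd for x in xs]


# Hypothetical class of five students: the midterm is widely spread,
# the final is tightly clustered.
midterm = [40, 55, 70, 85, 100]
final = [70, 74, 72, 78, 76]
nominal = [0.5, 0.5]  # intended weights

raw_shares = effective_weights([midterm, final], nominal)
z_shares = effective_weights([standardize(midterm), standardize(final)], nominal)

print("raw scores:     ", [round(s, 3) for s in raw_shares])
print("standard scores:", [round(s, 3) for s in z_shares])
```

With these made-up scores, the midterm supplies roughly 90% of the raw composite's variance despite the intended 50/50 split, because its spread is much larger. After standardizing, the two shares are equal.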

The Assessment of Partial Knowledge: The Use of the Credit Model in Scoring Multiple-Choice Items.

Smith, Richard M. – 1981

One of the recurrent themes of the psychometric literature has been the idea that the incorrect responses a person makes to test items contain information that might be useful in determining the person's position on the variable the items are intended to define. The "Partial Credit" model, a member of the family of latent trait models…

Descriptors: Algebra, High Schools, Latent Trait Theory, Multiple Choice Tests
