Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 8 |
Descriptor
Source
Author
Avi Feller | 2 |
Cason, Gerald J. | 2 |
Kane, Michael | 2 |
Smith, Richard M. | 2 |
Abdel-fattah, Abdel-fattah A. | 1 |
Anderson, Karen M. | 1 |
Araki, Yoshikazu | 1 |
Barrett, Thomas J. | 1 |
Blakely, Craig H. | 1 |
Bloom, Allan M. | 1 |
Case, Susan | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 49 |
Reports - Research | 31 |
Reports - Evaluative | 12 |
Journal Articles | 5 |
Reports - Descriptive | 5 |
Guides - Classroom - Teacher | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 3 |
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Grade 4 | 1 |
Audience
Researchers | 6 |
Practitioners | 1 |
Teachers | 1 |
Laws, Policies, & Programs
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Liyang Sun; Eli Ben-Michael; Avi Feller – Grantee Submission, 2024
The synthetic control method (SCM) is a popular approach for estimating the impact of a treatment on a single unit with panel data. Two challenges arise with higher frequency data (e.g., monthly versus yearly): (1) achieving excellent pre-treatment fit is typically more challenging; and (2) overfitting to noise is more likely. Aggregating data…
Descriptors: Evaluation Methods, Comparative Analysis, Computation, Data Analysis
Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024
Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…
Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines
Öztürk Gübes, Nese – Eurasian Journal of Educational Research, 2021
Purpose: In grading, one of the most common errors is made in combining two or more different test scores. This study aimed to investigate the agreement of grades calculated by weighting raw scores and standard scores. Research Methods: In this simulation study, data were simulated for midterm and final measurements. Nine conditions [3 (class…
Descriptors: Grading, Raw Scores, Weighted Scores, Norm Referenced Tests
Saarela, Mirka; Kärkkäinen, Tommi – International Educational Data Mining Society, 2015
Certain stereotypes can be associated with people from different countries. For example, the Italians are expected to be emotional, the Germans functional, and the Chinese hard-working. In this study, we cluster all 15-year-old students representing the 68 different nations and territories that participated in the latest Programme for…
Descriptors: Weighted Scores, Stereotypes, Standardized Tests, Student Characteristics
Ikuta, Takashi; Gotoh, Yasushi – International Association for Development of the Information Society, 2012
Niigata University has started to develop the Niigata University Bachelor Assessment System (NBAS). The objective is to have groups of teachers belonging to educational programs discuss whether visualized learning outcomes are comprehensible. Discussions based on teachers' subjective judgments showed in general that visualized learning outcomes…
Descriptors: Foreign Countries, College Students, Visualization, Outcomes of Education
Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012
Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models
Quevedo, J. R.; Montanes, E. – International Working Group on Educational Data Mining, 2009
Specifying the criteria of a rubric to assess an activity, establishing the different quality levels of proficiency of development and defining weights for every criterion is not as easy as one a priori might think. Besides, the complexity of these tasks increases when they involve more than one lecturer. Reaching an agreement about the criteria…
Descriptors: Data Analysis, Scoring Rubrics, Evaluation Criteria, Automation
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Kane, Michael; Case, Susan – 2003
The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…
Descriptors: Reliability, Simulation, Validity, Weighted Scores
Razel, Micha; Eylon, Bat-Sheva – 1987
Conventional scoring of the Coloured Progressive Matrices (CPM) was compared with three methods of multiple weight scoring. The methods include: (1) theoretical weighting in which the weights were based on a theory of cognitive processing; (2) judged weighting in which the weights were given by a group of nine adult expert judges; and (3)…
Descriptors: Intelligence Tests, Measurement Techniques, Scoring, Test Validity
English, Lyn D.; Watters, James J. – International Group for the Psychology of Mathematics Education, 2005
This paper reports on the mathematical modelling of four classes of 4th-grade children as they worked on a modelling problem involving the selection of an Australian swimming team for the 2004 Olympics. The problem was implemented during the second year of the children's participation in a 3-year longitudinal program of modelling experiences…
Descriptors: Mathematical Models, Grade 4, Longitudinal Studies, Qualitative Research
Tyler, Edward C. – 1981
By using the paired comparison methodology, it was possible to establish evaluative priorities, providing criterion weights which reflected the thoughts and feelings of an advisory committee consisting of experts in the task selection process. For the U.S. Army Infantry School, the weights can be used to stress higher weighted criterion results in…
Descriptors: Advisory Committees, Criteria, Curriculum Development, Job Analysis
Thompson, Bruce; Frankiewicz, Ronald G. – 1980
A procedure for estimating reliability in a factor analytic context, when reliability of the extracted factors is not an emphasis, is identified. The procedure is an extension of Dressel's work and might be applied in attitude measurement. It assesses how homogeneous the weighted original item responses are, when they are scored for pattern…
Descriptors: Attitude Measures, Error of Measurement, Factor Analysis, Measures (Individuals)
Thayer, Jerome D. – 1991
Combining student scores to form subtotals and finally a total score to determine a grade is discussed. The composite score reached by combining measures or subtotals is only valid when the scores are combined so that the actual weight of each measure or subtotal in the total score is the same as the intended weight. Three types of variability…
Descriptors: Academic Achievement, Elementary Secondary Education, Grading, Mathematical Models
Smith, Richard M. – 1981
One of the recurrent themes of the psychometric literature has been the idea that the incorrect responses a person makes to test items contain information that might be useful in determining the person's position on the variable the items are intended to define. The "Partial Credit" model, a member of the family of latent trait models…
Descriptors: Algebra, High Schools, Latent Trait Theory, Multiple Choice Tests