NotesFAQContact Us
Collection
Advanced
Search Tips
Location
Indiana1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 26 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Platas, Linda M.; Sitbakhan, Yasmin – Global Education Review, 2018
Test scoring procedures should align with the intended uses and interpretations of test results. In this paper, we examine three test scoring procedures for an operational assessment of early numeracy, the Early Grade Mathematics Assessment (EGMA). The EGMA is an assessment that tests young children's foundational mathematics knowledge and has…
Descriptors: Alignment (Education), Scoring, Test Use, Mathematics Tests
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Peer reviewed Peer reviewed
Direct linkDirect link
Jenkins, Lyndsay N.; Demaray, Michelle K.; Wren, Nicole Smit; Secord, Stephanie M.; Lyell, Kelly M.; Magers, Amy M.; Setmeyer, Andrea J.; Rodelo, Carlota; Newcomb-McNeal, Ericka; Tennant, Jaclyn – Contemporary School Psychology, 2014
The goal of this paper was to critically review and evaluate five common social-emotional and behavioral screeners: Behavioral and Emotional Screening System (Kamphaus and Reynolds 2007), Behavior Intervention Monitoring Assessment System (McDougal et al. 2011), Social Skills Improvement System Performance Screening Guide (Elliott and Gresham…
Descriptors: Social Development, Emotional Development, Screening Tests, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Braun, Henry; von Davier, Matthias – Large-scale Assessments in Education, 2017
Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT…
Descriptors: Scores, Test Use, Measurement, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009
Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…
Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity
Bovaird, James A., Ed.; Geisinger, Kurt F., Ed.; Buckendahl, Chad W., Ed. – APA Books, 2011
Educational assessment and, more broadly, educational research in the United States have entered into an era characterized by a dramatic increase in the prevalence and importance of test score use in accountability systems. This volume covers a selection of contemporary issues about testing science and practice that impact the nation's public…
Descriptors: Graduate Students, Test Use, Student Placement, Educational Research
Hwang, Dae-Yeop; Henson, Robin K. – 2002
The Learning Style Inventory (LSI; Kolb, 1976; 1985 ) is a commonly used measure of learning styles based on Kolbs Experiential Learning Model. The psychometric soundness of LSI scores has been critiqued historically. This study reviewed the literature on the LSI and evaluated the psychometric properties of Kolbs original and revised versions of…
Descriptors: Cognitive Style, Meta Analysis, Psychometrics, Reliability
Peer reviewed Peer reviewed
Burrell, Brenda; And Others – Educational and Psychological Measurement, 1995
The measurement characteristics of the Perceived Adequacy of Resources Scale, a measure of family functioning, were investigated. The reliability and validity of total and subtest scores were studied with 113 mothers. Results were generally favorable regarding the integrity of scores from the measure. (SLD)
Descriptors: Family Characteristics, Mothers, Psychometrics, Scores
Peer reviewed Peer reviewed
Traub, Ross E. – Educational Measurement: Issues and Practice, 1997
Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)
Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics
Peer reviewed Peer reviewed
Bishop, Sheryl L.; And Others – Educational and Psychological Measurement, 1997
The psychometric analyses of a previous study of the Tennessee Self-Concept Scale were replicated in a study with 111 female nursing and medical educators. Results support the previous findings challenging the proposed theoretical structure but supporting the reliable measurement of some as-yet-unclear dimension by the instrument. (SLD)
Descriptors: College Faculty, Higher Education, Medical Education, Nursing
Peer reviewed Peer reviewed
Cecil, Heather; Stanley, Melinda A. – Educational and Psychological Measurement, 1997
The psychometric properties of the Body Esteem Scale (BES) were studied with 255 girls and boys in grades 5 through 12. Internal consistency was found for the gender-specific subscales. Results provide preliminary evidence that the BES may be a psychometrically defensible assessment of body esteem among adolescents. (SLD)
Descriptors: Adolescents, Body Image, Elementary Secondary Education, Psychometrics
Peer reviewed Peer reviewed
Loo, S. Robert; Thorpe, Karran – Educational and Psychological Measurement, 1999
Used samples of 142 management and 123 nursing undergraduates to evaluate the psychometric properties and factor structure of the newly developed Form S (short form) of the Watson-Glaser Critical Thinking Appraisal (G. Watson and E. Glaser, 1964, 1994). Results provide only limited support for Form S, and further refinement is suggested. (SLD)
Descriptors: Administration, Critical Thinking, Higher Education, Nursing
Previous Page | Next Page ยป
Pages: 1  |  2