ERIC - Search Results


Showing 1 to 15 of 26 results

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Aligning Test Scoring Procedures with Test Uses of the Early Grade Mathematics Assessment: A Balancing Act

Peer reviewed

Ketterlin-Geller, Leanne R.; Perry, Lindsey; Platas, Linda M.; Sitbakhan, Yasmin – Global Education Review, 2018

Test scoring procedures should align with the intended uses and interpretations of test results. In this paper, we examine three test scoring procedures for an operational assessment of early numeracy, the Early Grade Mathematics Assessment (EGMA). The EGMA is an assessment that tests young children's foundational mathematics knowledge and has…

Descriptors: Alignment (Education), Scoring, Test Use, Mathematics Tests

"Quality Testing Standards" -- A Starter Kit for States. Version 6.17.2020

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…

Descriptors: Testing, Standards, Comparative Analysis, Test Content

A Critical Review of Five Commonly Used Social-Emotional and Behavioral Screeners for Elementary or Secondary Schools

Peer reviewed

Jenkins, Lyndsay N.; Demaray, Michelle K.; Wren, Nicole Smit; Secord, Stephanie M.; Lyell, Kelly M.; Magers, Amy M.; Setmeyer, Andrea J.; Rodelo, Carlota; Newcomb-McNeal, Ericka; Tennant, Jaclyn – Contemporary School Psychology, 2014

The goal of this paper was to critically review and evaluate five common social-emotional and behavioral screeners: Behavioral and Emotional Screening System (Kamphaus and Reynolds 2007), Behavior Intervention Monitoring Assessment System (McDougal et al. 2011), Social Skills Improvement System Performance Screening Guide (Elliott and Gresham…

Descriptors: Social Development, Emotional Development, Screening Tests, Scores

The Use of Test Scores from Large-Scale Assessment Surveys: Psychometric and Statistical Considerations

Peer reviewed

Braun, Henry; von Davier, Matthias – Large-scale Assessments in Education, 2017

Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT…

Descriptors: Scores, Test Use, Measurement, Psychometrics

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Consequences of Test Score Use as Validity Evidence: Roles and Responsibilities

Peer reviewed

Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009

This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…

Descriptors: Tests, Test Validity, Scores, Data Collection

Select Psychometric Properties and Predictive Validity of Scores on the SAT Writing Section

Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009

Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…

Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity

High-Stakes Testing in Education: Science and Practice in K-12 Settings

Bovaird, James A., Ed.; Geisinger, Kurt F., Ed.; Buckendahl, Chad W., Ed. – APA Books, 2011

Educational assessment and, more broadly, educational research in the United States have entered into an era characterized by a dramatic increase in the prevalence and importance of test score use in accountability systems. This volume covers a selection of contemporary issues about testing science and practice that impact the nation's public…

Descriptors: Graduate Students, Test Use, Student Placement, Educational Research

A Critical Review of the Literature on Kolb's Learning Style Inventory with Implications for Score Reliability.

Hwang, Dae-Yeop; Henson, Robin K. – 2002

The Learning Style Inventory (LSI; Kolb, 1976; 1985) is a commonly used measure of learning styles based on Kolb's Experiential Learning Model. The psychometric soundness of LSI scores has been critiqued historically. This study reviewed the literature on the LSI and evaluated the psychometric properties of Kolb's original and revised versions of…

Descriptors: Cognitive Style, Meta Analysis, Psychometrics, Reliability

Measurement Characteristics of the Perceived Adequacy of Resources Scale.

Peer reviewed

Burrell, Brenda; And Others – Educational and Psychological Measurement, 1995

The measurement characteristics of the Perceived Adequacy of Resources Scale, a measure of family functioning, were investigated. The reliability and validity of total and subtest scores were studied with 113 mothers. Results were generally favorable regarding the integrity of scores from the measure. (SLD)

Descriptors: Family Characteristics, Mothers, Psychometrics, Scores

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

The Emperor's Clothes: Assessing the Validity of Scores on the Tennessee Self-Concept Scale.

Peer reviewed

Bishop, Sheryl L.; And Others – Educational and Psychological Measurement, 1997

The psychometric analyses of a previous study of the Tennessee Self-Concept Scale were replicated in a study with 111 female nursing and medical educators. Results support the previous findings challenging the proposed theoretical structure but supporting the reliable measurement of some as-yet-unclear dimension by the instrument. (SLD)

Descriptors: College Faculty, Higher Education, Medical Education, Nursing

Reliability and Validity of Adolescents' Scores on the Body Esteem Scale.

Peer reviewed

Cecil, Heather; Stanley, Melinda A. – Educational and Psychological Measurement, 1997

The psychometric properties of the Body Esteem Scale (BES) were studied with 255 girls and boys in grades 5 through 12. Internal consistency was found for the gender-specific subscales. Results provide preliminary evidence that the BES may be a psychometrically defensible assessment of body esteem among adolescents. (SLD)

Descriptors: Adolescents, Body Image, Elementary Secondary Education, Psychometrics

A Psychometric Investigation of Scores on the Watson-Glaser Critical Thinking Appraisal New Form S.

Peer reviewed

Loo, S. Robert; Thorpe, Karran – Educational and Psychological Measurement, 1999

Used samples of 142 management and 123 nursing undergraduates to evaluate the psychometric properties and factor structure of the newly developed Form S (short form) of the Watson-Glaser Critical Thinking Appraisal (G. Watson and E. Glaser, 1964, 1994). Results provide only limited support for Form S, and further refinement is suggested. (SLD)

Descriptors: Administration, Critical Thinking, Higher Education, Nursing
