ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	19

Descriptor

Test Reliability	68
Weighted Scores	68
Test Validity	43
Multiple Choice Tests	25
Correlation	21
Scoring	20
Scoring Formulas	19
Item Analysis	17
Test Items	14
Achievement Tests	13
Guessing (Tests)	13
Statistical Analysis	13
Test Construction	13
Measurement Techniques	11
Response Style (Tests)	9
Error of Measurement	8
Higher Education	8
Scores	8
Comparative Analysis	7
Factor Analysis	7
Predictive Validity	7
Test Interpretation	7
Confidence Testing	6
Evaluation Methods	6
Foreign Countries	6
More ▼

Publication Type

Reports - Research	27
Journal Articles	21
Reports - Evaluative	7
Speeches/Meeting Papers	7
Books	1
Collected Works - General	1
Collected Works - Proceedings	1
Non-Print Media	1
Reference Materials - General	1
Reports - Descriptive	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	5
Postsecondary Education	4
Elementary Secondary Education	3
High Schools	2
Secondary Education	2
Adult Education	1
Early Childhood Education	1
Elementary Education	1
Grade 4	1
Intermediate Grades	1
Kindergarten	1
More ▼

Audience

Researchers

Location

Portugal	2
United Kingdom (England)	2
Asia	1
Australia	1
Brazil	1
Colombia	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Ireland (Dublin)	1
Israel	1
Italy	1
Japan	1
Kansas	1
Kazakhstan	1
Michigan	1
Netherlands	1
Norway	1
Ohio	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	3
SAT (College Admission Test)	2
California Achievement Tests	1
Defining Issues Test	1
International Association for…	1
Program for International…	1
Progress in International…	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 68 results Save | Export

Weighting Opt-In Surveys to Accommodate the Effects of Nonresponse

Peer reviewed

Direct link

Ashani Jayasekera; Laura Stapleton – Society for Research on Educational Effectiveness, 2024

Background: A growing number of surveys are conducted online where respondents can choose to complete the questionnaire (Lehdonvirta et al., 2020). As respondents are self-selected, there is potential that the respondents will not be an accurate representation of the population. For example, white people are disproportionately more likely to…

Descriptors: Online Surveys, Test Construction, Test Validity, Test Reliability

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

A Guide for Setting the Cut-Scores to Minimize Weighted Classification Errors in Test Batteries

Peer reviewed

Direct link

Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017

In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…

Descriptors: Cutting Scores, Weighted Scores, Classification, Testing

Worth Weighting? How to Think about and Use Weights in Survey Experiments

Peer reviewed
PDF on ERIC

Download full text

Direct link

Luke W. Miratrix; Jasjeet S. Sekhon; Alexander G. Theodoridis; Luis F. Campos – Grantee Submission, 2018

The popularity of online surveys has increased the prominence of using weights that capture units' probabilities of inclusion for claims of representativeness. Yet, much uncertainty remains regarding how these weights should be employed in analysis of survey experiments: Should they be used or ignored? If they are used, which estimators are…

Descriptors: Online Surveys, Weighted Scores, Data Interpretation, Robustness (Statistics)

Improving Work Based Assessment: Addressing Grade Inflation Numerically or Pedagogically?

Peer reviewed
PDF on ERIC

Download full text

Robbins, Joy; Firth, Amanda; Evans, Maria – Practitioner Research in Higher Education, 2018

Work based assessment (WBA) is a common but contentious practice increasingly used to grade university students on professional degrees. A key issue in WBA is the potentially low assessment literacy of the assessors, which can lead to a host of unintended results, including grade inflation. We identified grade inflation in the WBA of the clinical…

Descriptors: Grade Inflation, Weighted Scores, Evaluation Methods, Evaluation Research

Reliability and Validity of International Large-Scale Assessment: Understanding IEA's Comparative Studies of Student Achievement. IEA Research for Education. Volume 10

Download full text

Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020

Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…

Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis

Aligning English Language Testing with Curriculum

Peer reviewed
PDF on ERIC

Download full text

Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016

Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…

Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction

Correcting for Sample Problems in PISA and the Improvement in Portuguese Students' Performance

Peer reviewed

Direct link

Freitas, Pedro; Nunes, Luís Catela; Balcão Reis, Ana; Seabra, Carmo; Ferro, Adriana – Assessment in Education: Principles, Policy & Practice, 2016

The results of large-scale international assessments such as Programme for International Student Assessment (PISA) have attracted a considerable attention worldwide and are often used by policy-makers to support educational policies. To ensure that the published results represent the actual population, these surveys go through a thorough scrutiny…

Descriptors: International Assessment, Student Characteristics, Weighted Scores, Evaluation Problems

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Developing a Weighted Measure of Speech Sound Accuracy

Peer reviewed

Direct link

Preston, Jonathan L.; Ramsdell, Heather L.; Oller, D. Kimbrough; Edwards, Mary Louise; Tobin, Stephen J. – Journal of Speech, Language, and Hearing Research, 2011

Purpose: To develop a system for numerically quantifying a speaker's phonetic accuracy through transcription-based measures. With a focus on normal and disordered speech in children, the authors describe a system for differentially weighting speech sound errors on the basis of various levels of phonetic accuracy using a Weighted Speech Sound…

Descriptors: Speech, Phonetics, Measures (Individuals), Weighted Scores

Assumptions of Multiple Regression: Correcting Two Misconceptions

Peer reviewed
PDF on ERIC

Download full text

Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013

In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…

Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables

Reliability and Perceived Pedagogical Utility of a Weighted Music Performance Assessment Rubric

Peer reviewed

Direct link

Latimer, Marvin E., Jr.; Bergee, Martin J.; Cohen, Mary L. – Journal of Research in Music Education, 2010

The purpose of this study was to investigate the reliability and perceived pedagogical utility of a multidimensional weighted performance assessment rubric used in Kansas state high school large-group festivals. Data were adjudicator rubrics (N = 2,016) and adjudicator and director questionnaires (N = 515). Rubric internal consistency was…

Descriptors: Music Activities, State Programs, Performance Based Assessment, Weighted Scores

Standardising Assessment to Meet Student Needs in Foreign Language Modules in a University Context: Is Standardisation Possible?

Peer reviewed

Direct link

Nunan, Anna – Language Learning in Higher Education, 2014

The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…

Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards

Developing Form Assembly Specifications for Exams with Multiple Choice and Constructed Response Items: Balancing Reliability and Validity Concerns

Download full text

Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010

The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…

Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity

Technical Adequacy of Early Numeracy Curriculum-Based Measurement in Kindergarten

Peer reviewed

Direct link

Martinez, Rebecca S.; Missall, Kristen N.; Graney, Suzanne Bamonto; Aricak, O. Tolga; Clarke, Ben – Assessment for Effective Intervention, 2009

The current study examines the technical adequacy of four Early Numeracy Curriculum-Based Measurement (EN-CBM) screening tasks: "Oral Counting" (OC), "Number Identification" (NI), "Quantity Discrimination" (QD), and "Missing Number" (MN). Results from 59 kindergarten students assessed in the fall and spring reveal moderate to high test-retest and…

Descriptors: Curriculum Based Assessment, Numeracy, Predictive Validity, Kindergarten

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational and Psychological…	10
Journal of Educational…	6
Applied Psychological…	5
Applied Measurement in…	2
College Board	2
Grantee Submission	2
Assessment for Effective…	1
Assessment in Education:…	1
ETS Research Report Series	1
Evaluation and the Health…	1
International Association for…	1
International Association for…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Research in Music…	1
Journal of Speech, Language,…	1
Language Learning in Higher…	1
Measurement in Physical…	1
PROFILE: Issues in Teachers'…	1
Practical Assessment,…	1
Practitioner Research in…	1
Society for Research on…	1
More ▼

Echternacht, Gary	5
Reilly, Richard R.	4
Hendrickson, Gerry F.	3
Jackson, Rex	3
Downey, Ronald G.	2
Hendrickson, Amy	2
Patterson, Brian	2
Alexander G. Theodoridis	1
Aricak, O. Tolga	1
Ashani Jayasekera	1
Attali, Yigal	1
Balcão Reis, Ana	1
Barrett, Thomas J.	1
Bayuk, Robert J.	1
Beggs, Donald L.	1
Bejar, Issac I.	1
Bergee, Martin J.	1
Blakely, Craig H.	1
Bock, R. Darrell	1
Brown, James Dean	1
Cason, Gerald J.	1
Clarke, Ben	1
Claudy, John G.	1
Cohen, Mary L.	1
More ▼