ERIC - Search Results

Publication Date

In 2025	39
Since 2024	192
Since 2021 (last 5 years)	495
Since 2016 (last 10 years)	996
Since 2006 (last 20 years)	2028

Descriptor

Error of Measurement	3295
Statistical Analysis	599
Scores	504
Item Response Theory	445
Correlation	434
Comparative Analysis	422
Foreign Countries	415
Test Reliability	408
Computation	404
Simulation	370
Reliability	355
Sample Size	352
Models	351
Evaluation Methods	348
Test Items	345
Measurement Techniques	318
Factor Analysis	308
Sampling	300
Statistical Bias	299
Research Methodology	288
Goodness of Fit	258
Monte Carlo Methods	257
Psychometrics	257
Regression (Statistics)	246
Mathematical Models	241
More ▼

Author

Raykov, Tenko	23
Brennan, Robert L.	19
Kolen, Michael J.	19
Lord, Frederic M.	17
Thompson, Bruce	16
Zimmerman, Donald W.	16
Lee, Won-Chan	15
Livingston, Samuel A.	14
McCaffrey, Daniel F.	14
Yuan, Ke-Hai	14
van der Linden, Wim J.	14
Cai, Li	13
Moses, Tim	13
Beretvas, S. Natasha	12
Marsh, Herbert W.	12
Zwick, Rebecca	12
Algina, James	11
Ferron, John M.	11
Lee, Guemin	11
Lockwood, J. R.	11
Marcoulides, George A.	11
Reardon, Sean F.	11
DeMars, Christine E.	10
Henson, Robin K.	10
More ▼

Education Level

Higher Education	268
Secondary Education	196
Elementary Education	194
Postsecondary Education	194
Elementary Secondary Education	126
Middle Schools	96
High Schools	80
Junior High Schools	76
Early Childhood Education	61
Grade 4	48
Intermediate Grades	44
Primary Education	42
Grade 8	40
Grade 3	39
Grade 5	39
Grade 7	33
Kindergarten	24
Adult Education	23
Grade 6	19
Grade 2	17
Preschool Education	16
Grade 1	15
Grade 10	12
Grade 9	12
Two Year Colleges	6
More ▼

Audience

Researchers	93
Practitioners	23
Teachers	22
Policymakers	10
Administrators	5
Students	4
Counselors	2
Parents	2
Community	1

Location

United States	47
Germany	42
Australia	34
Canada	27
Turkey	27
California	22
United Kingdom (England)	20
Netherlands	18
China	16
New York	15
United Kingdom	15
Texas	14
North Carolina	13
Italy	12
South Korea	12
Florida	11
Indonesia	11
New Zealand	11
Pennsylvania	11
Japan	10
Spain	10
Taiwan	10
Iran	9
Norway	9
Portugal	9
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	11
Race to the Top	6
Elementary and Secondary…	4
Aid to Families with…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Family Educational Rights and…	1
Guaranteed Student Loan…	1
Head Start	1
Individuals with Disabilities…	1
Job Training Partnership Act…	1
Strengthening Career and…	1
More ▼

What Works Clearinghouse Rating

Does not meet standards

Showing 1,456 to 1,470 of 3,295 results Save | Export

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Qualification Users' Perceptions and Experiences of Assessment Reliability

Peer reviewed

Direct link

Chamberlain, Suzanne – Research Papers in Education, 2013

This paper presents the findings of a study designed to explore qualification users' perceptions and experiences of reliability in the context of national assessment outcomes in England. The study consisted of 17 focus groups conducted across six sectors of qualification users: students, teachers, trainee teachers, job-seekers, employers and…

Descriptors: Qualifications, Test Reliability, Foreign Countries, Focus Groups

Estimating Impacts of Treatment Random Assignment on Classroom Quality in the Head Start Impact Study: The Problem of Missing Data

Peer reviewed
PDF on ERIC

Download full text

Friedman-Krauss, Allison H.; Connors, Maia C.; Morris, Pamela A. – Society for Research on Educational Effectiveness, 2013

As a result of the 1998 reauthorization of Head Start, the Department of Health and Human Services conducted a national evaluation of the Head Start program. The goal of Head Start is to improve the school readiness skills of low-income children in the United States. There is a substantial body of experimental and correlational research that has…

Descriptors: Early Intervention, Preschool Education, School Readiness, Low Income Groups

IRR: A Blind Guide

Peer reviewed
PDF on ERIC

Download full text

Kierulff, Herbert – American Journal of Business Education, 2012

Over the past 60 years the internal rate of return (IRR) has become a major tool in investment evaluation. Many executives prefer it to net present value (NPV), presumably because they can more easily comprehend a percentage measure. This article demonstrates that, except in the rare case of an investment that is followed by a single cash return,…

Descriptors: Outcomes of Education, Measurement Techniques, Outcome Measures, Definitions

Measurement Invariance of Posttraumatic Stress Disorder Symptoms across Three Civilian Trauma Types

Direct link

Carter, Benjamin Hammond – ProQuest LLC, 2012

The factor structure of posttraumatic stress disorder (PTSD) remains the subject of intense investigation. The DSM three-factor conceptualization of PTSD has not been empirically supported; rather, two four-factor models of PTSD (King, Leskin, King, & Weathers, 1998; Simms, Watson, & Doebbeling, 2002) have garnered the majority of support…

Descriptors: Factor Structure, Posttraumatic Stress Disorder, Trauma, Symptoms (Individual Disorders)

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Direct link

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Factor Structure of the Revised TOEIC[R] Test: A Multiple-Sample Analysis

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2012

This study examined the factor structure of the listening and reading sections of the revised Test of English for International Communication (TOEIC[R]) test. The data from the TOEIC IP (institutional program) test taken by 569 English learners were randomly split into two samples (n = 285 vs. 284). Four models (higher-order, correlated,…

Descriptors: Communication (Thought Transfer), Second Language Learning, Factor Structure, Measurement

Fixing the c Parameter in the Three-Parameter Logistic Model

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012

For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…

Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)

Efficient Estimation of the Standardized Value

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2009

We derive an estimator of the standardized value which, under the standard assumptions of normality and homoscedasticity, is more efficient than the established (asymptotically efficient) estimator and discuss its gains for small samples. (Contains 1 table and 3 figures.)

Descriptors: Efficiency, Computation, Statistics, Sample Size

On the Use, the Misuse, and the Very Limited Usefulness of Cronbach's Alpha

Peer reviewed

Direct link

Sijtsma, Klaas – Psychometrika, 2009

This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…

Descriptors: Measurement, Error of Measurement, Scores, Computation

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

International Test Score Comparisons and Educational Policy: A Review of the Critiques

Peer reviewed
PDF on ERIC

Download full text

Carnoy, Martin – National Education Policy Center, 2015

Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…

Descriptors: Criticism, Reputation, Test Validity, Error of Measurement

Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

Download full text

Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010

This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

Descriptors: Equated Scores, Scoring, Responses, Test Items

Generalizability Theory: Measuring the Dependability of Selected Methods for Scoring Classroom Assessments

Direct link

Lengh, Carolyn J. – ProQuest LLC, 2010

This study compares the dependability of four classroom assessment scoring methods. Generalizability theory (G) and alternative decision (D) are used to measure the results of students' classroom assessment scores and compare the results of the four scoring methods on variability of rater by person variance and the level of G and D coefficients…

Descriptors: Generalizability Theory, Scoring, Social Studies, Tests

Using State-Space Model with Regime Switching to Represent the Dynamics of Facial Electromyography (EMG) Data

Peer reviewed

Direct link

Yang, Manshu; Chow, Sy-Miin – Psychometrika, 2010

Facial electromyography (EMG) is a useful physiological measure for detecting subtle affective changes in real time. A time series of EMG data contains bursts of electrical activity that increase in magnitude when the pertinent facial muscles are activated. Whereas previous methods for detecting EMG activation are often based on deterministic or…

Descriptors: Test Bias, Error of Measurement, Human Body, Diagnostic Tests

« Previous Page | Next Page »

Pages: 1 | ... | 94 | 95 | 96 | 97 | 98 | 99 | 100 | 101 | 102 | ... | 220

Educational and Psychological…	259
Journal of Educational…	115
ProQuest LLC	95
Applied Psychological…	85
Journal of Educational and…	85
Psychometrika	82
Structural Equation Modeling:…	76
Grantee Submission	69
Journal of Experimental…	69
ETS Research Report Series	58
Multivariate Behavioral…	54
Applied Measurement in…	50
Sociological Methods &…	46
Journal of Psychoeducational…	37
Psychological Methods	33
Society for Research on…	33
Educational Measurement:…	32
Research Synthesis Methods	32
Online Submission	29
International Journal of…	26
Journal of Educational…	26
Practical Assessment,…	26
National Center for Education…	25
Psychology in the Schools	24
Structural Equation Modeling	23
More ▼

Journal Articles	2348
Reports - Research	1892
Reports - Evaluative	702
Reports - Descriptive	342
Speeches/Meeting Papers	328
Dissertations/Theses -…	95
Numerical/Quantitative Data	86
Opinion Papers	77
Information Analyses	72
Tests/Questionnaires	47
Guides - Non-Classroom	26
Guides - Classroom - Teacher	12
Book/Product Reviews	10
Reports - General	9
ERIC Publications	8
ERIC Digests in Full Text	7
Guides - General	7
Books	6
Guides - Classroom - Learner	4
Collected Works - General	3
Legal/Legislative/Regulatory…	3
Historical Materials	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Collected Works - Serials	1
More ▼

Program for International…	44
National Assessment of…	40
SAT (College Admission Test)	24
Trends in International…	24
Wechsler Intelligence Scale…	20
Early Childhood Longitudinal…	19
ACT Assessment	18
Wechsler Adult Intelligence…	12
Iowa Tests of Basic Skills	10
Schools and Staffing Survey…	10
Test of English as a Foreign…	8
Child Behavior Checklist	7
Graduate Record Examinations	7
National Longitudinal Survey…	7
Progress in International…	7
Beck Depression Inventory	6
Advanced Placement…	5
Armed Services Vocational…	5
Cognitive Abilities Test	5
Longitudinal Surveys of…	5
National Household Education…	5
Rosenberg Self Esteem Scale	5
Dynamic Indicators of Basic…	4
Law School Admission Test	4
Motivated Strategies for…	4
More ▼