Showing 1,966 to 1,980 of 3,296 results
Peer reviewed
PDF on ERIC Download full text
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis
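For context on the classical true score framework the abstract invokes, a standard textbook statement (not drawn from the report itself) decomposes an observed score into a true score plus error, with reliability defined as the share of observed-score variance attributable to true scores:

X = T + E, \qquad \rho_{XX'} = \frac{\sigma^2_T}{\sigma^2_X} = 1 - \frac{\sigma^2_E}{\sigma^2_X}

On this view, unequal reliabilities for two forms mean unequal error variances, which is the sense in which reliability differences can distort an equating in the NEAT design.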
Peer reviewed
Direct link
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007
Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…
Descriptors: Psychiatry, Patients, Error of Measurement, Test Length
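A worked illustration of the measurement-error point (standard classical test theory, not taken from the article): the standard error of measurement grows as reliability falls, so the confidence band around an observed score near a cut point widens for short tests.

\mathrm{SEM} = \sigma_X\sqrt{1 - \rho_{XX'}}, \qquad X \pm z_{1-\alpha/2}\,\mathrm{SEM}

For example, with hypothetical values \sigma_X = 5 and \rho_{XX'} = .70, SEM ≈ 2.7, so a 95% band spans roughly ±5.4 score points around the observed score.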
Peer reviewed
Direct link
Jenson, William R.; Clark, Elaine; Kircher, John C.; Kristjansson, Sean D. – Psychology in the Schools, 2007
Evidence-based practice approaches to intervention have come of age and promise to provide a new standard of excellence for school psychologists. This article describes several definitions of evidence-based practice and the problems associated with traditional statistical analyses that rely on rejection of the null hypothesis for the…
Descriptors: School Psychologists, Statistical Analysis, Hypothesis Testing, Intervention
Peer reviewed
Direct link
Barkaoui, Khaled – Canadian Modern Language Review, 2007
Essay tests are widely used to assess ESL/EFL learners' writing abilities for instructional, administrative, and research purposes. Relevant literature was searched to identify 70 empirical studies on ESL/EFL essay tests. The majority of these studies examined task, essay, and rater effects on essay rating and scores. Less attention has been given…
Descriptors: Essay Tests, Language Tests, English (Second Language), Second Language Learning
Peer reviewed
Direct link
George, James D.; Bradshaw, Danielle I.; Hyde, Annette; Vehrs, Pat R.; Hager, Ronald L.; Yanowitz, Frank G. – Measurement in Physical Education and Exercise Science, 2007
The purpose of this study was to develop an age-generalized regression model to predict maximal oxygen uptake (VO₂max) based on a maximal treadmill graded exercise test (GXT; George, 1996). Participants (N = 100), ages 18-65 years, reached a maximal level of exertion (mean ± standard deviation [SD]; maximal heart rate [HR sub…
Descriptors: Metabolism, Body Composition, Multiple Regression Analysis, Error of Measurement
Peer reviewed
Direct link
Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2007
The impact of outliers on Cronbach's coefficient α has not been documented in the psychometric or statistical literature. This is an important gap because coefficient α is the most widely used measurement statistic in all of the social, educational, and health sciences. The impact of outliers on coefficient α is investigated for…
Descriptors: Psychometrics, Computation, Reliability, Monte Carlo Methods
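For reference, the statistic in question is the familiar coefficient α for a k-item scale (the standard formula, not anything specific to this article); outliers enter through the item variances \sigma^2_i and the total-score variance \sigma^2_X:

\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma^2_i}{\sigma^2_X}\right)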
Peer reviewed
PDF on ERIC Download full text
Mapuranga, Raymond; Dorans, Neil J.; Middleton, Kyndra – ETS Research Report Series, 2008
In many practical settings, essentially the same differential item functioning (DIF) procedures have been in use since the late 1980s. Since then, examinee populations have become more heterogeneous, and tests have included more polytomously scored items. This paper summarizes and classifies new DIF methods and procedures that have appeared since…
Descriptors: Test Bias, Educational Development, Evaluation Methods, Statistical Analysis
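As background, the workhorse DIF procedure of the late 1980s is the Mantel-Haenszel method; its usual reporting scale at ETS, the delta metric, rescales the estimated common odds ratio \hat{\alpha}_{MH} as follows (standard formulation, offered here only for orientation):

\mathrm{MH\ D\text{-}DIF} = -2.35\,\ln\hat{\alpha}_{MH}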
Peer reviewed
PDF on ERIC Download full text
von Davier, Alina A.; Holland, Paul W.; Livingston, Samuel A.; Casabianca, Jodi; Grant, Mary C.; Martin, Kathleen – ETS Research Report Series, 2006
This study examines how closely the kernel equating (KE) method (von Davier, Holland, & Thayer, 2004a) approximates the results of other observed-score equating methods--equipercentile and linear equatings. The study used pseudotests constructed of item responses from a real test to simulate three equating designs: an equivalent groups (EG)…
Descriptors: Equated Scores, Statistical Analysis, Simulation, Tests
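For orientation, the equipercentile benchmark maps a form-X score x to the form-Y score with the same percentile rank; kernel equating applies the same function after continuizing the discrete score distributions with (typically Gaussian) kernels. A standard statement of the equipercentile function, not quoted from the report:

e_Y(x) = F_Y^{-1}\big(F_X(x)\big)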
Zhang, Yanwei; Breithaupt, Krista; Tessema, Aster; Chuah, David – Online Submission, 2006
Two IRT-based procedures to estimate test reliability for a certification exam that used both adaptive (via an MST model) and non-adaptive designs were considered in this study. Both procedures rely on calibrated item parameters to estimate error variance. In terms of score variance, one procedure (Method 1) uses the empirical ability distribution…
Descriptors: Individual Testing, Test Reliability, Programming, Error of Measurement
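One generic way calibrated IRT item parameters yield a reliability-like index (a textbook marginal-reliability expression, not necessarily either of the paper's two procedures) contrasts ability variance with the average conditional error variance implied by the test information function:

\bar{\rho} = \frac{\sigma^2_\theta}{\sigma^2_\theta + \overline{\sigma^2_E}}, \qquad \sigma^2_E(\theta) = \frac{1}{I(\theta)}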
Peer reviewed
Direct link
Gonzalez-Roma, Vicente; Hernandez, Ana; Gomez-Benito, Juana – Multivariate Behavioral Research, 2006
In this simulation study, we investigate the power and Type I error rate of a procedure based on the mean and covariance structure analysis (MACS) model in detecting differential item functioning (DIF) of graded response items with five response categories. The following factors were manipulated: type of DIF (uniform and non-uniform), DIF…
Descriptors: Multivariate Analysis, Item Response Theory, Test Bias, Sample Size
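In broad strokes (a generic MACS formulation, not the authors' exact specification), the response of person j to item i is modeled in each group g through an item intercept and a factor loading on the latent trait; uniform DIF shows up as a between-group difference in the intercept, nonuniform DIF as a difference in the loading:

x_{ij}^{(g)} = \tau_i^{(g)} + \lambda_i^{(g)}\xi_j + \delta_{ij}, \qquad \text{uniform DIF: } \tau_i^{(1)} \ne \tau_i^{(2)}, \quad \text{nonuniform DIF: } \lambda_i^{(1)} \ne \lambda_i^{(2)}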
Peer reviewed
Direct link
Sass, Daniel A.; Smith, Philip L. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
Structural equation modeling allows several methods of estimating the disattenuated association between 2 or more latent variables (i.e., the measurement model). In one common approach, measurement models are specified using item parcels as indicators of latent constructs. Item parcels versus original items are often used as indicators in these…
Descriptors: Structural Equation Models, Item Analysis, Error of Measurement, Measures (Individuals)
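The "disattenuated association" at issue is, in classical terms, a correlation corrected for unreliability of the measures; the familiar correction formula (standard, not taken from the article) is

\hat{\rho}_{T_X T_Y} = \frac{r_{XY}}{\sqrt{r_{XX'}\,r_{YY'}}}

Latent-variable models accomplish the same correction implicitly by separating error variance from the indicators, and parceling changes what those indicators are (sums or means of item subsets) and hence the estimated error structure.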
Peer reviewed
Direct link
Aguinis, Herman; Pierce, Charles A. – Applied Psychological Measurement, 2006
The computation and reporting of effect size estimates are becoming the norm in many journals in psychology and related disciplines. Despite the increased importance of effect sizes, researchers may not report them or may report inaccurate values because of a lack of appropriate computational tools. For instance, Pierce, Block, and Aguinis (2004)…
Descriptors: Effect Size, Multiple Regression Analysis, Predictor Variables, Error of Measurement
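A common effect size in the moderated multiple regression setting the authors address is Cohen's f² for the increment in explained variance when the interaction (product) term is added; this is the standard definition, not necessarily the exact quantity their computational tool reports:

f^2 = \frac{R^2_{\text{full}} - R^2_{\text{reduced}}}{1 - R^2_{\text{full}}}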
Peer reviewed
Direct link
Meyers, Jason L.; Beretvas, S. Natasha – Multivariate Behavioral Research, 2006
Cross-classified random effects modeling (CCREM) is used to model multilevel data from nonhierarchical contexts. These models are widely discussed but infrequently used in social science research. Because little research exists assessing when it is necessary to use CCREM, 2 studies were conducted. A real data set with a cross-classified structure…
Descriptors: Social Science Research, Computation, Models, Data Analysis
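For readers unfamiliar with the acronym, a minimal two-way cross-classified random effects model (a generic form, not the specific models fitted in these studies) lets the outcome for observation i depend on two non-nested random classifications j and k, such as students cross-classified by neighborhood and school:

Y_{i(j,k)} = \gamma_0 + u_j + v_k + e_{i(j,k)}, \qquad u_j \sim N(0,\sigma^2_u),\ v_k \sim N(0,\sigma^2_v),\ e_{i(j,k)} \sim N(0,\sigma^2_e)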
Kieffer, Kevin M. – 1998
This paper discusses the benefits of using generalizability theory in lieu of classical test theory. Generalizability theory subsumes and extends the precepts of classical test theory by estimating the magnitude of multiple sources of measurement error and their interactions simultaneously in a single analysis. Since classical test theory examines…
Descriptors: Error of Measurement, Generalizability Theory, Heuristics, Interaction
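By way of illustration (a standard single-facet persons-by-items decomposition, not the paper's own example), generalizability theory splits observed-score variance into several sources at once and summarizes dependability with a generalizability coefficient, whereas classical test theory folds everything but persons into one undifferentiated error term:

\sigma^2_{X_{pi}} = \sigma^2_p + \sigma^2_i + \sigma^2_{pi,e}, \qquad E\rho^2 = \frac{\sigma^2_p}{\sigma^2_p + \sigma^2_{pi,e}/n_i}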
Woodruff, David – 1989
Previous methods for estimating the conditional standard error of measurement (CSEM) at specific score or ability levels are critically discussed, and a brief summary of prior empirical results is given. A new method is developed which avoids theoretical problems inherent in some prior methods, is easy to implement, and estimates not only a…
Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Models, Predictive Measurement
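One of the earlier CSEM estimators this line of work responds to is Lord's binomial-error formula for a number-correct score x on an n-item test (cited here as an example of a prior method, not as Woodruff's proposal):

\widehat{\mathrm{SEM}}(x) = \sqrt{\frac{x\,(n - x)}{n - 1}}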