Showing 1 to 15 of 218 results
Peer reviewed
Direct link
Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024
We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…
Descriptors: Best Practices, Reliability, Counseling, Research
Peer reviewed
Direct link
Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023
Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…
Descriptors: Generalizability Theory, Probability, Scores, Sampling
Peer reviewed
Direct link
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Peer reviewed
Direct link
Franco-Martínez, Alicia; Alvarado, Jesús M.; Sorrel, Miguel A. – Educational and Psychological Measurement, 2023
A sample suffers range restriction (RR) when its variance is reduced compared with its population variance and, in turn, it fails to represent that population. If the RR occurs over the latent factor, not directly over the observed variable, the researcher deals with an indirect RR, common when using convenience samples. This work explores how…
Descriptors: Factor Analysis, Factor Structure, Scores, Sampling
Peer reviewed
Direct link
Chan, Wendy – American Journal of Evaluation, 2022
Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…
Descriptors: Probability, Scores, Scoring, Generalization
Peer reviewed
Direct link
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed-response items (i.e., rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Peer reviewed
PDF on ERIC Download full text
Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022
When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…
Descriptors: Item Response Theory, Test Construction, Scoring, Testing
Peer reviewed
Direct link
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Peer reviewed
PDF on ERIC Download full text
Özmen, Zeynep Medine; Güven, Bülent – Journal of Pedagogical Research, 2022
The present study aimed to remediate pre-service teachers' misconceptions about sampling distributions and to develop their conceptual understanding through the use of conceptual change texts (CCTs). The participants consisted of 84 pre-service teachers. To determine the pre-service teachers' conceptual understanding of sampling distributions, an…
Descriptors: Preservice Teachers, Mathematics Teachers, Sampling, Statistical Distributions
Peer reviewed
Direct link
Joo, Sean; Ali, Usama; Robin, Frederic; Shin, Hyo Jeong – Large-scale Assessments in Education, 2022
We investigated the potential impact of differential item functioning (DIF) on group-level mean and standard deviation estimates using empirical and simulated data in the context of large-scale assessment. For the empirical investigation, PISA 2018 cognitive domains (Reading, Mathematics, and Science) data were analyzed using Jackknife sampling to…
Descriptors: Test Items, Item Response Theory, Scores, Student Evaluation
Peer reviewed
PDF on ERIC Download full text
Yao, Lili; Haberman, Shelby; McCaffrey, Daniel F.; Lockwood, J. R. – ETS Research Report Series, 2020
Minimum discriminant information adjustment (MDIA), an approach to weighting samples to conform to known population information, provides a generalization of raking and poststratification. In the case of simple random sampling with replacement with uniform sampling weights, large-sample properties are available for MDIA estimates of population…
Descriptors: Discriminant Analysis, Sampling, Sample Size, Scores
Peer reviewed
Direct link
Ji-Eun Lee; Amisha Jindal; Sanika Nitin Patki; Ashish Gurung; Reilly Norum; Erin Ottmar – Interactive Learning Environments, 2024
This paper demonstrated how to apply Machine Learning (ML) techniques to analyze student interaction data collected in an online mathematics game. Using a data-driven approach, we examined (1) how different ML algorithms influenced the precision of middle-school students' (N = 359) performance (i.e., posttest math knowledge scores) prediction and (2)…
Descriptors: Teaching Methods, Algorithms, Mathematics Tests, Computer Games
Peer reviewed
Direct link
Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020
Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…
Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling
Ji-Eun Lee; Amisha Jindal; Sanika Nitin Patki; Ashish Gurung; Reilly Norum; Erin Ottmar – Grantee Submission, 2023
This paper demonstrated how to apply Machine Learning (ML) techniques to analyze student interaction data collected in an online mathematics game. Using a data-driven approach, we examined: (1) how different ML algorithms influenced the precision of middle-school students' (N = 359) performance (i.e. posttest math knowledge scores) prediction; and…
Descriptors: Teaching Methods, Algorithms, Mathematics Tests, Computer Games
Peer reviewed
PDF on ERIC Download full text
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, with the use of a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement