ERIC - Search Results

Showing 1 to 15 of 22 results

Psychometric Packages in R

Peer reviewed

Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019

The R software offers packages and functions for data analysis under classical true score theory, generalizability theory, item response theory, and Rasch measurement theory. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…

Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory

A Human-Machine Hybrid Peer Grading Framework for SPOCs

Peer reviewed

Han, Yong; Wu, Wenjun; Ji, Suozhao; Zhang, Lijun; Zhang, Hui – International Educational Data Mining Society, 2019

Peer grading is commonly adopted by instructors as an effective assessment method for MOOCs (Massive Open Online Courses) and SPOCs (Small Private Online Courses). To address the problems caused by the varied skill levels and attitudes of online students, statistical models have been proposed to improve the fairness and accuracy of peer grading.…

Descriptors: Peer Evaluation, Grading, Online Courses, Computer Assisted Testing

Rocks: A Concrete Activity That Introduces Normal Distribution, Sampling Error, Central Limit Theorem and True Score Theory

Van Duzer, Eric – Online Submission, 2011

This report introduces a short, hands-on activity that addresses a key challenge in teaching quantitative methods to students who lack confidence or experience with statistical analysis. Used near the beginning of the course, this activity helps students develop an intuitive insight regarding a number of abstract concepts which are key to…

Descriptors: Course Content, True Scores, Statistical Analysis, Sampling

IRTEQ: Windows Application that Implements Item Response Theory Scaling and Equating

Peer reviewed

Han, Kyung T. – Applied Psychological Measurement, 2009

This article provides a brief description of a Windows application called IRTEQ. IRTEQ employs an intuitive, user-friendly graphical user interface that can rescale one test form to another by using various item response theory (IRT) scaling methods. It supports various IRT models for test forms. It can also equate test scores on the scale of one…

Descriptors: Item Response Theory, Scaling, True Scores, Equated Scores

A Measure for the Reliability of a Rating Scale Based on Longitudinal Clinical Trial Data

Peer reviewed

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert – Psychometrika, 2007

A new measure for the reliability of a rating scale is introduced, based on the classical definition of reliability as the ratio of the true score variance to the total variance. Clinical trial data can be employed to estimate the reliability of the scale in use, whenever repeated measurements are taken. The reliability is estimated from the…

Descriptors: Schizophrenia, Rating Scales, Likert Scales, True Scores
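
The classical definition cited in this abstract (reliability as true score variance divided by total variance) can be illustrated with a small simulation. This is only a sketch under assumed conditions: normally distributed true scores and independent normal errors. It is not the longitudinal estimator the article proposes, and the function names and default values are hypothetical.

```python
import random
import statistics


def theoretical_reliability(true_sd, error_sd):
    """Classical reliability: var(T) / (var(T) + var(E)) for independent errors."""
    return true_sd ** 2 / (true_sd ** 2 + error_sd ** 2)


def simulated_reliability(n_subjects=20000, true_sd=2.0, error_sd=1.0, seed=0):
    """Simulate observed = true + error and return var(true) / var(observed)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    true_scores = [rng.gauss(50.0, true_sd) for _ in range(n_subjects)]
    observed = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    return statistics.pvariance(true_scores) / statistics.pvariance(observed)


if __name__ == "__main__":
    print(theoretical_reliability(2.0, 1.0))
    print(simulated_reliability())
```

With a true score standard deviation of 2 and an error standard deviation of 1, the theoretical value is 4 / (4 + 1) = 0.8, and the simulated ratio should land close to it.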

A Disattenuated Part-Whole Correlation Formula.

Peer reviewed

Lee, Guemin – Journal of Educational Measurement, 2000

Presents and illustrates an appropriate formula for correction for attenuation that can be used in situations in which one measure includes another measure as its part. The formula can be used for computing the correlation coefficient for true scores between total test and part test. (SLD)

Descriptors: Correlation, True Scores
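
For orientation, the standard (Spearman) correction for attenuation divides the observed correlation by the square root of the product of the two reliabilities. The sketch below implements only that standard formula; it is not the part-whole formula presented in the article, which is not reproduced in this abstract. The function name is hypothetical.

```python
import math


def disattenuate(r_xy, r_xx, r_yy):
    """Standard correction for attenuation: r_xy / sqrt(r_xx * r_yy).

    r_xy is the observed correlation; r_xx and r_yy are the reliabilities
    of the two measures. This assumes the two measures are distinct, so it
    does not handle the part-whole case the article addresses.
    """
    if not (0.0 < r_xx <= 1.0 and 0.0 < r_yy <= 1.0):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(r_xx * r_yy)


if __name__ == "__main__":
    print(disattenuate(0.6, 0.8, 0.9))
```

For example, an observed correlation of 0.6 with reliabilities 0.8 and 0.9 yields an estimated true-score correlation of about 0.707.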

Measuring Marbles: Demonstrating the Basic Tenets of Measurement Theory

Peer reviewed

Wininger, Steven R. – Teaching Statistics: An International Journal for Teachers, 2007

A hands-on activity is described in which students attempt to measure something that they cannot see. In small groups, students estimate the number of marbles in sealed boxes. Next, students' estimates are compared with the actual numbers. Last, values from both the students' estimates and actual numbers are used to explain measurement theory and…

Descriptors: Computation, Measurement, Experiential Learning, Theories

Estimation of Graded Response Model Parameters Using MULTILOG.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1997

Describes an idiosyncrasy of the MULTILOG (D. Thissen, 1991) parameter estimation process discovered during a simulation study involving the graded response model. A misordering reflected in boundary function location parameter estimates resulted in a large negative contribution to the true score followed by a large positive contribution. These…

Descriptors: Estimation (Mathematics), Simulation, True Scores

Classical Test Theory as a First-Order Item Response Theory: Application to True-Score Prediction from a Possibly Nonparallel Test.

Peer reviewed

Holland, Paul W.; Hoskens, Machteld – Psychometrika, 2003

Gives an account of classical test theory that shows how it can be viewed as a mean and variance approximation to a general version of item response theory and then shows how this approach can give insight into predicting the true score of a test and the true scores of tests not necessarily parallel to the given test. (SLD)

Descriptors: Prediction, Test Format, Test Theory, True Scores

Peer reviewed

Green, Samuel B.; Hershberger, Scott L. – Structural Equation Modeling, 2000

Proposes true score models that can account for correlated errors and their effect on coefficient alpha. These models allow random measurement errors on earlier items to affect directly or indirectly the scores on later items. Conditions under which coefficient alpha may yield spuriously high estimates of reliability are discussed. (SLD)

Descriptors: Correlation, Error of Measurement, Reliability, True Scores
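
As background for this entry, coefficient alpha is conventionally computed from the item variances and the variance of the total score. The sketch below shows that conventional computation only; it does not model the correlated errors the article studies, and the function name and data layout are assumptions made for illustration.

```python
import statistics


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score table.

    scores: list of rows, one per respondent, each a list of k item scores.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    item_vars = [statistics.variance([row[i] for row in scores]) for i in range(k)]
    total_var = statistics.variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Three perfectly consistent items: alpha reaches its maximum of 1.
    print(cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]]))
```

Because alpha depends only on these variances, any dependence among item errors that inflates the total-score variance will also raise alpha, which is the kind of distortion the abstract describes.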

Reliability of True Cutting Scores for Rasch Calibrated Items.

Dimitrov, Dimiter M. – 2003

This paper provides formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty parameters when the trait distribution is normal or logistic. With the proposed formula, one can evaluate the theoretical values of classical reliability indexes for norm-referenced and criterion-referenced…

Descriptors: Cutting Scores, Item Response Theory, Reliability, True Scores

Reliability and True-Score Measures of Binary Items as a Function of Their Rasch Difficulty Parameter.

Peer reviewed

Dimitrov, Dimiter M. – Journal of Applied Measurement, 2003

Proposes formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty when the trait (ability) distribution is normal or logistic. Provides an illustrative example for using the proposed formulas. (SLD)

Descriptors: Ability, Difficulty Level, Item Response Theory, Reliability
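
To make the quantities in this abstract concrete: under the Rasch model, the true score on a binary item at ability θ is the probability of a correct response, and the expected (marginal) true score averages that probability over the ability distribution. The sketch below evaluates the marginal value by plain numerical integration against a standard normal density. It is an illustration under those assumptions, not the closed-form formulas proposed in the article, and all function names are hypothetical.

```python
import math


def rasch_prob(theta, b):
    """Rasch probability of a correct response for ability theta, difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def true_score(theta, difficulties):
    """Number-right true score at ability theta: sum of item probabilities."""
    return sum(rasch_prob(theta, b) for b in difficulties)


def marginal_item_score(b, lo=-8.0, hi=8.0, steps=2000):
    """Expected item score over a standard normal ability distribution.

    Uses the trapezoidal rule on [lo, hi]; the normal tails beyond
    +/-8 are negligible for this purpose.
    """
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        theta = lo + i * h
        weight = 0.5 if i in (0, steps) else 1.0  # trapezoid end-point weights
        density = math.exp(-0.5 * theta * theta) / math.sqrt(2.0 * math.pi)
        total += weight * density * rasch_prob(theta, b)
    return total * h


if __name__ == "__main__":
    print(true_score(0.0, [-1.0, 0.0, 1.0]))
    print(marginal_item_score(0.0))
```

For an item with difficulty 0, symmetry gives an expected score of 0.5; harder items (larger b) yield smaller expected scores.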

Improved Type I Error Control and Reduced Estimation Bias for DIF Detection Using SIBTEST.

Peer reviewed

Jiang, Hai; Stout, William – Journal of Educational and Behavioral Statistics, 1998

Proposes a new regression correction for the SIBTEST statistical tests (R. Shealy and W. Stout, 1993) that essentially uses a two-segment piecewise linear regression of the true on observed matching subtest scores. A simulation study illustrates the approach. (SLD)

Descriptors: Estimation (Mathematics), Item Bias, Regression (Statistics), Simulation

Expected Values and Reliability of Number-Right Scores for IRT Calibrated Items.

Dimitrov, Dimiter M. – 2003

This paper provides analytic evaluations of expected (marginal) true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distributions, marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-referenced interpretations are…

Descriptors: Item Response Theory, Reliability, Test Construction, Test Items

Error Variance of Rasch Measurement with Logistic Ability Distributions.

Dimitrov, Dimiter M. – 2002

Exact formulas for classical error variance are provided for Rasch measurement with logistic distributions. An approximation formula with the normal ability distribution is also provided. With the proposed formulas, the additive contribution of individual items to the population error variance can be determined without knowledge of the other test…

Descriptors: Ability, Error of Measurement, Item Response Theory, Test Items
