Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 6 |
Descriptor
True Scores | 22 |
Item Response Theory | 7 |
Error of Measurement | 6 |
Reliability | 6 |
Correlation | 3 |
Elementary Secondary Education | 3 |
Evaluation Methods | 3 |
Test Construction | 3 |
Test Theory | 3 |
Ability | 2 |
Computation | 2 |
More ▼ |
Source
Author
Publication Type
Reports - Descriptive | 22 |
Journal Articles | 15 |
Speeches/Meeting Papers | 5 |
Reference Materials -… | 1 |
Reports - Evaluative | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
What Works Clearinghouse Rating
Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019
The R software provides packages and functions that provide data analysis in classical true score, generalizability theory, item response theory, and Rasch measurement theories. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…
Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory
Han, Yong; Wu, Wenjun; Ji, Suozhao; Zhang, Lijun; Zhang, Hui – International Educational Data Mining Society, 2019
Peer-grading is commonly adopted by instructors as an effective assessment method for MOOCs (Massive Open Online Courses) and SPOCs (Small Private online course). For solving the problems brought by varied skill levels and attitudes of online students, statistical models have been proposed to improve the fairness and accuracy of peer-grading.…
Descriptors: Peer Evaluation, Grading, Online Courses, Computer Assisted Testing
Van Duzer, Eric – Online Submission, 2011
This report introduces a short, hands-on activity that addresses a key challenge in teaching quantitative methods to students who lack confidence or experience with statistical analysis. Used near the beginning of the course, this activity helps students develop an intuitive insight regarding a number of abstract concepts which are key to…
Descriptors: Course Content, True Scores, Statistical Analysis, Sampling
Han, Kyung T. – Applied Psychological Measurement, 2009
This article provides a brief description of a Windows application called IRTEQ. IRTEQ employs an intuitive, user-friendly graphic user interface that can rescale one test form to another by using various item response theory (IRT) scaling methods. It supports various IRT models for test forms. It can also equate test scores on the scale of one…
Descriptors: Item Response Theory, Scaling, True Scores, Equated Scores
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert – Psychometrika, 2007
A new measure for reliability of a rating scale is introduced, based on the classical definition of reliability, as the ratio of the true score variance and the total variance. Clinical trial data can be employed to estimate the reliability of the scale in use, whenever repeated measurements are taken. The reliability is estimated from the…
Descriptors: Schizophrenia, Rating Scales, Likert Scales, True Scores

Lee, Guemin – Journal of Educational Measurement, 2000
Presents and illustrates an appropriate formula for correction for attenuation that can be used in situations in which one measure includes another measure as its part. The formula can be used for computing the correlation coefficient for true scores between total test and part test. (SLD)
Descriptors: Correlation, True Scores
Wininger, Steven R. – Teaching Statistics: An International Journal for Teachers, 2007
A hands-on activity is described in which students attempt to measure something that they cannot see. In small groups, students estimate the number of marbles in sealed boxes. Next, students' estimates are compared with the actual numbers. Last, values from both the students' estimates and actual numbers are used to explain measurement theory and…
Descriptors: Computation, Measurement, Experiential Learning, Theories

Baker, Frank B. – Applied Psychological Measurement, 1997
Describes an idiosyncracy of the MULTILOG (D. Thissen, 1991) parameter estimation process discovered during a simulation study involving the graded response model. A misordering reflected in boundary function location parameter estimates resulted in a large negative contribution to the true score followed by a large positive contribution. These…
Descriptors: Estimation (Mathematics), Simulation, True Scores

Holland, Paul W.; Hoskens, Machteld – Psychometrika, 2003
Gives an account of classical test theory that shows how it can be viewed as a mean and variance approximation to a general version of item response theory and then shows how this approach can give insight into predicting the true score of a test and the true scores of tests not necessarily parallel to the given test. (SLD)
Descriptors: Prediction, Test Format, Test Theory, True Scores

Green, Samuel B.; Hershberger, Scott L. – Structural Equation Modeling, 2000
Proposes true score models that can account for correlated errors and their effect on coefficient alpha. These models allow random measurement errors on earlier items to affect directly or indirectly the scores on later items. Conditions under which coefficient alpha may yield spuriously high estimates or reliability are discussed. (SLD)
Descriptors: Correlation, Error of Measurement, Reliability, True Scores

Dimitrov, Dimiter M. – 2003
This paper provides formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty parameters when the trait distribution is normal or logistic. With the proposed formula, one can evaluate the theoretical values of classical reliability indexes for norm-referenced and criterion-referenced…
Descriptors: Cutting Scores, Item Response Theory, Reliability, True Scores

Dimitrov, Dimiter M. – Journal of Applied Measurement, 2003
Proposes formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty when the trait (ability) distribution is normal or logistic. Provides an illustrative example for using the proposed formulas. (SLD)
Descriptors: Ability, Difficulty Level, Item Response Theory, Reliability

Jiang, Hai; Stout, William – Journal of Educational and Behavioral Statistics, 1998
Proposes a new regression correction for the SIBTEST statistical tests (R. Shealy and W. Stout, 1993) that essentially uses a two-segment piecewise linear regression of the true on observed matching subtest scores. A simulation study illustrates the approach. (SLD)
Descriptors: Estimation (Mathematics), Item Bias, Regression (Statistics), Simulation
Dimitrov, Dimiter M. – 2003
This paper provides analytic evaluations of expected (marginal) true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distributions, marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-references interpretations are…
Descriptors: Item Response Theory, Reliability, Test Construction, Test Items

Dimitrov, Dimiter M. – 2002
Exact formulas for classical error variance are provided for Rasch measurement with logistic distributions. An approximation formula with the normal ability distribution is also provided. With the proposed formulas, the additive contribution of individual items to the population error variance can be determined without knowledge of the other test…
Descriptors: Ability, Error of Measurement, Item Response Theory, Test Items
Previous Page | Next Page ยป
Pages: 1 | 2