ERIC - Search Results

Showing 61 to 75 of 415 results

Subject-Centered Scalability: The Sine Qua Non of Summated Ratings

Peer reviewed

Drewes, Donald W. – Psychological Methods, 2009

A unifying theory of subject-centered scalability is offered that is grounded in structural true score modeling, is conceptually distinct from internal consistency and homogeneity as determined by item correlations, and is empirically confirmable. Scalability holds when item true scores are perfectly correlated but differ in their individual scale…

Descriptors: Rating Scales, Factor Analysis, True Scores, Mathematical Models

Coping with Memory Effect and Serial Correlation when Estimating Reliability in a Longitudinal Framework

Peer reviewed

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010

Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…

Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics

Modification of the Mantel-Haenszel and Logistic Regression DIF Procedures to Incorporate the SIBTEST Regression Correction

Peer reviewed

DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009

The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes. When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…

Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

Gauging the Gaps: A Deeper Look at Student Achievement. K-12 Policy

Rowan, Anna Habash; Hall, Daria; Haycock, Kati – Education Trust, 2010

Leaders in schools, districts, and states, along with policymakers in Washington, D.C., are focusing new energy on closing long-standing gaps in performance that separate low-income students and students of color from others. It's critically important that their efforts succeed--for students, their families, their communities, and for their…

Descriptors: Elementary Secondary Education, Academic Achievement, National Competency Tests, True Scores

Reliability and Attribute-Based Scoring in Cognitive Diagnostic Assessment

Peer reviewed

Gierl, Mark J.; Cui, Ying; Zhou, Jiawen – Journal of Educational Measurement, 2009

The attribute hierarchy method (AHM) is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. Results from an AHM analysis yield information on examinees' cognitive strengths and weaknesses. Hence, the AHM…

Descriptors: Test Items, True Scores, Psychometrics, Algebra

An Alternative IRT Observed Score Equating Method. CRESST Report 751

Kang, Taehoon; Chen, Troy T. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2009

In this report, an alternative item response theory (IRT) observed score equating method was newly developed. The proposed equating method was illustrated with two real data sets and the equating results were compared to those of traditional IRT true score and IRT observed score equating methods. Using three loss indices, the new method appeared…

Descriptors: Equated Scores, Item Response Theory, True Scores, Methods

Posttraumatic Stress Disorder and Intimate Relationship Problems: A Meta-Analysis

Peer reviewed

Taft, Casey T.; Watkins, Laura E.; Stafford, Jane; Street, Amy E.; Monson, Candice M. – Journal of Consulting and Clinical Psychology, 2011

Objective: The authors conducted a meta-analysis of empirical studies investigating associations between indices of posttraumatic stress disorder (PTSD) and intimate relationship problems to empirically synthesize this literature. Method: A literature search using PsycINFO, Medline, Published International Literature on Traumatic Stress (PILOTS),…

Descriptors: Aggression, Posttraumatic Stress Disorder, Doctoral Dissertations, Error of Measurement

Linear Prediction of a True Score from a Direct Estimate and Several Derived Estimates

Peer reviewed

Haberman, Shelby J.; Qian, Jiahe – Journal of Educational and Behavioral Statistics, 2007

Statistical prediction problems often involve both a direct estimate of a true score and covariates of this true score. Given the criterion of mean squared error, this study determines the best linear predictor of the true score given the direct estimate and the covariates. Results yield an extension of Kelley's formula for estimation of the true…

Descriptors: Prediction, Regression (Statistics), True Scores, Correlation

The Impact of Equating Method and Format Representation of Common Items on the Adequacy of Mixed-Format Test Equating Using Nonequivalent Groups

Hagge, Sarah Lynn – ProQuest LLC, 2010

Mixed-format tests containing both multiple-choice and constructed-response items are widely used on educational tests. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…

Descriptors: Test Format, True Scores, Equated Scores, Psychometrics

Taking the True Measure of a Liberal Education

Katz, Stanley N. – Chronicle of Higher Education, 2008

How should one think about assessment in general education--or what is sometimes called liberal education--in the pluralistic environment of American higher education? Generalizations about longitudinal collegiate assessment are difficult, not least because of the remarkable range of four-year institutions and the students who attend them.…

Descriptors: Undergraduate Study, General Education, Outcomes of Education, True Scores

A Researcher's Dilemma: A Comparison of Estimated versus Actual College G.P.A.

Herman, William E.; Nelson, Gena C. – Online Submission, 2009

This study compared college student reported grade point averages (GPA) with actual GPA as recorded at the Registrar's Office to determine the accuracy of student reported GPA. Results indicated that, on average, students reported slightly higher GPA than their actual GPA. Additionally, females were virtually as accurate as males and students with…

Descriptors: Grade Point Average, Research Problems, Statistical Bias, True Scores

A Modification to Angoff and Bookmarking Cut Scores to Account for the Imperfect Reliability of Test Scores

Peer reviewed

MacCann, Robert G. – Educational and Psychological Measurement, 2008

It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…

Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores

An Equipercentile Version of the Levine Linear Observed-Score Equating Function Using the Methods of Kernel Equating. Research Report. ETS RR-07-14

Peer reviewed

von Davier, Alina A.; Fournier-Zajac, Stephanie; Holland, Paul W. – ETS Research Report Series, 2007

In the nonequivalent groups with anchor test (NEAT) design, there are several ways to use the information provided by the anchor in the equating process. One of the NEAT-design equating methods is the linear observed-score Levine method (Kolen & Brennan, 2004). It is based on a classical test theory model of the true scores on the test forms…

Descriptors: Equated Scores, Statistical Analysis, Test Items, Test Theory

Modeling Latent True Scores to Determine the Utility of Aggregate Student Perceptions as Classroom Indicators in HLM: The Case of Classroom Goal Structures

Peer reviewed

Miller, Angela D.; Murdock, Tamera B. – Contemporary Educational Psychology, 2007

Measures of classroom climate such as classroom goal structures are often assessed through students' perceptions; the aggregated means within classrooms are then sometimes labeled as "classroom characteristics." The validity of these constructs is limited by the reliability of the measure at both the student and classroom level; yet, few studies…

Descriptors: True Scores, Teacher Characteristics, Classroom Environment, Student Attitudes
