ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Test Reliability	25
Test Theory	25
Testing Problems	25
Test Construction	11
Test Validity	10
Achievement Tests	7
Criterion Referenced Tests	7
Test Interpretation	7
Test Items	7
Mathematical Models	5
Scores	5
Test Bias	5
Educational Testing	4
Elementary Secondary Education	4
Equated Scores	4
Error of Measurement	4
Item Analysis	4
Norm Referenced Tests	4
Reading Tests	4
Statistical Analysis	4
Test Use	4
Testing	4
Adaptive Testing	3
Career Development	3
Computer Assisted Testing	3
More ▼

Source

Journal of Experimental…	2
Applied Psychological…	1
Educational and Psychological…	1
Executive Review	1
Journal of Educational…	1
Journal of Educational and…	1
Performance and Instruction	1
Review of Research in…	1
School Psychology Review	1

Publication Type

Journal Articles	9
Reports - Research	8
Opinion Papers	6
Speeches/Meeting Papers	5
Books	3
Collected Works - Serials	3
Information Analyses	3
Reports - Evaluative	3
Collected Works - General	2
Reports - Descriptive	2
Collected Works - Proceedings	1
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Reference Materials -…	1
More ▼

Education Level

Audience

Practitioners	3
Teachers	2
Researchers	1
Students	1

Location

Texas

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	1
Childrens Depression Inventory	1
Expressive One Word Picture…	1
Nelson Denny Reading Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

The Reliability of a Profile.

Peer reviewed

Yarnold, Paul R. – Educational and Psychological Measurement, 1984

Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)

Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability

The Attenuation Paradox of Traditional Test Theory as a Breakdown of Local Independence in Person-Item Response Theory.

Andrich, David – 1984

Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…

Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability

Error of Measurement and Statistical Inference: Some Anomalies.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980

It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)

Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

The Reliability of Sums and Differences of Test Scores: Some New Results and Anomalies.

Peer reviewed

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981

Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…

Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores

Problems, Perspectives, and Practical Issues in Equating.

Peer reviewed

Weiss, David J., Ed. – Applied Psychological Measurement, 1987

Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)

Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Efficiency of Linear Equating as a Function of the Length of the Anchor Test.

Peer reviewed

Budescu, David – Journal of Educational Measurement, 1985

An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)

Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Domain-Referenced Testing of Reading Achievement.

Brittain, Mary M.; Brittain, Clay V. – 1981

A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…

Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Test Design Project: Studies in Test Adequacy. Annual Report.

Download full text

Wilcox, Rand R. – 1981

These studies in test adequacy focus on two problems: procedures for estimating reliability, and techniques for identifying ineffective distractors. Fourteen papers are presented on recent advances in measuring achievement (a response to Molenaar); "an extension of the Dirichlet-multinomial model that allows true score and guessing to be…

Descriptors: Achievement Tests, Criterion Referenced Tests, Guessing (Tests), Mathematical Models

A Discussion of the Expressive One-Word Picture Vocabulary Test.

Peer reviewed

Altepeter, Tom – School Psychology Review, 1983

A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)

Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli

Depression in Children: The Children's Depression Inventory.

Download full text

Crowley, Susan L.; And Others – 1993

Issues surrounding accurate assessment of depression in children have received much attention. However, the stability of scores from depression measures has generally been estimated using only classical test score theory, rather than the more powerful generalizability theory. The dependability of scores from the Children's Depression Inventory…

Descriptors: Children, Clinical Diagnosis, Depression (Psychology), Diagnostic Tests

An Overview of Criterion-Referenced Test Development.

Shrock, Sharon; And Others – Performance and Instruction, 1986

Presents major stages in design and development of criterion referenced tests (CRT) with emphasis on differences between CRT construction and norm-referenced test construction. Discussion covers test interpretation; test theory; preparation for test construction (hierarchical analysis, item type selection, and choosing number of items); test…

Descriptors: Adoption (Ideas), Comparative Analysis, Criterion Referenced Tests, Industrial Training

Previous Page | Next Page »

Pages: 1 | 2

Bormuth, John R.	2
Zimmerman, Donald W.	2
Altepeter, Tom	1
Andrich, David	1
Beard, John D., Ed.	1
Brittain, Clay V.	1
Brittain, Mary M.	1
Budescu, David	1
Chase, Clinton I.	1
Cliff, Norman	1
Coffman, William E.	1
Crowley, Susan L.	1
Jacobs, Lucy Cheser	1
Janda, Louis H.	1
Kettler, Ryan J.	1
Linn, Robert L., Ed.	1
Longford, Nicholas T.	1
McNabb, Scott E., Ed.	1
Myers, Charles T.	1
Shrock, Sharon	1
Wadleigh, Sandra L.	1
Weiss, David J., Ed.	1
Wilcox, Rand R.	1
Williams, Richard H.	1
More ▼