ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Test Construction	20
Test Length	20
Testing Problems	20
Test Items	11
Test Validity	7
Item Banks	6
Test Reliability	6
Achievement Tests	5
Adaptive Testing	5
Computer Assisted Testing	5
Mastery Tests	5
Item Analysis	4
Mathematical Models	4
Test Format	4
Criterion Referenced Tests	3
Cutting Scores	3
Educational Testing	3
Elementary Secondary Education	3
Measurement Techniques	3
Multiple Choice Tests	3
Factor Analysis	2
Individual Testing	2
Item Sampling	2
Program Evaluation	2
Psychometrics	2
More ▼

Source

Educational and Psychological…	2
Applied Psychological…	1
Evaluation in Education:…	1
Journal of Educational…	1
Rhode Island Department of…	1
Science Education…	1

Publication Type

Reports - Research	10
Journal Articles	6
Speeches/Meeting Papers	5
Reports - Evaluative	4
Opinion Papers	3
Guides - Non-Classroom	2
Information Analyses	1
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

Elementary Education

Audience

Researchers	2
Practitioners	1

Location

New Jersey	1
Rhode Island	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1983

A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)

Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length

Rhode Island State Assessment Program District and School Testing Coordinators Handbook: K-1 Assessment Program

Download full text

Rhode Island Department of Elementary and Secondary Education, 2007

This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…

Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs

A New Approach to Test the Useability of a Science Question Paper in Terms of Time Allotment.

Peer reviewed

Sindhu, R. S.; Sharma, Reeta – Science Education International, 1999

Finds that the time required to attempt all the test items of each question paper in a four-paper sample was inversely proportional to the percentage of students who attempted all the test items of that paper. Extrapolates results to give guidelines for determining the feasibility of newly-developed exam papers. (WRM)

Descriptors: Science Tests, Secondary Education, Test Construction, Test Length

Item Clusters and Computerized Adaptive Testing: A Case for Testlets.

Peer reviewed

Wainer, Howard; Kiely, Gerard L. – Journal of Educational Measurement, 1987

The testlet, a bundle of test items, alleviates some problems associated with computerized adaptive testing: context effects, lack of robustness, and item difficulty ordering. While testlets may be linear or hierarchical, the most useful ones are four-level hierarchical units, containing 15 items and partitioning examinees into 16 classes. (GDC)

Descriptors: Adaptive Testing, Computer Assisted Testing, Context Effect, Item Banks

Applying Ranking and Selection Techniques to Determine the Length of a Mastery Test.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

A problem of considerable importance in certain educational settings is determining how many items to include on a mastery test. Applying ranking and selection procedures, a solution is given which includes as a special case all existing single-stage, non-Bayesian solutions based on a strong true-score model. (Author/JKS)

Descriptors: Criterion Referenced Tests, Mastery Tests, Nonparametric Statistics, Probability

Three Practical Issues for Modern Adaptive Testing Item Pools.

Download full text

Stocking, Martha L. – 1994

As adaptive testing moves toward operational implementation in large scale testing programs, where it is important that adaptive tests be as parallel as possible to existing linear tests, a number of practical issues arise. This paper concerns three such issues. First, optimum item pool size is difficult to determine in advance of pool…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Standards

Item Pool Construction for Use With Latent Trait Models.

PDF pending restoration

Reckase, Mark D. – 1979

Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…

Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models

Passing Score and Length of a Mastery Test.

van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)

Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Discriminability in Multidimensional Performance Evaluations.

Peer reviewed

Kafry, Ditsa; And Others – Applied Psychological Measurement, 1979

A series of behavioral expectation scale applications were analyzed in an attempt to point out an appropriate number of dimensions to be included in such studies. Results reflected the problems of dimension interdependence when the number of dimensions exceeds nine. (Author/JKS)

Descriptors: Behavior Rating Scales, Expectation, Factor Analysis, Higher Education

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Pretesting alongside an Operational CAT.

Download full text

Davey, Tim; Pommerich, Mary; Thompson, Tony D. – 1999

In computerized adaptive testing (CAT), new or experimental items are frequently administered alongside operational tests to gather the pretest data needed to replenish and replace item pools. The two basic strategies used to combine pretest and operational items are embedding and appending. Variable-length CATs are preferred because of the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Measurement Techniques

A Short and Simple Introduction to Tailored Testing.

Download full text

Rudner, Lawrence M. – 1978

Tailored testing provides the same information as group-administered standardized tests, but can do so using fewer items because the items administered are selected for the ability of the individual student. Thus, tailored testing offers several advantages over traditional methods. Because individual tailored tests are not timed, anxiety is…

Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Download full text

Hambleton, Ronald K. – 1986

The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…

Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics

WISC-R Short Forms: Long on Problems.

Boyd, Thomas A.; Tramontana, Michael G. – 1984

To examine the validity of short forms of the Wechsler Intelligence Scale for Children-Revised (WISC-R), the WISC-R was first administered to 106 hospitalized psychiatric patients, aged 8-16. No subjects had a primary diagnosis of mental retardation or learning disability, and one-third were receiving psychotropic medication. WISC-R IQ scores…

Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Wilcox, Rand R. – 1979

Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…

Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests

Previous Page | Next Page »

Pages: 1 | 2

Wilcox, Rand R.	2
Boyd, Thomas A.	1
Carifio, James	1
Carlson, Ken	1
Conger, Anthony J.	1
Davey, Tim	1
Hambleton, Ronald K.	1
Harnisch, Delwyn L.	1
Kafry, Ditsa	1
Kiely, Gerard L.	1
Larson, Gordon A.	1
Millman, Jason	1
Myers, Charles T.	1
Pommerich, Mary	1
Reckase, Mark D.	1
Rudner, Lawrence M.	1
Sharma, Reeta	1
Sindhu, R. S.	1
Stocking, Martha L.	1
Thompson, Tony D.	1
Tramontana, Michael G.	1
Wainer, Howard	1
van der Linden, Wim J.	1
More ▼