Publication Date
| In 2026 | 3 |
| Since 2025 | 437 |
| Since 2022 (last 5 years) | 1935 |
| Since 2017 (last 10 years) | 4079 |
| Since 2007 (last 20 years) | 6785 |
Audience
| Practitioners | 644 |
| Teachers | 455 |
| Researchers | 440 |
| Administrators | 126 |
| Policymakers | 68 |
| Students | 68 |
| Counselors | 26 |
| Parents | 24 |
| Community | 10 |
| Support Staff | 5 |
| Media Staff | 3 |
Location
| Turkey | 608 |
| Australia | 341 |
| Canada | 254 |
| China | 180 |
| Indonesia | 149 |
| United States | 143 |
| United Kingdom | 130 |
| Germany | 117 |
| Taiwan | 111 |
| California | 110 |
| Spain | 107 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 2 |
Peer reviewed: Gorden, Belita – Reading World, 1983
Compiles information about technical properties from test reviews and manuals and presents information about the nature of test passages, questions, and comprehension skills. (FL)
Descriptors: Evaluation Criteria, Postsecondary Education, Reading Tests, Test Construction
Peer reviewed: Silverstein, A. B.; And Others – Child Development, 1982
Reports on a study designed (1) to modify one of the few existing standardized tests of conservation so that it can be used to assess the conservation of identity as well as the conservation of equivalence and (2) to use both versions of the test to gather additional evidence on the question of developmental priority among young children. (MP)
Descriptors: Conservation (Concept), Error Patterns, Research Problems, Test Construction
Peer reviewed: Atkinson, Bill – Journal of Learning Disabilities, 1982
The author critiques the program design and educational aspects of the Shell Games, a program developed by Apple Computer, Inc., which can be used by the teacher to design objective tests for adaptation to specific assessment needs. (For related articles, see EC 142 959-962.) (Author)
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Microcomputers, Programing
Peer reviewed: Raju, Nambury S. – Psychometrika, 1979
An important relationship is given for two generalizations of coefficient alpha: (1) Rajaratnam, Cronbach, and Gleser's generalizability formula for stratified-parallel tests, and (2) Raju's coefficient beta. (Author/CTM)
Descriptors: Item Analysis, Mathematical Formulas, Test Construction, Test Items
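For orientation, coefficient alpha itself (the quantity the two formulas above generalize) can be computed from a persons-by-items score matrix. The sketch below is a minimal illustration and is not taken from the article. The function name `coefficient_alpha` and the sample data are hypothetical, and population variances are assumed throughout.

```python
from statistics import pvariance


def coefficient_alpha(scores):
    """Coefficient alpha for a persons-by-items matrix (list of rows).

    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores)).
    Population variances are an assumption of this sketch; sample variances
    give the same alpha as long as one convention is used consistently.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Variance of each item (column) across persons.
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    # Variance of each person's total score.
    total_var = pvariance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical data: three persons, two items that agree perfectly.
example = [[1, 1], [2, 2], [3, 3]]
alpha = coefficient_alpha(example)
```

With two perfectly consistent items, as in the hypothetical data, alpha equals 1; less consistent items give lower values.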
Peer reviewed: Merklein, Richard A. – Volta Review, 1981
A short speech perception test for severely and profoundly deaf children (4 to 19 years old) was developed which incorporates "distinctive feature" elements in a minimal contrast, forced choice, word-picture format. (Author)
Descriptors: Deafness, Elementary Secondary Education, Perception Tests, Speech Tests
Peer reviewed: Dunlap, William P.; Brennen, Alison H. – Journal of Learning Disabilities, 1981
The article describes a diagnostic procedure for assessing children's mental images and knowledge of cardinal numbers, 0 through 9. The diagnostic procedure includes the assessment of a child's visual memory, visual perception, symbol recognition, oral naming of numerals, and symbol-set linkage. (Author/SBH)
Descriptors: Diagnostic Tests, Elementary Education, Learning Disabilities, Mathematics
Peer reviewed: Fox, Robert A. – Journal of School Health, 1980
Some practical guidelines for developing multiple choice tests are offered. Included are three steps: (1) test design; (2) proper construction of test items; and (3) item analysis and evaluation. (JMF)
Descriptors: Guidelines, Objective Tests, Planning, Test Construction
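The third step, item analysis, is often summarized with two statistics per item: difficulty (proportion correct) and an upper-lower discrimination index. The sketch below illustrates that common approach and is not necessarily the procedure the article recommends. The function name `item_analysis`, the half-split grouping, and the sample responses are all assumptions made for illustration.

```python
def item_analysis(responses):
    """Return (difficulty, discrimination) for each item.

    responses: list of per-examinee lists of 0/1 item scores.
    Difficulty is the proportion of examinees answering correctly.
    Discrimination is the proportion correct in the top-scoring half
    minus the proportion correct in the bottom-scoring half
    (a half split is an assumption of this sketch).
    """
    n = len(responses)
    if n < 2:
        raise ValueError("need at least two examinees")
    k = len(responses[0])
    totals = [sum(r) for r in responses]
    # Examinee indices ordered from lowest to highest total score.
    order = sorted(range(n), key=lambda i: totals[i])
    half = n // 2
    lower, upper = order[:half], order[n - half:]
    results = []
    for j in range(k):
        p = sum(r[j] for r in responses) / n
        p_upper = sum(responses[i][j] for i in upper) / half
        p_lower = sum(responses[i][j] for i in lower) / half
        results.append((p, p_upper - p_lower))
    return results


# Hypothetical responses: four examinees, three items.
stats = item_analysis([[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]])
```

A discrimination value near zero or below zero flags an item that high scorers do not answer correctly more often than low scorers, which usually prompts a review of the item.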
Berk, Ronald A. – Educational Technology, 1980
Examines four factors involved in the determination of how many test items should be constructed or sampled for a set of objectives: (1) the type of decision to be made with results, (2) importance of objectives, (3) number of objectives, and (4) practical constraints. Specific guidelines that teachers and evaluators can use and an illustrative…
Descriptors: Behavioral Objectives, Criterion Referenced Tests, Guidelines, Test Construction
Peer reviewed: de Gruijter, Dato N. M. – Journal of Educational Measurement, 1997
K. May and W. A. Nicewander recently concluded (1994) that percentile ranks are inferior to raw scores as indicators of latent ability. It is argued that their conclusions are incorrect, and an error in their derivation is identified. The incorrect equation results in an incorrect conclusion, as work by F. M. Lord (1980) also indicates.…
Descriptors: Equations (Mathematics), Estimation (Mathematics), Raw Scores, Statistical Distributions
Peer reviewed: May, Kim O.; Nicewander, W. Alan – Journal of Educational Measurement, 1997
Dato de Gruijter is correct in the recent conclusion that one equation derived by the present authors should be changed to reflect that it is an approximation, but it is still argued that the original conclusion holds: percentile ranks for difficult tests can have substantially lower reliability and information than their number-correct scores. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Raw Scores, Reliability
Peer reviewed: Zwick, Rebecca; Thayer, Dorothy T.; Mazzeo, John – Applied Measurement in Education, 1997
Differential item functioning (DIF) assessment procedures for items with more than two ordered score categories, referred to as polytomous items, were evaluated. Three descriptive statistics (standardized mean difference and two procedures based on the SIBTEST computer program) and five inferential procedures were used. Conditions under which the…
Descriptors: Item Bias, Research Methodology, Statistical Inference, Test Construction
Peer reviewed: Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction
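The assertion the article questions is usually derived from the classical attenuation relation. The block below sketches that textbook argument. The notation is an assumption of this sketch, not the article's: $\rho_{XY}$ is the validity coefficient, $\rho_{XX'}$ and $\rho_{YY'}$ are the reliabilities of the measure and the criterion, and $\rho_{T_X T_Y}$ is the correlation between true scores.

```latex
% Classical test theory, assuming measurement errors are uncorrelated
% with true scores and with each other:
\rho_{XY} = \rho_{T_X T_Y}\,\sqrt{\rho_{XX'}\,\rho_{YY'}}
% Because |\rho_{T_X T_Y}| \le 1 and \rho_{YY'} \le 1:
\rho_{XY} \le \sqrt{\rho_{XX'}}
```

The bound depends on the assumptions stated in the comments, and it is a ceiling on validity rather than a guarantee that validity moves in step with reliability.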
Peer reviewed: Hattie, John; And Others – Applied Psychological Measurement, 1996
A simulation study was conducted to evaluate the dependability of the "T" index of unidimensionality developed by W. F. Stout and used in his DIMTEST procedure. DIMTEST was found to provide dependable indications of unidimensionality, to be reasonably robust, and to allow for practical demarcation between one and many dimensions. (SLD)
Descriptors: Factor Analysis, Item Response Theory, Robustness (Statistics), Simulation
Peer reviewed: Wang, Tianyou; Kolen, Michael J. – Applied Psychological Measurement, 1996
A quadratic curve test equating method for equating different test forms under a random-groups data collection design is proposed that equates the first three central moments of the test forms. When applied to real test data, the method performs as well as other equating methods. Procedures for implementing the method are described. (SLD)
Descriptors: Data Collection, Equated Scores, Standardized Tests, Test Construction
Peer reviewed: Gignac, Gilles; Vernon, Philip A. – Intelligence, 2003
Created an adaptation of the Digit Symbol subtest of the Wechsler Adult Intelligence Scale, the Digit Symbol Rotation test, and evaluated its "g" loading with 54 adults. Results suggest the Digit Symbol Rotation test has more factorial validity than Digit Symbol, but remains equally easy to administer and score. (SLD)
Descriptors: Adults, Factor Structure, Intelligence, Intelligence Tests


