Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Dimitrov, Dimiter M. – 1994
An approach is described that reveals the hierarchical test structure (HTS) based on the cognitive demands of the test items, and conducts a linear trait modeling by using the HST elements as item difficulty components. This approach, referred to as the Hierarchical Latent Trait Approach (HLTA), employs an algorithm that allows all test items to…
Descriptors: Algorithms, Cognitive Processes, Difficulty Level, Higher Education
PDF pending restorationZwick, Rebecca; And Others – 1994
A previous simulation study of methods for assessing item functioning (DIF) in computer-adaptive tests (CATs) showed that modified versions of the Mantel-Haenszel and standardization methods work well with CAT data. In that study, data were generated using the three-parameter logistic (3PL) model, and this same model was assumed in obtaining item…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Computer Simulation
Beller, Michal – 1992
It has previously been shown by M. Beller (1990) that an additive tree (Addtree, a hierarchical tree representation of similarity data developed by S. Sattath and A. Tversky in 1977), may be useful for representing the structure between tests and items through the similarity among them as measured by their intercorrelations. In this study, the…
Descriptors: College Entrance Examinations, Decision Making, Difficulty Level, Equations (Mathematics)
Wasem, Jim – 1993
"Pickleball" is a new racquet sport which is one of the fastest growing educational activities in the Northwest. This paper describes the development of a test battery designed to measure students' pickleball skills for purposes of classification; to determine improvement of playing skills; and to aid in grading of individual…
Descriptors: Higher Education, Physical Education, Preservice Teacher Education, Racquet Sports
Scriven, Michael – 1991
An alternative to multiple-choice testing is suggested for educational assessment. The use of what is called "multiple-rating items" is proposed. A multiple-rating item calls for the examinee to rate all of a set of things instead of picking one as with a multiple-choice item. The respondent has to provide a specific rating of each…
Descriptors: Educational Assessment, Essay Tests, Higher Education, Multiple Choice Tests
Tatsuoka, Kikumi K. – 1991
Constructed-response formats are desired for measuring complex and dynamic response processes that require the examinee to understand the structures of problems and micro-level cognitive tasks. These micro-level tasks and their organized structures are usually unobservable. This study shows that elementary graph theory is useful for organizing…
Descriptors: Adult Literacy, Cognitive Measurement, Cognitive Processes, Constructed Response
Nandakumar, Ratna – 1992
The performance of the following four methodologies for assessing unidimensionality was examined: (1) DIMTEST; (2) the approach of P. W. Holland and P. R. Rosenbaum; (3) linear factor analysis; and (4) non-linear factor analysis. Each method is examined and compared with other methods using simulated data sets and real data sets. Seven data sets,…
Descriptors: Ability, Comparative Testing, Correlation, Equations (Mathematics)
Wang, Tianyou; Kolen, Michael J. – 1994
In this paper a quadratic curve equating method for different test forms under a random-group data-collection design is proposed. Procedures for implementing this method and related issues are described and discussed. The quadratic-curve method was evaluated with real test data (from two 30-item subtests for a professional licensure examination…
Descriptors: Comparative Analysis, Data Collection, Equated Scores, Goodness of Fit
O'Neal, Marcia R.; Chissom, Brad S. – 1993
Results obtained using three methods for gathering attitude data were compared. The methods are ranking, paired comparisons, and the Likert-type scale. Three attitude objects, each consisting of five items, were selected or developed for this study. One set had been used with graduate students previously. Participants were 392 students in…
Descriptors: Attitude Measures, College Students, Comparative Analysis, Correlation
Perkins, Kyle; And Others – 1994
This paper reports the results of using a three-layer backpropagation artificial neural network to predict item difficulty in a reading comprehension test. Two network structures were developed, one with and one without a sigmoid function in the output processing unit. The data set, which consisted of a table of coded test items and corresponding…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Expert Systems, Item Analysis
Chang, Lei; And Others – 1994
The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…
Descriptors: Economics, Evaluators, Experience, Interrater Reliability
Clauser, Brian E.; And Others – 1991
This paper explores the effectiveness of the Mantel-Haenszel (MH) statistic in detecting differentially functioning test items when the internal criterion is varied. Using a data set from the 1982 statewide administration of a 150-item life skills examination (the New Mexico High School Proficiency Examination), a randomly selected sample of 1,000…
Descriptors: American Indians, Anglo Americans, Comparative Testing, High School Students
Goodman, Gay; And Others – 1991
This instructor's manual provides numerous suggestions for observational activities, out-of-class assignments and evaluative strategies for undergraduate and graduate students, and follows the organization of the textbook, "Applying Educational Psychology in the Classroom." The book is organized into two sections--the instructor's manual and test…
Descriptors: Assignments, Educational Psychology, Elementary Secondary Education, Evaluation Methods
Cizek, Gregory J. – 1991
A commonly accepted rule for developing equated examinations using the common-items non-equivalent groups (CINEG) design is that items common to the two examinations being equated should be identical. The CINEG design calls for two groups of examinees to respond to a set of common items that is included in two examinations. In practice, this rule…
Descriptors: Certification, Comparative Testing, Difficulty Level, Higher Education
Illinois State Board of Education, Springfield. – 1984
The 4th Grade Test (1984) of the Illinois Inventory of Educational Progress includes 18 reading items, 27 geometry items, 31 science items, a 27-item student questionnaire regarding science opinions, and 40 mathematics items. The test booklet only is included here. (PN)
Descriptors: Educational Assessment, Geometry, Grade 4, Intermediate Grades


