Glas, Cees A. W. – 1996
In this paper it is shown that differential item functioning can be evaluated using the Lagrange multiplier test or C. R. Rao's efficient score test. The test is presented in the framework of a number of item response theory (IRT) models such as the Rasch model, the one-parameter logistic model, the two-parameter logistic model, the generalized…
Descriptors: Foreign Countries, Item Bias, Item Response Theory, Scores
Johanson, George; Motlomelo, Samuel – 1998
Many textbooks in educational measurement and classroom assessment have chapters devoted to specific item formats. There may be attempts to relate one item format to another, but the chapters and item formats are largely seen as distinct entities with only loose and uncertain connections. This paper synthesizes these discussions. An item format…
Descriptors: Educational Assessment, Essay Tests, Measurement Techniques, Objective Tests
Yamamoto, Kentaro; Kulick, Edward – 1992
Test items are designed to be representative of the subject areas that they measure and to reflect the importance of specific domains or item types within those subject areas. Content validity is achieved by content specification and number of items in each content domain included in the design of the test. However, largely due to the normal…
Descriptors: Content Validity, Elementary Secondary Education, Field Tests, Mathematical Models
Doye, Peter – 1991
In foreign language testing, as in all testing, validity is the primary criterion for test quality. However plausible the concept of validity, in practice it is not always easy to arrive at congruence between the test situation and the real-life situation the learner is expected to master. Some language educators make authenticity a major…
Descriptors: Language Tests, Performance Based Assessment, Pragmatics, Second Language Instruction
Hambleton, Ronald K.; And Others – 1990
Item response theory (IRT) model parameter estimates have considerable merit and open up new directions for test development, but misleading results are often obtained because of errors in the item parameter estimates. The problem of the effects of item parameter estimation errors on the test development process is discussed, and the seriousness…
Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Sampling
Ackerman, Terry A. – 1991
This paper examines the effect of using unidimensional item response theory (IRT) item parameter estimates of multidimensional items to create weakly parallel test forms using target information curves. To date, all computer-based algorithms that have been devised to create parallel test forms assume that the items are unidimensional. This paper…
Descriptors: Algorithms, Equations (Mathematics), Estimation (Mathematics), Item Response Theory
Phillips, S. E.; Anderson, A. E. – 1983
The LOGTRUE program can be used to obtain a scale of equated raw scores for two tests with parameter estimates on a common item response theory scale. The program derives its name from the method of logistic true score equating described by Lord (1980). The method can be applied to two tests with overlapping items administered to different groups…
Descriptors: Computer Programs, Equated Scores, Group Testing, Latent Trait Theory
Tsutakawa, Robert K. – 1983
This paper presents a method for estimating certain characteristics of test items which are designed to measure ability, or knowledge, in a particular area. Under the assumption that ability parameters are sampled from a normal distribution, the EM algorithm is used to derive maximum likelihood estimates of item parameters of the two-parameter…
Descriptors: Attitude Measures, Estimation (Mathematics), Latent Trait Theory, Maximum Likelihood Statistics
Koch, William R. – 1983
The technique of nonmetric multidimensional scaling (MDS) was applied to real item response data obtained from a multiple-choice achievement test of unknown dimensionality. The goal was to classify the 50 items into the various subtests from which they were drawn originally, the latter being unknown to the investigator. Issues addressed in the…
Descriptors: Achievement Tests, Cluster Analysis, Latent Trait Theory, Multidimensional Scaling
Zhang, Jinming – ETS Research Report Series, 2005
Lord's bias function and the weighted likelihood estimation method are effective in reducing the bias of the maximum likelihood estimate of an examinee's ability under the assumption that the true item parameters are known. This paper presents simulation studies to determine the effectiveness of these two methods in reducing the bias when the item…
Descriptors: Statistical Bias, Maximum Likelihood Statistics, Computation, Ability
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…
Descriptors: Test Items, Equated Scores, Correlation, Tests
Alberta Dept. of Education, Edmonton. – 1990
The Alberta (Canada) Department of Education issues "Grade 12 Diploma Examinations" for various disciplines every six months (dated January and June of each year). Except for the English and Francais examinations (which are issued only in their own language), the examinations are issued in an English edition and a French edition. The…
Descriptors: Achievement Tests, Educational Assessment, Foreign Countries, Grade 12
Veccia, Ellen M.; Schroeder, David H. – 1990
A set of 150 experimental personality items was constructed for an alternate form of the word association personality worksample developed by the Johnson O'Connor Research Foundation. The items were intended to possess several semantic properties hypothesized to facilitate discrimination between objective and subjective examinees. Specifically,…
Descriptors: Adults, Correlation, Objectivity, Personality Measures
McLarty, Joyce R.; And Others – 1988
The effects of superficial gender-related item wording changes on the performance of male and female examinees were studied through mathematics; discrete English items; and an English passage created in neuter, male, and female gender versions. Units of items were administered to randomly equivalent samples of about 250 examinees taking American…
Descriptors: Difficulty Level, English Instruction, Item Analysis, Mathematics Tests
Anderson, Paul S. – 1987
A recent innovation in the area of educational measurement is MDT multi-digit testing, a machine-scored near-equivalent to "fill-in-the-blank" testing. The MDT method is based on long lists (or "Answer Banks") that contain up to 1,000 discrete answers, each with a three-digit label. Students taking an MDT multi-digit test mark…
Descriptors: College Students, Computer Assisted Testing, Higher Education, Scoring

