ERIC - Search Results

Showing 1 to 15 of 105 results

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Local Placement Test Retrofit and Building Language Assessment Literacy with Teacher Stakeholders: A Case Study from Colombia

Peer reviewed

Janssen, Gerriet – Language Testing, 2022

This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…

Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty

Proficiency Exams in Teaching Turkish as a Foreign Language in TÖMER (Turkish and Foreign Languages Research and Application Centers)

Peer reviewed
PDF on ERIC

Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020

Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…

Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Item Discrimination: When More Is Worse.

Peer reviewed

Masters, Geoffrey N. – Journal of Educational Measurement, 1988

High item discrimination can indicate a special kind of measurement disturbance via an item that gives high-ability persons a special advantage. The measurement disturbance is described, which occurs when an item is sensitive to individual differences on a second, undesired dimension that is correlated with the variable intended to be measured.…

Descriptors: Academically Gifted, Item Analysis, Test Bias, Test Wiseness

Two New Test Statistics for the Rasch Model.

Peer reviewed

van den Wollenberg, Arnold L. – Psychometrika, 1982

Presently available test statistics for the Rasch model are shown to be insensitive to violations of the assumption of test unidimensionality. Two new statistics are presented. One is similar to available statistics, but with some improvements; the other addresses the problem of insensitivity to unidimensionality. (Author/JKS)

Descriptors: Item Analysis, Latent Trait Theory, Statistics, Test Reliability

A Table for Prorating Incomplete Form R MMPIs.

Peer reviewed

Streiner, David L.; Miller, Harold R. – Journal of Consulting and Clinical Psychology, 1979

A table is provided and described for prorating Minnesota Multiphasic Personality Inventory scales when the entire Form R has not been completed. Good concordance of profile types was found for 300 and 350 completed questions. Interpretations based on 200 items may be suspect. (Author)

Descriptors: Item Analysis, Patients, Personality Assessment, Personality Measures

Recommendations for Accommodations: Implications of (In)consistency

Peer reviewed

Ketterlin-Geller, Leanne R. – Remedial and Special Education, 2007

When accurately assigned and administered appropriately, testing accommodations help ameliorate the effects of personal characteristics that limit access to critical information and prevent a person from demonstrating his or her true abilities in the tested domain. Inaccurate assignment or misuse of accommodations may counteract the benefits of…

Descriptors: Testing Accommodations, Individualized Instruction, Individualized Education Programs, Error of Measurement

Finite Measures from Perfect Scores.

PDF pending restoration

Wilson, Mark; Wright, Benjamin D. – 1983

A common problem in practical educational research is that of perfect scores which result when latent trait models are used. A simple procedure for managing the perfect and zero response problem encountered in converting test scores into measures is presented. It allows the test user to choose among two or three reasonable finite representations of…

Descriptors: Factor Analysis, Item Analysis, Latent Trait Theory, Mathematical Models

Deception in Testing: An Investigation of the Tennessee Self Concept Scale.

Garrison, Wayne M.; Stanwyck, Douglas J. – 1979

The susceptibility to faking on the Tennessee Self Concept Scale was examined among college students. Additionally, groups of respondents, instructed to respond in a "random" fashion to pre-determined numbers of items in the TSCS, were subjected to a plausibility analysis of their test response vectors using the Rasch measurement model.…

Descriptors: College Students, Higher Education, Item Analysis, Response Style (Tests)

Detection of Aberrant Response Patterns and Their Effect on Dimensionality.

Peer reviewed

Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – Journal of Educational Statistics, 1982

Two indices for measuring the degree of conformity or consistency of an individual examinee's response pattern on a set of items are developed. The use of the indices for spotting aberrant response patterns of examinees is detailed. (Author/JKS)

Descriptors: Error of Measurement, Error Patterns, Goodness of Fit, Item Analysis

Objective-Referenced-Test Rescore Decisions and Item Statistics: A Matter of Congruence.

Shannon, Gregory A. – 1983

Rescoring of Center for Occupational and Professional Assessment objective-referenced tests is decided largely by content experts selected by client organizations. A few of the test items, statistically flagged for review, are not rescored. Some of this incongruence could be due to the use of the biserial correlation (r-biserial) as an…

Descriptors: Adults, Criterion Referenced Tests, Item Analysis, Occupational Tests

Estimation of Latent Trait Status Using Adaptive Testing Procedures.

Sympson, James B. – 1976

Latent trait test score theory is discussed primarily in terms of Birnbaum's three-parameter logistic model, and with some reference to the Rasch model. Equations and graphic illustrations are given for item characteristic curves and item information curves. An example is given for a hypothetical 20-item adaptive test, showing cumulative results…

Descriptors: Adaptive Testing, Bayesian Statistics, Item Analysis, Latent Trait Theory

Joint-Space Analysis of "Pick-Any" Data: Analysis of Choices from an Unconstrained Set of Alternatives.

Peer reviewed

Levine, Joel H. – Psychometrika, 1979

Social and naturally occurring choice phenomena are often of the "pick any" type in which the number of choices made by a subject as well as the set of alternatives from which they are chosen is unconstrained. A model and scaling method for these data are introduced. (Author/JKS)

Descriptors: Data Analysis, Item Analysis, Mathematical Models, Multidimensional Scaling

Robust Estimation of Ability in the Rasch Model.

Wainer, Howard; Wright, Benjamin D. – 1980

The pure Rasch model was compared with four modifications of the model in a number of different simulations in order to ascertain the comparative efficiencies of the parameter estimations of these modifications. Because there is always noise in test score data, some individuals may have response patterns that do not fit the model and their…

Descriptors: Error of Measurement, Guessing (Tests), Item Analysis, Latent Trait Theory
