Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 0
Since 2016 (last 10 years): 0
Since 2006 (last 20 years): 2
Descriptor
Item Analysis: 20
Statistical Analysis: 20
Testing Problems: 20
Test Validity: 9
Test Construction: 8
Test Bias: 6
Test Items: 6
Achievement Tests: 5
Latent Trait Theory: 5
Adaptive Testing: 3
Culture Fair Tests: 3
Source
Applied Measurement in Education: 1
Assessment in Education: Principles, Policy & Practice: 1
Journal of Economic Education: 1
Psychometrika: 1
Publication Type
Reports - Research: 16
Speeches/Meeting Papers: 5
Journal Articles: 3
Reports - Descriptive: 1
Reports - Evaluative: 1
Education Level
Secondary Education: 1
Audience
Researchers: 3
Location
Netherlands: 1
Laws, Policies, & Programs
Emergency School Aid Act 1972: 1
Assessments and Surveys
California Achievement Tests: 1
Metropolitan Readiness Tests: 1
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. The latter is the situation in the system of final examinations for secondary education in the Netherlands, which is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
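The decision-level precision question raised in this abstract can be made concrete with a small simulation. The sketch below is not the authors' model; the number of subjects, the reliabilities, the grade scale, the cut score, and the compensatory pass rule are all hypothetical assumptions used only to show how per-test measurement error propagates into pass/fail decision consistency.

```python
# Illustrative sketch (hypothetical values, not the authors' model): how
# measurement error in several subject tests propagates into the
# consistency and accuracy of a pass/fail decision based on all of them.
import numpy as np

rng = np.random.default_rng(0)
n_students, n_subjects = 100_000, 6
reliability = 0.85                 # assumed reliability of each subject test
cut = 5.5                          # assumed passing cut on a 1-10 grade scale

# True subject grades plus two parallel error-prone measurements of each.
true = rng.normal(6.3, 1.0, size=(n_students, n_subjects))
err_sd = np.sqrt((1 - reliability) / reliability)   # true-score SD is 1.0
obs1 = true + rng.normal(0.0, err_sd, size=true.shape)
obs2 = true + rng.normal(0.0, err_sd, size=true.shape)

# Compensatory decision rule: pass if the mean grade clears the cut.
pass1 = obs1.mean(axis=1) >= cut
pass2 = obs2.mean(axis=1) >= cut
pass_true = true.mean(axis=1) >= cut

print("decision consistency (two replications):", np.mean(pass1 == pass2))
print("decision accuracy (vs. true status):", np.mean(pass1 == pass_true))
```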
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small alpha levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
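The logic shared by matching-based copying indexes can be sketched in a few lines: count the identical responses between a suspected copier and a source, and compare that count with what independent responding would produce. The version below is a deliberately simplified stand-in, not Wollack's omega index; using empirical option proportions as the independence model is an assumption made only for illustration.

```python
# Simplified matching-based copying statistic: observed identical responses
# between two examinees versus the number expected under independent
# responding. Not a specific published index; the option-probability model
# (empirical option proportions per item) is an assumption for illustration.
import numpy as np

def copying_z(source_resp, copier_resp, option_props):
    """source_resp, copier_resp: chosen option (int) per item.
    option_props: (n_items, n_options) probabilities of each option,
    used as the copier's independent-responding model."""
    n_items = len(source_resp)
    matches = np.sum(source_resp == copier_resp)
    # Per item, the match probability is the copier's probability of
    # choosing whatever option the source chose.
    p_match = option_props[np.arange(n_items), source_resp]
    expected = p_match.sum()
    variance = np.sum(p_match * (1.0 - p_match))
    return (matches - expected) / np.sqrt(variance)

# Toy usage: 5 items with 4 equally attractive options each.
props = np.full((5, 4), 0.25)
src = np.array([0, 1, 2, 3, 0])
cop = np.array([0, 1, 2, 0, 0])
print(round(copying_z(src, cop, props), 2))
```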
Sympson, James B. – 1976
Latent trait test score theory is discussed primarily in terms of Birnbaum's three-parameter logistic model, and with some reference to the Rasch model. Equations and graphic illustrations are given for item characteristic curves and item information curves. An example is given for a hypothetical 20-item adaptive test, showing cumulative results…
Descriptors: Adaptive Testing, Bayesian Statistics, Item Analysis, Latent Trait Theory
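For reference, the three-parameter logistic model named in this abstract defines the item characteristic curve and the item information curve in the standard forms below, where \(a_i\), \(b_i\), and \(c_i\) are the discrimination, difficulty, and pseudo-guessing parameters of item \(i\).

\[
P_i(\theta) = c_i + \frac{1 - c_i}{1 + e^{-a_i(\theta - b_i)}},
\qquad
I_i(\theta) = a_i^2\,\frac{1 - P_i(\theta)}{P_i(\theta)}\left[\frac{P_i(\theta) - c_i}{1 - c_i}\right]^2 .
\]

Under local independence, the cumulative information of an adaptive test is the sum of \(I_i(\theta)\) over the administered items.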
Frederiksen, Norman – 1976
A number of different ways of ascertaining whether or not a test measures the same thing in different cultures are examined. Methods range from some that are obvious and simple to those requiring statistical and psychological sophistication. Simpler methods include such things as having candidates "think aloud" and interviewing them about how they…
Descriptors: Analysis of Covariance, Culture Fair Tests, Factor Analysis, Item Analysis
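One of the statistically sophisticated checks this abstract alludes to, comparing factor structures across cultural groups, is commonly summarized with Tucker's congruence coefficient between the loading vectors \(x\) and \(y\) of the same factor in two groups; this particular statistic is offered here as an illustration and is not necessarily the one used in the paper.

\[
\phi(x, y) = \frac{\sum_i x_i y_i}{\sqrt{\left(\sum_i x_i^2\right)\left(\sum_i y_i^2\right)}}
\]

Values near 1 are usually read as the factor measuring essentially the same construct in both groups.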
Doolittle, Allen E. – 1985
Differential item performance (DIP) is discussed as a concept that does not necessarily imply item bias or unfairness to subgroups of examinees. With curriculum-based achievement tests, DIP is presented as a valid reflection of group differences in requisite skills and instruction. Using data from a national testing of the ACT Assessment, this…
Descriptors: Achievement Tests, High Schools, Item Analysis, Mathematics Achievement
Miller, M. David; Burstein, Leigh – 1981
Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…
Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models
Green, Donald Ross – 1976
During the past few years the problem of bias in testing has become an increasingly important issue. In most research, bias refers to the fair use of tests and has thus been defined in terms of an outside criterion measure of the performance being predicted by the test. Recently however, there has been growing interest in assessing bias when such…
Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Minority Groups
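The criterion-based notion of bias described here is commonly operationalized (in Cleary's sense) by regressing the predicted performance \(Y\) on the test score \(X\), a group indicator \(G\), and their interaction; this formulation is given as general background rather than as the specific model used in the paper.

\[
Y = \beta_0 + \beta_1 X + \beta_2 G + \beta_3 (X \cdot G) + \varepsilon ,
\]

and the test is regarded as predictively unbiased for the groups compared when \(\beta_2 = \beta_3 = 0\), i.e. when a single regression line describes both groups.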
Kuntz, Patricia – 1982
The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…
Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics
Scheuneman, Janice – 1975
In order to screen out items which may be biased against some ethnic group prior to the final selection of items in test construction, a statistical technique for assessing item bias was developed. Based on a theoretical formulation of R. B. Darlington, the method compares the performance of individuals who belong to different ethnic groups, but…
Descriptors: Achievement Tests, Content Analysis, Cultural Influences, Ethnic Groups
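The conditional-comparison idea behind this kind of screening, comparing groups only after matching them on overall ability, can be sketched as follows. This is a generic chi-square version offered for illustration, not Scheuneman's exact statistic; the total-score strata and the guard conditions are assumptions of the sketch.

```python
# Generic sketch of a conditional item-bias screen: within total-score
# strata (a stand-in for matched ability), compare each group's right/wrong
# split on the studied item with a chi-square test and pool across strata.
# Not Scheuneman's exact statistic; strata choice is an assumption.
import numpy as np
from scipy.stats import chi2_contingency

def conditional_item_check(item_correct, group, total_score, n_strata=4):
    """item_correct: 0/1 answers to the studied item; group: 0/1 membership;
    total_score: matching variable used to form ability strata."""
    edges = np.quantile(total_score, np.linspace(0, 1, n_strata + 1))
    strata = np.clip(np.digitize(total_score, edges[1:-1]), 0, n_strata - 1)
    chi2_total, df_total = 0.0, 0
    for s in range(n_strata):
        in_stratum = strata == s
        table = np.zeros((2, 2))
        for g in (0, 1):
            sel = in_stratum & (group == g)
            table[g, 1] = item_correct[sel].sum()
            table[g, 0] = sel.sum() - table[g, 1]
        # Skip strata where a group or an answer category is empty.
        if table.sum(axis=1).min() > 0 and table.sum(axis=0).min() > 0:
            chi2, _, df, _ = chi2_contingency(table, correction=False)
            chi2_total += chi2
            df_total += df
    return chi2_total, df_total
```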
Waller, Michael I. – 1974
In latent trait models the standard procedure for handling the problem caused by guessing on multiple choice tests is to estimate a parameter which is intended to measure the "guessingness" inherent in an item. Birnbaum's three parameter model, which handles guessing in this manner, ignores individual differences in guessing tendency. This paper…
Descriptors: Goodness of Fit, Guessing (Tests), Individual Differences, Item Analysis

Lord, Frederic M. – Psychometrika, 1974
Omitted items cannot properly be treated as wrong when estimating ability and item parameters. A convenient method for utilizing the information provided by omissions is presented. Theoretical and empirical justifications are presented for the estimates obtained by the new method. (Author)
Descriptors: Academic Ability, Guessing (Tests), Item Analysis, Latent Trait Theory
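The distinction this abstract draws can be illustrated with a Rasch-model ability estimate computed two ways: with omitted items scored as wrong, and with omitted items simply left out of the likelihood. The sketch is only meant to show that the two treatments give different estimates; Lord's own method for using the information in omissions is the subject of the paper, and the item difficulties and response pattern below are hypothetical.

```python
# Maximum-likelihood Rasch ability when omits are (a) scored as wrong and
# (b) excluded from the likelihood. Hypothetical data; this illustrates the
# distinction only and is not Lord's estimator for handling omissions.
import numpy as np
from scipy.optimize import minimize_scalar

def rasch_mle(responses, difficulties):
    """ML ability estimate; responses may contain np.nan for omitted items."""
    keep = ~np.isnan(responses)
    x, b = responses[keep], difficulties[keep]

    def neg_loglik(theta):
        p = 1.0 / (1.0 + np.exp(-(theta - b)))
        return -np.sum(x * np.log(p) + (1.0 - x) * np.log(1.0 - p))

    return minimize_scalar(neg_loglik, bounds=(-4, 4), method="bounded").x

b = np.linspace(-2, 2, 10)                   # hypothetical item difficulties
resp = np.array([1, 1, 1, 1, 0, 1, np.nan, np.nan, np.nan, np.nan])

as_wrong = np.nan_to_num(resp, nan=0.0)      # omits treated as incorrect
print("omits scored wrong:", round(rasch_mle(as_wrong, b), 2))
print("omits ignored:     ", round(rasch_mle(resp, b), 2))
```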

Gohmann, Stephan F.; Spector, Lee C. – Journal of Economic Education, 1989
Compares the effect of content ordering and scrambled ordering on examinations in courses, such as economics, that require quantitative skills. Empirical results suggest that students do no better if they are given a content-ordered rather than a scrambled examination; student performance is not adversely affected by scrambled ordered…
Descriptors: Cheating, Economics Education, Educational Research, Grading
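Stripped of covariates, the comparison described here reduces to testing whether mean performance differs between the two exam forms. The sketch below uses hypothetical scores and a plain two-sample t-test; the paper's own analysis may control for additional student characteristics.

```python
# Two-sample comparison of hypothetical scores on a content-ordered versus
# a scrambled exam form; illustrative only, not the paper's analysis.
import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(1)
content_ordered = rng.normal(72, 10, 120)   # hypothetical exam scores
scrambled = rng.normal(71, 10, 115)

t_stat, p_value = ttest_ind(content_ordered, scrambled)
print(f"t = {t_stat:.2f}, p = {p_value:.3f}")
```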
Rodgers, Ron – 1974
The construct of motivation toward school is vaguely defined. The Junior Index of Motivation (JIM Scale) is one of the few instruments claiming validity in measuring motivation toward school among junior and senior high students. This study discusses the shortcomings of the JIM scale, and compares item and total score characteristics and correlates…
Descriptors: Adolescents, Correlation, Grade Point Average, Item Analysis
Broussard, Rolland L. – 1985
The cultural bias of the Adult Performance Level Assessment, Form AA-1 (APLA) was examined. The potential influence of cultural differences on scores of a major ethnic group, Acadians or Cajuns, was investigated. Assessment items most prone to produce differences in scores were isolated and administered to selected groups. No significant…
Descriptors: Adult Basic Education, Adult Literacy, Culture Fair Tests, Ethnic Groups
Legg, Sue M. – 1982
A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…
Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis
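The Rasch calibration mentioned here rests on the one-parameter logistic model, under which the probability that an examinee with ability \(\theta\) answers item \(i\) (difficulty \(b_i\)) correctly is

\[
P(X_i = 1 \mid \theta) = \frac{e^{\theta - b_i}}{1 + e^{\theta - b_i}},
\]

so that items field-tested on different groups can be placed on a single common difficulty scale for banking.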