ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Test Format	18
Test Theory	18
Test Validity	18
Test Construction	8
Test Items	7
Student Evaluation	5
Higher Education	4
Multiple Choice Tests	4
Test Reliability	4
Difficulty Level	3
English (Second Language)	3
Evaluation Methods	3
Foreign Countries	3
Item Analysis	3
Item Response Theory	3
Language Tests	3
Latent Trait Theory	3
Psychometrics	3
Reading Tests	3
Second Language Instruction	3
Second Language Learning	3
Testing Problems	3
Achievement Tests	2
Comparative Analysis	2
Comparative Testing	2
More ▼

Source

Annual Review of Applied…	1
Applied Psychological…	1
Edinburgh Working Papers in…	1
Educational and Psychological…	1
Journal of Economic Education	1
Journal of Interactive Online…	1
Journal of Research in Reading	1
Review of Educational Research	1
Teacher Education Quarterly	1

Publication Type

Journal Articles	9
Reports - Research	9
Information Analyses	4
Opinion Papers	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Canada	1
Netherlands	1
Sweden	1
United Kingdom (England)	1
United Kingdom (Northern…	1
United Kingdom (Wales)	1
Utah	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Defining Issues Test	1
Embedded Figures Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Rasch Scaling and Reading Tests.

Peer reviewed

Pumfrey, Peter D. – Journal of Research in Reading, 1987

Discusses, for the benefit of research workers and other test users, the ongoing controversy concerning the relative merits of conventional test theory and Rasch scaling in the construction of reading tests. Concludes that a great deal of further research is required to see whether these approaches are educationally valid. (JD)

Descriptors: Reading Research, Reading Tests, Test Construction, Test Format

The Radex Structure of Intelligence: A Replication.

Peer reviewed

Adler, Nurit; Guttman, Ruth – Educational and Psychological Measurement, 1982

Thirteen ability tests were administered as defined within a mapping sentence containing four content facets: rule type, expression mode, language of communication and dimensionality of portrayed object. Smallest Space Analysis of intercorrelations among test scores showed the radex structure of the two-dimensional space conformed to the…

Descriptors: Content Analysis, Factor Structure, Intelligence Tests, Scores

Administering Defining Issues Test Online: Do Response Modes Matter?

Peer reviewed

Direct link

Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007

The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…

Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory

Domain-Referenced Testing of Reading Achievement.

Brittain, Mary M.; Brittain, Clay V. – 1981

A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…

Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

A Comparison of Two Item Selection Procedures for Building Criterion-Referenced Tests.

Download full text

Haladyna, Tom; Roid, Gale – 1981

Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…

Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction

Developments in Language Testing.

Peer reviewed

Douglas, Dan – Annual Review of Applied Linguistics, 1995

Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…

Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests

Implications for Altering the Context in Which Test Items Appear: A Historical Perspective on an Immediate Concern.

Peer reviewed

Leary, Linda F.; Dorans, Neil J. – Review of Educational Research, 1985

Research on the potential effects of different item arrangement schemes on item statistics is reviewed for three separate periods. Earliest studies investigated the simple main effect of item order on test performance. The late 1960s emphasized interactions between item order and examinees' characteristics. Current concern focuses on item…

Descriptors: Achievement Tests, Aptitude Tests, Item Analysis, Latent Trait Theory

Informal Reasoning Assessment: Using Verbal Reports of Thinking to Improve Multiple-Choice Test Validity. Technical Report No. 430.

Download full text

Norris, Stephen P. – 1988

A study examined whether the process of gathering verbal reports of subjects' thinking while taking multiple-choice critical thinking tests could be used to infer the reasoning process used and identify test items which do not require critical thinking skills. Four factors can render an inference of a subject's critical thinking skills…

Descriptors: Cognitive Processes, Critical Thinking, High School Students, High Schools

State Refinements to the ESEA Title I Evaluation and Reporting System: Utah 1979-80 Project. Final Report.

Download full text

White, Karl; And Others – 1981

To explain discrepancies in Utah's elementary school test results under the Elementary and Secondary Education Act's Title I Evaluation and Reporting System (TIERS), researchers investigated the adequacy and validity of TIERS evaluation models. Model A (norm-referenced testing) is used in most Utah school districts, in preference to Models B or C…

Descriptors: Achievement Tests, Elementary Education, Evaluation Methods, Norm Referenced Tests

Guide to Scoring LEP Student Responses to Open-Ended Science Items. SCASS LEP Consortium Project.

Download full text

Kopriva, Rebecca; Sexton, Ursula M. – 1999

To date, little work has been done to ensure limited English proficient (LEP) students are accurately assessed on a large scale. The purpose of this guide is to help scorers in high volume situations to be able to effectively evaluate the open-ended responses of this population. Section one of this guide presents a brief overview of the State…

Descriptors: English (Second Language), Examiners, Factor Analysis, Limited English Speaking

Steps and Recommendations for More Accurate Placement Test Creation.

Download full text

Murray, Joel R. – 2001

This paper aims to provide practical advice for creating a placement test for English-as-a-Second-Language (ESL) or English-as-a-foreign-language (EFL) instruction. Three forms of concrete assistance are provided: a detailed literature review; detailed steps focusing on the creation of placement tests; and a set of recommendations focusing on…

Descriptors: English (Second Language), Examiners, Factor Analysis, Literature Reviews

An Alignment/Transfer Experiment with Low Socioeconomic Level Students.

Peer reviewed

Elia, June Isaacs – Teacher Education Quarterly, 1994

This study examined the amount of variance explained by alignment of testing to instruction among low socioeconomic level fourth graders, proposing two instructional alignment hypotheses. Results indicated that alignment had an unusually high effect. Low performing low socioeconomic level students achieved high success levels when conditions of…

Descriptors: Culture Fair Tests, Disadvantaged Youth, Elementary Education, Grade 4

Analysis of Differential Item Functioning in Translated Assessment Instruments.

Peer reviewed

Budgell, Glen R.; And Others – Applied Psychological Measurement, 1995

The usefulness of three item response theory-based methods and the Mantel Haenszel technique in evaluating the measurement equivalence of translated assessment instruments was demonstrated in a study involving 2,000 French-speaking Canadian adults who took a French test translation and 2,000 English-speaking adults who took the English original.…

Descriptors: Adults, Chi Square, Cultural Awareness, Culture Fair Tests

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

Measurement Characteristics of the Finding Embedded Figures Test in "Speed" versus "Power" Administrations.

Download full text

Melancon, Janet G.; Thompson, Bruce – 1990

Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…

Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Adler, Nurit	1
Brittain, Clay V.	1
Brittain, Mary M.	1
Budgell, Glen R.	1
Dorans, Neil J.	1
Douglas, Dan	1
Elia, June Isaacs	1
Guttman, Ruth	1
Haladyna, Tom	1
Iran-Nejad, Asghar	1
Kiely, Gerard L.	1
Kopriva, Rebecca	1
Leary, Linda F.	1
Lynch, Tony	1
Melancon, Janet G.	1
Murray, Joel R.	1
Norris, Stephen P.	1
Pumfrey, Peter D.	1
Robson, Denise	1
Roid, Gale	1
Sexton, Ursula M.	1
Thoma, Stephen J.	1
Thompson, Bruce	1
Wainer, Howard	1
More ▼