ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	10

Descriptor

Classification	15
Statistical Analysis	15
Test Construction	15
Test Items	7
Models	4
Correlation	3
Difficulty Level	3
Foreign Countries	3
Prediction	3
Research Methodology	3
Scores	3
Test Reliability	3
Test Validity	3
Achievement Tests	2
Automation	2
College Entrance Examinations	2
College Students	2
Computational Linguistics	2
Computer Oriented Programs	2
Computer Software	2
Cutting Scores	2
Ethics	2
Evaluation Methods	2
Factor Analysis	2
Factor Structure	2
More ▼

Source

ETS Research Report Series	3
International Journal of…	2
Crime & Delinquency	1
Educational and Psychological…	1
Eurasian Journal of…	1
Learning Disability Quarterly	1
Review of Higher Education	1

Publication Type

Journal Articles	10
Reports - Research	7
Reports - Descriptive	3
Reports - Evaluative	2
Numerical/Quantitative Data	1

Education Level

Higher Education	7
Postsecondary Education	6
Elementary Secondary Education	2

Audience

Location

Canada	1
Israel	1
Pennsylvania (Pittsburgh)	1
Turkey	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	2
Kit of Reference Tests for…	1
Minnesota Multiphasic…	1
Test of English as a Foreign…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Misspecification of Attribute Structure in Diagnostic Measurement

Peer reviewed

Direct link

Liu, Ren – Educational and Psychological Measurement, 2018

Attribute structure is an explicit way of presenting the relationship between attributes in diagnostic measurement. The specification of attribute structures directly affects the classification accuracy resulted from psychometric modeling. This study provides a conceptual framework for understanding misspecifications of attribute structures. Under…

Descriptors: Diagnostic Tests, Classification, Test Construction, Relationship

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed
PDF on ERIC

Download full text

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Empirically Exploring Higher Education Cultures of Assessment

Peer reviewed

Direct link

Fuller, Matthew B.; Skidmore, Susan T.; Bustamante, Rebecca M.; Holzweiss, Peggy C. – Review of Higher Education, 2016

Although touted as beneficial to student learning, cultures of assessment have not been examined adequately using validated instruments. Using data collected from a stratified, random sample (N = 370) of U.S. institutional research and assessment directors, the models tested in this study provide empirical support for the value of using the…

Descriptors: Higher Education, Administrators, Evaluation Methods, Attitude Measures

Enhancing the Interpretability of the Overall Results of an International Test of English-Language Proficiency

Peer reviewed

Direct link

Papageorgiou, Spiros; Morgan, Rick; Becker, Valerie – International Journal of Testing, 2015

The purpose of this study was to enhance the meaning of the scores of an English-language test by developing performance levels and descriptors for reporting overall test performance. The levels and descriptors were intended to accompany the total scale scores of TOEFL Junior® Standard, an international test of English as a second/foreign…

Descriptors: Language Proficiency, Language Tests, English (Second Language), Second Language Learning

Using the Minnesota Multiphasic Personality Inventory-2 to Develop a Scale to Identify Test Anxiety among Students with Learning Disabilities

Peer reviewed

Direct link

Lufi, Dubi; Awwad, Abeer – Learning Disability Quarterly, 2013

The purpose of this article was to describe an initial step developing a new scale to identify individuals with learning disabilities (LD) and test anxiety. Eighty-eight students answered the "Minnesota Multiphasic Personality Inventory-2" (MMPI-2). The participants were drawn from the following three groups: (a) adults with LD and test…

Descriptors: Learning Disabilities, Test Anxiety, Comparative Analysis, Test Validity

Aligning "TextEvaluator"® Scores with the Accelerated Text Complexity Guidelines Specified in the Common Core State Standards. Research Report. ETS RR-15-21

Peer reviewed
PDF on ERIC

Download full text

Sheehan, Kathleen M. – ETS Research Report Series, 2015

The "TextEvaluator"® text analysis tool is a fully automated text complexity evaluation tool designed to help teachers, curriculum specialists, textbook publishers, and test developers select texts that are consistent with the text complexity guidelines specified in the Common Core State Standards.This paper documents the procedure used…

Descriptors: Scores, Common Core State Standards, Computer Software, Computational Linguistics

The Role of Item Models in Automatic Item Generation

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012

Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items

Pass-Fail Reliability for Tests with Cut Scores: A Simplified Method.

Download full text

Breyer, F. Jay; Lewis, Charles – 1994

A single-administration classification reliability index is described that estimates the probability of consistently classifying examinees to mastery or nonmastery states as if those examinees had been tested with two alternate forms. The procedure is applicable to any test used for classification purposes, subdividing that test into two…

Descriptors: Classification, Cutting Scores, Objective Tests, Pass Fail Grading

Supporting Efficient, Evidence-Centered Item Development for the GRE Verbal Measure. ETS GRE Board Research Report No. 03-14. ETS RR-07-29

Peer reviewed
PDF on ERIC

Download full text

Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko – ETS Research Report Series, 2007

This paper explores alternative approaches for facilitating efficient, evidence-centered item development for a new type of verbal reasoning item developed for use on the GRE® General Test. Results obtained in two separate studies are reported. The first study documented the development and validation of a fully automated approach for locating the…

Descriptors: College Entrance Examinations, Graduate Study, Test Items, Item Analysis

ITEM SELECTION TECHNIQUES AND EVALUATION OF INSTRUCTIONAL OBJECTIVES.

COX, RICHARD C. – 1965

THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…

Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis

Statistical Approaches to the Study of Item Difficulty.

Download full text

Olson, John F.; And Others – 1989

Traditionally, item difficulty has been defined in terms of the performance of examinees. For test development purposes, a more useful concept would be some kind of intrinsic item difficulty, defined in terms of the item's content, context, or characteristics and the task demands set by the item. In this investigation, the measurement literature…

Descriptors: Classification, Cluster Analysis, Difficulty Level, Educational Research

Computer Aided Tests.

Download full text

Steinke, Elisabeth – 1970

An approach to using the computer to assemble German tests is described. The purposes of the system would be: (1) an expansion of the bilingual lexical memory bank to list and store idioms of all degrees of difficulty, with frequency data and with complete and sophisticated retrieval possibility for assembly; (2) the creation of an…

Descriptors: Classification, Computational Linguistics, Computer Oriented Programs, German

Statistical Risk Assessment: Old Problems and New Applications

Peer reviewed

Direct link

Gottfredson, Stephen D.; Moriarty, Laura J. – Crime & Delinquency, 2006

Statistically based risk assessment devices are widely used in criminal justice settings. Their promise remains largely unfulfilled, however, because assumptions and premises requisite to their development and application are routinely ignored and/or violated. This article provides a brief review of the most salient of these assumptions and…

Descriptors: Risk, Justice, Criminals, Crime

Inside Sourcefinder: Predicting the Acceptability Status of Candidate Reading-Comprehension Source Documents. Research Report. ETS RR-06-24

Peer reviewed
PDF on ERIC

Download full text

Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko; Hemat, Ramin; Zuckerman, Daniel – ETS Research Report Series, 2006

This paper describes the development, implementation, and evaluation of an automated system for predicting the acceptability status of candidate reading-comprehension stimuli extracted from a database of journal and magazine articles. The system uses a combination of classification and regression techniques to predict the probability that a given…

Descriptors: Automation, Prediction, Reading Comprehension, Classification

A Causal Model Analysis Suggests Modification of the Cumulative Hierarchical Structure Assumed in Bloom's Taxonomy of the Cognitive Domain.

Download full text

Madaus, George F.; And Others – 1971

Bloom's taxonomy of the cognitive domain consists of six major levels: Knowledge, Comprehension, Application, Analysis, Synthesis, and Evaluation. The purpose of this study is to construct a quantitative causal model for a set of tests designed to operationally define these six levels in order to further explore the validity of the cumulative…

Descriptors: Achievement Tests, Classification, Cognitive Objectives, Cognitive Processes

Sheehan, Kathleen M.	3
Futagi, Yoko	2
Kostin, Irene	2
Awwad, Abeer	1
Becker, Valerie	1
Breyer, F. Jay	1
Bustamante, Rebecca M.	1
COX, RICHARD C.	1
Demir, Ergul	1
Fuller, Matthew B.	1
Gierl, Mark J.	1
Gottfredson, Stephen D.	1
Hemat, Ramin	1
Holzweiss, Peggy C.	1
Lai, Hollis	1
Lewis, Charles	1
Liu, Ren	1
Lufi, Dubi	1
Madaus, George F.	1
Morgan, Rick	1
Moriarty, Laura J.	1
Olson, John F.	1
Papageorgiou, Spiros	1
Skidmore, Susan T.	1
More ▼