ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	7

Descriptor

Test Length	92
Test Items	32
Sample Size	26
Test Construction	25
Item Response Theory	22
Comparative Analysis	20
Computer Assisted Testing	20
Higher Education	20
Mathematical Models	20
Test Reliability	18
Adaptive Testing	17
Test Validity	17
Mastery Tests	15
Simulation	15
Testing Problems	14
Cutting Scores	13
Estimation (Mathematics)	13
Scores	13
Difficulty Level	12
Computer Simulation	11
Equated Scores	11
Test Format	11
Ability	10
Criterion Referenced Tests	10
Item Banks	10
More ▼

Source

Applied Measurement in…	2
International Educational…	2
AERA Online Paper Repository	1
Assessment	1
Australian Association for…	1
ETS Research Report Series	1
Journal of Educational…	1
Online Submission	1
Pearson	1
Psychological Assessment	1
Turkish Journal of Education	1
More ▼

Publication Type

Speeches/Meeting Papers	92
Reports - Research	59
Reports - Evaluative	28
Journal Articles	7
Information Analyses	3
Guides - Non-Classroom	2
Numerical/Quantitative Data	2
Opinion Papers	1
Reports - Descriptive	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Higher Education	1

Audience

Researchers

Location

Australia

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	2
Law School Admission Test	2
Wechsler Intelligence Scale…	2
Bar Examinations	1
Bem Sex Role Inventory	1
COMPASS (Computer Assisted…	1
California Achievement Tests	1
California Psychological…	1
Iowa Tests of Basic Skills	1
Matching Familiar Figures Test	1
Medical College Admission Test	1
New Jersey College Basic…	1
Otis Lennon School Ability…	1
Stanford Achievement Tests	1
Texas Assessment of Basic…	1
Texas Educational Assessment…	1
Wechsler Intelligence Scales…	1
Wechsler Memory Scale	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 92 results Save | Export

Comparison of Factor Retention Methods on Binary Data: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kiliç, Abdullah Faruk; Uysal, Ibrahim – Turkish Journal of Education, 2019

In this study, the purpose is to compare factor retention methods under simulation conditions. For this purpose, simulations conditions with a number of factors (1, 2 [simple]), sample sizes (250, 1.000, and 3.000), number of items (20, 30), average factor loading (0.50, 0.70), and correlation matrix (Pearson Product Moment [PPM] and Tetrachoric)…

Descriptors: Simulation, Factor Structure, Sample Size, Test Length

An Empirical Research on Identifiability and Q-Matrix Design for DINA Model

Peer reviewed
PDF on ERIC

Download full text

Xu, Peng; Desmarais, Michel C. – International Educational Data Mining Society, 2018

In most contexts of student skills assessment, whether the test material is administered by the teacher or within a learning environment, there is a strong incentive to minimize the number of questions or exercises administered in order to get an accurate assessment. This minimization objective can be framed as a Q-matrix design problem: given a…

Descriptors: Test Items, Accuracy, Test Construction, Skills

A Comparison of Automated Scale Short Form Selection Strategies

Peer reviewed
PDF on ERIC

Download full text

Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – International Educational Data Mining Society, 2019

Short forms of psychometric scales have been commonly used in educational and psychological research to reduce the burden of test administration. However, it is challenging to select items for a short form that preserve the validity and reliability of the scores of the original scale. This paper presents and evaluates multiple automated methods…

Descriptors: Psychometrics, Measures (Individuals), Mathematics, Heuristics

Illustration of a Survey Refinement Process Using Psychometric Analysis

Peer reviewed

Direct link

Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017

Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Often times, surveys can be long and strenuous on the respondent,…

Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Teacher Educators: Course Experiences of Bachelor of Education Primary Students

Download full text

Bentley-Williams, Robyn; Forbes, Anne – Australian Association for Research in Education (NJ1), 2012

This investigation examined the course experiences of Bachelor of Education Primary students across each year of the course. The aims of the study were to identify gaps in what we know about our students; to identify relevant domains in student experiences and to assist with course improvements. A reflective inquiry paradigm was adopted for…

Descriptors: Foreign Countries, Bachelors Degrees, Preservice Teachers, Student Teacher Attitudes

The Impact of Anchor Test Length on Equating Results in a Nonequivalent Groups Design. Research Report. ETS RR-07-44

Peer reviewed
PDF on ERIC

Download full text

Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007

This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification PSE with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…

Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length

The Relationship between the WRAML Memory Screening and General Memory Indices in a Clinical Population.

Peer reviewed

Guilmette, Thomas J.; Kennedy, Mary Lynne – Assessment, 1997

The Wide Range Assessment of Memory and Learning (WRAML) (D. Sheslow and W. Adams, 1990) was given to 51 children. The General Memory Index (GMI) of the WRAML was compared with a short form of the WRAML, the Memory Screening Index (MSI). The MSI was higher than the GMI in 41 of 51 cases. (SLD)

Descriptors: Children, Cognitive Tests, Learning, Memory

A Description and Demonstration of the Polytomous-DFIT Framework.

Download full text

Flowers, Claudia P.; And Others – 1996

N. S. Raju, W. J. van der Linden, and P. F. Fleer (in press) have proposed an item response theory-based, parametric procedure for the detection of differential item functioning (DIF)/differential test functioning (DTF) known as differential functioning of item and test (DFIT). DFIT can be used with dichotomous, polytomous, or multidimensional…

Descriptors: Item Response Theory, Mathematical Models, Simulation, Test Bias

Sacrificing Reliability and Exalting Sampling Error at the Altar of Parsimony: Some Cautions Concerning Short-Form Test Development.

Download full text

Henson, Robin K. – 2000

The purpose of this paper is to highlight some psychometric cautions that should be observed when seeking to develop short form versions of tests. Several points are made: (1) score reliability is impacted directly by the characteristics of the sample and testing conditions; (2) sampling error has a direct influence on reliability and factor…

Descriptors: Factor Structure, Psychometrics, Reliability, Sampling

Detection of Cheating on Multiple-Choice Examinations.

Download full text

Bay, Luz – 1995

An index is proposed to detect cheating on multiple-choice examinations, and its use is evaluated through simulations. The proposed index is based on the compound binomial distribution. In total, 360 simulated data sets reflecting 12 different cheating (copying) situations were obtained and used for the study of the sensitivity of the index in…

Descriptors: Cheating, Class Size, Identification, Multiple Choice Tests

Assessing the Dimensionality of Polytomous Item Responses with Small Sample Sizes and Short Test Lengths: A Comparison of Procedures.

PDF pending restoration

De Champlain, Andre F.; Gessaroli, Marc E.; Tang, K. Linda; De Champlain, Judy E. – 1998

The empirical Type I error rates of Poly-DIMTEST (H. Li and W. Stout, 1995) and the LISREL8 chi square fit statistic (K. Joreskog and D. Sorbom, 1993) were compared with polytomous unidimensional data sets simulated to vary as a function of test length and sample size. The rejection rates for both statistics were also studied with two-dimensional…

Descriptors: Chi Square, Goodness of Fit, Item Response Theory, Sample Size

Statistics Scores and Testing Time.

Download full text

Kennedy, Robert L.; McCallister, Corliss J. – 2000

The purpose of this study was to investigate the relationship between the scores students earned on their statistics final examinations and the number of minutes students required to complete the exams. In a previous study, K. Bridges (1985) extended the range of interest in this relationship from a single study to a course-based series, examining…

Descriptors: College Students, Higher Education, Scores, Statistics

Congeneric Models and Levine's Linear Equating Procedures.

Download full text

Brennan, Robert L. – 1990

In 1955, R. Levine introduced two linear equating procedures for the common-item non-equivalent populations design. His procedures make the same assumptions about true scores; they differ in terms of the nature of the equating function used. In this paper, two parameterizations of a classical congeneric model are introduced to model the variables…

Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Research Design

An Evaluation of "Intentional" Weighting of Extended-Response or Constructed-Response Items in Tests with Mixed Item Types.

Download full text

Ito, Kyoko; Sykes, Robert C. – 2000

This study investigated the practice of weighting a type of test item, such as constructed response, more than other types of items, such as selected response, to compute student scores for a mixed-item type of test. The study used data from statewide writing field tests in grades 3, 5, and 8 and considered two contexts, that in which a single…

Descriptors: Constructed Response, Elementary Education, Essay Tests, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Gessaroli, Marc E.	4
Bergstrom, Betty A.	3
Hambleton, Ronald K.	3
Bergstrom, Betty	2
De Ayala, R. J.	2
De Champlain, Andre	2
De Champlain, Andre F.	2
Frick, Theodore W.	2
Huynh, Huynh	2
Kim, Seock-Ho	2
Livingston, Samuel A.	2
Lunz, Mary E.	2
Pommerich, Mary	2
Reckase, Mark D.	2
Schumacker, Randall E.	2
Wendler, Cathy	2
Wright, Nancy	2
Abdel-fattah, Abdel-fattah A.	1
Allen, Nancy L.	1
Anderson, Judith A.	1
Ang, Cheng	1
Ankenmann, Robert D.	1
Arbet, Scott E.	1
Axelrod, Bradley N.	1
More ▼