ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	41

Descriptor

Test Length	124
Item Response Theory	53
Test Items	40
Sample Size	29
Computer Assisted Testing	28
Adaptive Testing	27
Estimation (Mathematics)	25
Simulation	25
Test Construction	24
Scores	21
Comparative Analysis	19
Mathematical Models	19
Ability	17
Error of Measurement	16
Test Reliability	16
Test Validity	16
Maximum Likelihood Statistics	15
Computation	14
Monte Carlo Methods	14
Testing Problems	14
Bayesian Statistics	13
Classification	13
Computer Simulation	13
Test Format	13
Correlation	12
More ▼

Publication Type

Reports - Evaluative	124
Journal Articles	76
Speeches/Meeting Papers	28
Numerical/Quantitative Data	4
Reports - Research	4
Information Analyses	2
Collected Works - General	1
Guides - Non-Classroom	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Practitioners

Location

Netherlands	2
Asia	1
Japan	1
Maryland	1
New York	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Law School Admission Test	2
Test of English as a Foreign…	2
ACTFL Oral Proficiency…	1
Armed Forces Qualification…	1
COMPASS (Computer Assisted…	1
Developmental Indicators for…	1
International English…	1
Measures of Academic Progress	1
Medical College Admission Test	1
Peabody Picture Vocabulary…	1
Program for International…	1
Raven Advanced Progressive…	1
Trends in International…	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 124 results Save | Export

Are We There Yet? Evaluating the Effectiveness of a Recurrent Neural Network-Based Stopping Algorithm for an Adaptive Assessment

Peer reviewed

Direct link

Matayoshi, Jeffrey; Cosyn, Eric; Uzun, Hasan – International Journal of Artificial Intelligence in Education, 2021

Many recent studies have looked at the viability of applying recurrent neural networks (RNNs) to educational data. In most cases, this is done by comparing their performance to existing models in the artificial intelligence in education (AIED) and educational data mining (EDM) fields. While there is increasing evidence that, in many situations,…

Descriptors: Artificial Intelligence, Data Analysis, Student Evaluation, Adaptive Testing

A Comparison of Automated Scale Short Form Selection Strategies

Peer reviewed
PDF on ERIC

Download full text

Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – International Educational Data Mining Society, 2019

Short forms of psychometric scales have been commonly used in educational and psychological research to reduce the burden of test administration. However, it is challenging to select items for a short form that preserve the validity and reliability of the scores of the original scale. This paper presents and evaluates multiple automated methods…

Descriptors: Psychometrics, Measures (Individuals), Mathematics, Heuristics

Test Review: Current Options in At-Home Language Proficiency Tests for Making High-Stakes Decisions

Peer reviewed

Direct link

Isbell, Daniel R.; Kremmel, Benjamin – Language Testing, 2020

Administration of high-stakes language proficiency tests has been disrupted in many parts of the world as a result of the 2019 novel coronavirus pandemic. Institutions that rely on test scores have been forced to adapt, and in many cases this means using scores from a different test, or a new online version of an existing test, that can be taken…

Descriptors: Language Tests, High Stakes Tests, Language Proficiency, Second Language Learning

Non-Response Rates to Individual Items on the IDEA Student Ratings of Instruction Forms. IDEA Research Note #5

Download full text

Li, Dan; Benton, Stephen L. – IDEA Center, Inc., 2017

In the study evaluated in this report, the authors asked what effect survey length has on student non-response rates to individual items on IDEA's "Diagnostic Feedback" (DF) and "Learning Essentials" (LE) forms. The approach was to analyze individual student ratings of classes contained in the 2015-2016 IDEA-CL database.…

Descriptors: Response Rates (Questionnaires), Student Surveys, Test Length, Test Items

Test Review: TestDaF

Peer reviewed

Direct link

Norris, John; Drackert, Anastasia – Language Testing, 2018

The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…

Descriptors: German, Second Language Learning, Language Tests, Language Proficiency

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Download full text

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Student Outcomes on MAP Growth: Comparison of Virtual and In-Person Administrations

Download full text

James, Syretta R.; Liu, Shihching Jessica; Maina, Nyambura; Wade, Julie; Wang, Helen; Wilson, Heather; Wolanin, Natalie – Montgomery County Public Schools, 2021

The impact of the COVID-19 pandemic continues to overwhelm the functioning and outcomes of educational systems throughout the nation. The public education system is under particular scrutiny given that students, families, and educators are under considerable stress to maintain academic progress. Since the beginning of the crisis, school-systems…

Descriptors: Achievement Tests, COVID-19, Pandemics, Public Schools

Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

Direct link

Jacob, Brian A. – Center on Children and Families at Brookings, 2016

Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

Descriptors: Scores, Common Core State Standards, Test Length, Test Content

The Incremental Validity of a Short Form of the Ideational Behavior Scale and Usefulness of Distractor, Contraindicative, and Lie Scales

Peer reviewed

Direct link

Runco, Mark A.; Walczyk, Jeffrey John; Acar, Selcuk; Cowger, Ernest L.; Simundson, Melissa; Tripp, Sunny – Journal of Creative Behavior, 2014

This article describes an empirical refinement of the "Runco Ideational Behavior Scale" (RIBS). The RIBS seems to be associated with divergent thinking, and the potential for creative thinking, but it was possible that its validity could be improved. With this in mind, three new scales were developed and the unique benefit (or…

Descriptors: Behavior Rating Scales, Creative Thinking, Test Validity, Psychometrics

A Nonparametric Approach to Estimate Classification Accuracy and Consistency

Peer reviewed

Direct link

Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014

When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…

Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics

Test Review: C. Mardell & D. S. Goldenberg. "Speed Developmental Indicators for the Assessment of Learning-Fourth Edition" ("Speed DIAL-4")

Peer reviewed

Direct link

Doskey, Elena M.; Lagunas, Brenda; SooHoo, Michelle; Lomax, Amanda; Bullick, Stephanie – Journal of Psychoeducational Assessment, 2013

The Speed DIAL-4 was developed from the Developmental Indicators for the Assessment of Learning, Fourth Edition (DIAL-4), a screening designed to identify children between the ages of 2 years, 6 months through 5 years, 11 months "who are in need of intervention or diagnostic assessment in the following areas: motor, concepts, language,…

Descriptors: Screening Tests, Young Children, Test Length, Scoring

Using Logistic Approximations of Marginal Trace Lines to Develop Short Assessments

Peer reviewed

Direct link

Stucky, Brian D.; Thissen, David; Edelen, Maria Orlando – Applied Psychological Measurement, 2013

Test developers often need to create unidimensional scales from multidimensional data. For item analysis, "marginal trace lines" capture the relation with the general dimension while accounting for nuisance dimensions and may prove to be a useful technique for creating short-form tests. This article describes the computations needed to obtain…

Descriptors: Test Construction, Test Length, Item Analysis, Item Response Theory

Identification of Differential Item Functioning in Assessment Booklet Designs with Structurally Missing Data

Peer reviewed

Direct link

Goodman, Joshua T.; Willse, John T.; Allen, Nancy L.; Klaric, John S. – Educational and Psychological Measurement, 2011

The Mantel-Haenszel procedure is a popular technique for determining items that may exhibit differential item functioning (DIF). Numerous studies have focused on the strengths and weaknesses of this procedure, but few have focused the performance of the Mantel-Haenszel method when structurally missing data are present as a result of test booklet…

Descriptors: Test Bias, Identification, Tests, Test Length

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Direct link

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Applied Psychological…	21
Educational and Psychological…	12
Journal of Educational…	8
Applied Measurement in…	7
Language Testing	4
Educational Research and…	2
Psychological Assessment	2
Psychometrika	2
Academic Medicine	1
Assessment & Evaluation in…	1
Center on Children and…	1
Educational Measurement:…	1
European Journal of Science…	1
Evaluation in Education:…	1
IDEA Center, Inc.	1
Intelligence	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Creative Behavior	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Journal of Visual Impairment…	1
Machine-Mediated Learning	1
Montgomery County Public…	1
More ▼

Gessaroli, Marc E.	5
Wang, Wen-Chung	5
De Ayala, R. J.	4
Wainer, Howard	4
De Champlain, Andre	3
Kim, Seock-Ho	3
Livingston, Samuel A.	3
Meijer, Rob R.	3
Allen, Nancy L.	2
Ankenmann, Robert D.	2
Chen, Shu-Ying	2
De Champlain, Andre F.	2
Eggen, Theo J. H. M.	2
Finch, Holmes	2
Fitzpatrick, Anne R.	2
Hambleton, Ronald K.	2
Lewis, Charles	2
Pommerich, Mary	2
Schumacker, Randall E.	2
Sijtsma, Klaas	2
Song, Hao	2
Spray, Judith A.	2
Stone, Clement A.	2
Thissen, David	2
More ▼