ERIC - Search Results

Publication Date

In 2025	3
Since 2024	18
Since 2021 (last 5 years)	69
Since 2016 (last 10 years)	161
Since 2006 (last 20 years)	317

Descriptor

Test Length	624
Test Items	218
Item Response Theory	197
Test Construction	149
Sample Size	137
Test Reliability	130
Computer Assisted Testing	117
Test Validity	108
Simulation	107
Adaptive Testing	98
Comparative Analysis	96
Test Format	88
Scores	86
Error of Measurement	75
Statistical Analysis	71
Correlation	68
Foreign Countries	68
Item Analysis	65
Computation	61
Higher Education	61
Models	61
Difficulty Level	57
Accuracy	55
Testing Problems	54
Monte Carlo Methods	51
More ▼

Education Level

Higher Education	44
Postsecondary Education	36
Elementary Education	21
Secondary Education	18
Middle Schools	11
Elementary Secondary Education	10
High Schools	9
Early Childhood Education	8
Junior High Schools	8
Primary Education	7
Grade 3	6
Intermediate Grades	6
Grade 6	5
Grade 8	5
Grade 2	3
Grade 4	3
Grade 5	3
Grade 7	3
Kindergarten	3
Grade 11	2
Grade 12	2
Grade 9	2
Grade 1	1
Grade 10	1
Preschool Education	1
More ▼

Audience

Researchers	23
Practitioners	7
Administrators	2
Community	1
Students	1
Support Staff	1
Teachers	1

Location

Turkey	8
Australia	7
Canada	7
China	5
Netherlands	5
Japan	4
Taiwan	4
United Kingdom	4
Germany	3
Michigan	3
Singapore	3
South Korea	3
Ireland	2
New York	2
New Zealand	2
Pennsylvania	2
Alabama	1
Armenia	1
Asia	1
Brazil	1
California	1
Colombia	1
Florida	1
Ghana	1
Illinois (Chicago)	1
More ▼

Laws, Policies, & Programs

Americans with Disabilities…	1
Equal Access	1
Job Training Partnership Act…	1
Race to the Top	1
Rehabilitation Act 1973…	1

What Works Clearinghouse Rating

Test Length X

Showing 121 to 135 of 624 results Save | Export

The Impact of Q-Matrix Designs on Diagnostic Classification Accuracy in the Presence of Attribute Hierarchies

Peer reviewed

Direct link

Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017

There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…

Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests

Non-Response Rates to Individual Items on the IDEA Student Ratings of Instruction Forms. IDEA Research Note #5

Download full text

Li, Dan; Benton, Stephen L. – IDEA Center, Inc., 2017

In the study evaluated in this report, the authors asked what effect survey length has on student non-response rates to individual items on IDEA's "Diagnostic Feedback" (DF) and "Learning Essentials" (LE) forms. The approach was to analyze individual student ratings of classes contained in the 2015-2016 IDEA-CL database.…

Descriptors: Response Rates (Questionnaires), Student Surveys, Test Length, Test Items

Test Review: TestDaF

Peer reviewed

Direct link

Norris, John; Drackert, Anastasia – Language Testing, 2018

The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…

Descriptors: German, Second Language Learning, Language Tests, Language Proficiency

Designing CAT MOCCA: Guiding Principles and Simulation Research. MOCCA Technical Report MTR-2021-1

Peer reviewed
PDF on ERIC

Download full text

Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021

MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…

Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Download full text

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

The Big Three Perfectionism Scale--Short Form (BTPS-SF): Development of a Brief Self-Report Measure of Multidimensional Perfectionism

Peer reviewed

Direct link

Feher, Anita; Smith, Martin M.; Saklofske, Donald H.; Plouffe, Rachel A.; Wilson, Claire A.; Sherry, Simon B. – Journal of Psychoeducational Assessment, 2020

The Big Three Perfectionism Scale (BTPS) is a 45-item self-report measure of perfectionism with three overarching factors: rigid, self-critical, and narcissistic perfectionism. Our objective was to create a brief version of the BTPS, the Big Three Perfectionism Scale--Short Form (BTPS-SF). Sixteen items were selected, and confirmatory factor…

Descriptors: Personality Measures, Personality Traits, Test Construction, Measurement Techniques

Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

Peer reviewed

Direct link

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…

Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation

ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

Peer reviewed
PDF on ERIC

Download full text

Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C. – Journal of Learning in Higher Education, 2016

Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…

Descriptors: Statistical Analysis, Scores, Tests, Testing

Feasibility and Effectiveness of Group Exams in Mathematics Courses

Peer reviewed

Direct link

Garaschuk, Kseniya M.; Cytrynbaum, Eric N. – PRIMUS, 2019

Active learning techniques, such as peer instruction and group work, have been gaining a lot of traction in universities. Taking a natural next step in re-evaluating current practices, many institutions recently started experimenting student-centred group exams. In order to assess the feasibility and effectiveness of collaborative assessments, we…

Descriptors: Instructional Effectiveness, Mathematics Instruction, Group Testing, Group Activities

Comparative Analyses of MIRT Models and Software (BMIRT and flexMIRT)

Peer reviewed

Direct link

Yavuz, Guler; Hambleton, Ronald K. – Educational and Psychological Measurement, 2017

Application of MIRT modeling procedures is dependent on the quality of parameter estimates provided by the estimation software and techniques used. This study investigated model parameter recovery of two popular MIRT packages, BMIRT and flexMIRT, under some common measurement conditions. These packages were specifically selected to investigate the…

Descriptors: Item Response Theory, Models, Comparative Analysis, Computer Software

Mixture IRT Model with a Higher-Order Structure for Latent Traits

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2017

Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Modelling Student Misconceptions Using Nested Logit Item Response Models

Direct link

Yildiz, Mustafa – ProQuest LLC, 2017

Student misconceptions have been studied for decades from a curricular/instructional perspective and from the assessment/test level perspective. Numerous misconception assessment tools have been developed in order to measure students' misconceptions relative to the correct content. Often, these tools are used to make a variety of educational…

Descriptors: Misconceptions, Students, Item Response Theory, Models

Dimensionality in Compensatory MIRT When Complex Structure Exists: Evaluation of DETECT and NOHARM

Peer reviewed

Direct link

Svetina, Dubravka; Levy, Roy – Journal of Experimental Education, 2016

This study investigated the effect of complex structure on dimensionality assessment in compensatory multidimensional item response models using DETECT- and NOHARM-based methods. The performance was evaluated via the accuracy of identifying the correct number of dimensions and the ability to accurately recover item groupings using a simple…

Descriptors: Item Response Theory, Accuracy, Correlation, Sample Size

Evaluating the Impact of Guessing and Its Interactions with Other Test Characteristics on Confidence Interval Procedures for Coefficient Alpha

Peer reviewed

Direct link

Paek, Insu – Educational and Psychological Measurement, 2016

The effect of guessing on the point estimate of coefficient alpha has been studied in the literature, but the impact of guessing and its interactions with other test characteristics on the interval estimators for coefficient alpha has not been fully investigated. This study examined the impact of guessing and its interactions with other test…

Descriptors: Guessing (Tests), Computation, Statistical Analysis, Test Length

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

« Previous Page | Next Page »

Pages: 1 | ... | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | ... | 42

Educational and Psychological…	86
Applied Psychological…	45
Journal of Educational…	28
ProQuest LLC	28
Applied Measurement in…	21
ETS Research Report Series	15
Journal of Psychoeducational…	13
Psychological Assessment	12
International Journal of…	11
Psychometrika	10
Measurement:…	9
International Journal of…	8
Journal of Educational and…	7
Journal of Experimental…	6
Educational Sciences: Theory…	5
Journal of Speech, Language,…	5
Language Testing	5
Assessment	4
Educational Measurement:…	4
Grantee Submission	4
Eurasian Journal of…	3
Field Methods	3
Journal of Clinical Psychology	3
Perceptual and Motor Skills	3
Physical Review Physics…	3
More ▼

Hambleton, Ronald K.	15
Wang, Wen-Chung	9
Livingston, Samuel A.	6
Sijtsma, Klaas	6
Wainer, Howard	6
Weiss, David J.	6
Wilcox, Rand R.	6
Cheng, Ying	5
Gessaroli, Marc E.	5
Lee, Won-Chan	5
Lewis, Charles	5
Reckase, Mark D.	5
Cohen, Allan S.	4
De Ayala, R. J.	4
Drasgow, Fritz	4
Huynh, Huynh	4
Kim, Seock-Ho	4
Meijer, Rob R.	4
Paek, Insu	4
Schumacker, Randall E.	4
Tay, Louis	4
Wang, Chun	4
Wells, Craig S.	4
Axelrod, Bradley N.	3
More ▼

Reports - Research	411
Journal Articles	393
Reports - Evaluative	124
Speeches/Meeting Papers	92
Dissertations/Theses -…	28
Reports - Descriptive	21
Numerical/Quantitative Data	14
Guides - Non-Classroom	11
Tests/Questionnaires	11
Information Analyses	10
Opinion Papers	7
Reference Materials -…	2
Reports - General	2
Collected Works - General	1
Collected Works - Serials	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - General	1
Historical Materials	1
More ▼

Test of English as a Foreign…	9
Wechsler Adult Intelligence…	9
SAT (College Admission Test)	8
Law School Admission Test	5
Minnesota Multiphasic…	5
Wechsler Intelligence Scale…	5
Graduate Record Examinations	4
Trends in International…	4
Iowa Tests of Basic Skills	3
Kaufman Brief Intelligence…	3
National Assessment of…	3
Program for International…	3
Advanced Placement…	2
Bem Sex Role Inventory	2
Comprehensive Tests of Basic…	2
MacArthur Communicative…	2
McCarthy Scales of Childrens…	2
Medical College Admission Test	2
Nelson Denny Reading Tests	2
Peabody Picture Vocabulary…	2
Self Description Questionnaire	2
Stanford Binet Intelligence…	2
Wechsler Intelligence Scales…	2
ACTFL Oral Proficiency…	1
Academic Motivation Scale	1
More ▼