Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 14 |
Descriptor
Evaluation Research | 17 |
Statistical Analysis | 17 |
Test Reliability | 17 |
Test Validity | 8 |
Test Construction | 7 |
Item Response Theory | 6 |
Screening Tests | 5 |
Testing Programs | 5 |
Curriculum Based Assessment | 4 |
Educational Testing | 4 |
Evaluation Methods | 4 |
More ▼ |
Source
Author
Alonzo, Julie | 4 |
Irvin, P. Shawn | 4 |
Lai, Cheng-Fei | 4 |
Park, Bitnara Jasmine | 4 |
Tindal, Gerald | 4 |
Avanzi, Lorenzo | 1 |
Balducci, Cristian | 1 |
Barford, Sean W. | 1 |
Bergsmann, Evelyn | 1 |
Bodur, Yasar | 1 |
Bradshaw, William S. | 1 |
More ▼ |
Publication Type
Journal Articles | 11 |
Reports - Research | 9 |
Reports - Evaluative | 8 |
Numerical/Quantitative Data | 4 |
Guides - Non-Classroom | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 10 |
Elementary Education | 6 |
Higher Education | 6 |
Postsecondary Education | 4 |
Secondary Education | 4 |
High Schools | 2 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Location
California | 1 |
Italy | 1 |
Norway | 1 |
Ohio | 1 |
Turkey | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Safe and Drug Free Schools… | 1 |
Assessments and Surveys
ACT Assessment | 1 |
ACT Interest Inventory | 1 |
California Achievement Tests | 1 |
Trends in International… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Bergsmann, Evelyn; Klug, Julia; Burger, Christoph; Först, Nora; Spiel, Christiane – Assessment & Evaluation in Higher Education, 2018
There is a lively discussion on how to evaluate competence-based higher education in both evaluation and competence research. The instruments used are often limited to course evaluation or specific competences, taking a rather narrow perspective. Furthermore, the instruments often comprise predetermined competences that cannot be adapted to higher…
Descriptors: Questionnaires, Minimum Competency Testing, Screening Tests, Higher Education
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Iannone, Paola; Simpson, Adrian – Research in Mathematics Education, 2013
A consistent message emerges from research on undergraduate students' perceptions of assessment which describes traditional assessment as detrimental to learning. However this literature has not included students in the pure sciences. Mathematics education literature advocates the introduction of innovative assessment at university. In this…
Descriptors: Undergraduate Students, Student Attitudes, Mathematics Tests, Alternative Assessment
Avanzi, Lorenzo; Miglioretti, Massimo; Velasco, Veronica; Balducci, Cristian; Vecchio, Luca; Fraccaroli, Franco; Skaalvik, Einar M. – Teaching and Teacher Education: An International Journal of Research and Studies, 2013
The study assesses the psychometric properties of the Italian version of the Norwegian Teacher Self-Efficacy Scale--NTSES. Multiple group confirmatory factor analysis was used to explore the measurement invariance of the scale across two countries. Analyses performed on Italian and Norwegian samples confirmed a six-factor structure of the scale…
Descriptors: Foreign Countries, Factor Analysis, Self Efficacy, Well Being
Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs
Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs
Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs
Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 6, Grade 3, Curriculum Based Assessment, Educational Testing
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Unal, Zafer; Bodur, Yasar; Unal, Aslihan – Contemporary Issues in Technology and Teacher Education (CITE Journal), 2012
The researchers in this study undertook development of a webquest evaluation rubric and investigated its reliability. The rubric was created using the strengths of the currently available webquest rubrics with improvements based on the comments provided in the literature and feedback received from educators. After the rubric was created, 23…
Descriptors: Test Construction, Test Reliability, Instructional Material Evaluation, Scoring Rubrics
Cakir, Mustafa – Educational Sciences: Theory and Practice, 2011
The purpose of the study was to investigate the reliability and validity of a Turkish adaptation of Technology-Rich Outcomes-Focused Learning Environment Inventory (TROFLEI) which was developed by Aldridge, Dorman, and Fraser. A sample of 985 students from 16 high schools (Grades 9-12) participated in the study. Translation process followed…
Descriptors: Foreign Countries, Translation, Construct Validity, Factor Structure
ACT, Inc., 2013
This manual contains information about the American College Test (ACT) Plan® program. The principal focus of this manual is to document the Plan program's technical adequacy in light of its intended purposes. This manual supersedes the 2011 edition. The content of this manual responds to requirements of the testing industry as established in the…
Descriptors: College Entrance Examinations, Formative Evaluation, Evaluation Research, Test Bias
Erceg-Hurn, David M.; Mirosevich, Vikki M. – American Psychologist, 2008
Classic parametric statistical significance tests, such as analysis of variance and least squares regression, are widely used by researchers in many disciplines, including psychology. For classic parametric tests to produce accurate results, the assumptions underlying them (e.g., normality and homoscedasticity) must be satisfied. These assumptions…
Descriptors: Statistical Significance, Least Squares Statistics, Effect Size, Statistical Studies
Sudweeks, Richard R.; Reeve, Suzanne; Bradshaw, William S. – Assessing Writing, 2004
A pilot study was conducted to evaluate and improve the rating procedure proposed for use in a research effort designed to assess the essay writing ability of college sophomores. Generalizability theory and the Many-Facet Rasch Model were each used to (a) estimate potential sources of error in the rating, (b) to obtain reliability estimates, and…
Descriptors: Generalizability Theory, College Students, Writing Ability, Writing Evaluation

Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 1999
Combines item response theory (IRT) and statistical methods to analyze California Achievement Test-Mathematics (CAT-M) results for 4,135 seventh graders in northeast Ohio. Provides information to educational analysts about which IRT model fits CAT-M data for the target population, test accuracy in estimating students' abilities at different…
Descriptors: Achievement Tests, Evaluation Research, Grade 7, Item Response Theory
Previous Page | Next Page »
Pages: 1 | 2