Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023
Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…
Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation
Bais, Frank; Schouten, Barry; Lugtig, Peter; Toepoel, Vera; Arends-Tòth, Judit; Douhou, Salima; Kieruj, Natalia; Morren, Mattijn; Vis, Corrie – Sociological Methods & Research, 2019
Item characteristics can have a significant effect on survey data quality and may be associated with measurement error. Literature on data quality and measurement error is often inconclusive. This could be because item characteristics used for detecting measurement error are not coded unambiguously. In our study, we use a systematic coding…
Descriptors: Foreign Countries, National Surveys, Error of Measurement, Test Items
Hubbard, Jane; Russo, James; Livy, Sharyn – Mathematics Education Research Group of Australasia, 2022
Making accurate judgements and interpretations about student growth and progress in mathematics can be problematic when using open-ended assessments. This study reports on the development of a class-based assessment instrument and marking key designed to assess Year 2 students' mathematics competence to reflect their learning of mathematics…
Descriptors: Mathematics Skills, Mathematics Instruction, Grading, Mathematics Tests
Solano-Flores, Guillermo; Wang, Chao; Shade, Chelsey – International Journal of Testing, 2016
We examined multimodality (the representation of information in multiple semiotic modes) in the context of international test comparisons. Using Program of International Student Assessment (PISA)-2009 data, we examined the correlation of the difficulty of science items and the complexity of their illustrations. We observed statistically…
Descriptors: Semiotics, Difficulty Level, Test Items, Science Tests
Predicting Item Difficulty of Science National Curriculum Tests: The Case of Key Stage 2 Assessments
El Masri, Yasmine H.; Ferrara, Steve; Foltz, Peter W.; Baird, Jo-Anne – Curriculum Journal, 2017
Predicting item difficulty is highly important in education for both teachers and item writers. Despite identifying a large number of explanatory variables, predicting item difficulty remains a challenge in educational assessment with empirical attempts rarely exceeding 25% of variance explained. This paper analyses 216 science items of key stage…
Descriptors: Predictor Variables, Test Items, Difficulty Level, Test Construction
Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016
Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…
Descriptors: Educational Assessment, Coding, Automation, Responses
van der Ven, Sanne H. G.; Klaiber, Jonathan D.; van der Maas, Han L. J. – Educational Psychology, 2017
Writing down spoken number words (transcoding) is an ability that is predictive of math performance and related to working memory ability. We analysed these relationships in a large sample of over 25,000 children, from kindergarten to the end of primary school, who solved transcoding items with a computer adaptive system. Furthermore, we…
Descriptors: Short Term Memory, Foreign Countries, Mathematics, Mathematics Instruction
Oluseyi, Adeyemo Emily; Oreoluwa, Shaba Veronica – World Journal of Education, 2014
The study developed a set of items that could measure counsellor effectiveness. It reduced the initial set of variables related to counsellor effectiveness to such number of variables that are generally perceived as indicative of counsellor effectiveness and determined the factorial composition of the scale. In order to identify the major factors…
Descriptors: Counselor Qualifications, Counseling Effectiveness, Factor Analysis, Foreign Countries
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015
The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…
Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items
Schönborn, K. J.; Höst, G. E.; Lundin Palmerius, K. E. – Chemistry Education Research and Practice, 2015
As the application of nanotechnology in everyday life impacts society, it becomes critical for citizens to have a scientific basis upon which to judge their perceived hopes and fears of 'nano'. Although multiple instruments have been designed for assessing attitudinal and affective aspects of nano, surprisingly little work has focused on…
Descriptors: Molecular Structure, Technology, Test Construction, Test Validity
Lowrie, Tom; Diezmann, Carmel M.; Kay, Russell – Evaluation & Research in Education, 2011
The graphics-decoding proficiency (G-DP) instrument was developed as a screening test for the purpose of measuring students' (aged 8-11 years) capacity to solve graphics-based mathematics tasks. These tasks include number lines, column graphs, maps and pie charts. The instrument was developed within a theoretical framework which highlights the…
Descriptors: Screening Tests, Mathematics Achievement, Mathematical Aptitude, Graphs
OECD Publishing, 2014
The "PISA 2012 Technical Report" describes the methodology underlying the PISA 2012 survey, which tested 15-year-olds' competencies in mathematics, reading and science and, in some countries, problem solving and financial literacy. It examines the design and implementation of the project at a level of detail that allows researchers to…
Descriptors: International Assessment, Secondary School Students, Foreign Countries, Achievement Tests
Lowrie, Tom; Diezmann, Carmel M. – Journal of Educational Research, 2007
The authors investigated the performance of 172 Grade 4 students (9 to 10 years) over 12 months on a 36-item test that comprised items from 6 distinct graphical languages (e.g., maps) commonly used to convey mathematical information. Results revealed (a) difficulties in Grade 4 students' capacity to decode a variety of graphics, (b) significant…
Descriptors: Grade 4, Grade 5, Spatial Ability, Gender Differences