title: Multiple Instance Learning with Random Forests and Applications in Industrial Optical Inspection creator: Wieler, Matthias subject: 004 subject: 004 Data processing Computer science subject: 310 subject: 310 General statistics subject: 670 subject: 670 Manufacturing description: Automatic defect detection in industrial optical inspection requires algorithms that can learn from data. A special challenge is data with incomplete labels. One of the methods that the field of machine learning has brought forth to deal with incomplete labels is multiple instance learning. One trait of this setting is that it groups datapoints (instances) into bags. We propose a novel method to predict bag probabilities from given instance probabilities that has the advantage that its results do not depend on bag size. Also, we propose an extension of the multiple instance model that allows the user to steer the number of instances that are classified as positive. We implement these methods with an algorithm based on the well-known random forest classifier. Results on a standard benchmark dataset show competitive performance. Furthermore, we apply this algorithm to image data that reflects the challenges of industrial optical inspection, and we show that in this setting it improves over the standard random forest. date: 2014 type: Dissertation type: info:eu-repo/semantics/doctoralThesis type: NonPeerReviewed format: application/pdf identifier: https://archiv.ub.uni-heidelberg.de/volltextserverhttps://archiv.ub.uni-heidelberg.de/volltextserver/17287/1/PhDThesis_Wieler.pdf identifier: DOI:10.11588/heidok.00017287 identifier: urn:nbn:de:bsz:16-heidok-172875 identifier: Wieler, Matthias (2014) Multiple Instance Learning with Random Forests and Applications in Industrial Optical Inspection. [Dissertation] relation: https://archiv.ub.uni-heidelberg.de/volltextserver/17287/ rights: info:eu-repo/semantics/openAccess rights: http://archiv.ub.uni-heidelberg.de/volltextserver/help/license_urhg.html language: eng