Hamidreza Kasaei
From IEETA
Subject | Interactive Open-Ended Learning for 3D Object Recognition | |
---|---|---|
Advisor | Luís Seabra Lopes, Ana Maria Tomé | |
Group | Intelligent Robotics and Systems | |
Status | PhD student | |
Starts | 2012/10/01 | |
Ends | 2019/04/10 | |
Country | Iran | |
Projects | ||
Past projects | ||
Events | ||
Proposals | ||
Personal page |
I have joined the Department of Artificial Intelligence of the University of Groningen (the Netherlands) as a Faculty of Science and Engineering (FSE) Fellow in August 2018.
Please check my new webpage here!!!
About Me
S. Hamidreza Kasaei joined the IEETA and IRIS Labs as a Ph.D. student in October 2012, to work on FP7 Project named RACE: Robustness by Autonomous Competence Enhancement under the supervision of Luís Seabra Lopes and Ana Maria Tomé . During his Ph.D., Hamidreza is working on conceptualizing 3D objects, such that robots’ performance on object recognition improves with accumulated experiences and conceptualizations. This project is based on 3D environment perception and open-ended learning, as well as human-robot interaction for labeling objects and providing feedback. Before his Ph.D., Hamidreza completed his Master Degree in Computer Engineering field of Artificial Intelligence at the University of Isfahan. His thesis entitled “Face Recognition Using Single Normal Reference Image and Feature Statistics" was written under the supervision of Amirhassan Monadjemi. Besides, he worked on Middle size soccer robot and Humanoid robot and obtained different ranks in RoboCup competitions. The current version of his CV is available here, and his full list of publications and corresponding BibTeX files can be found on his Google scholar account.
Latest News
|
|
Research
My research interests focus on the intersection of robotics, machine learning, and machine vision. I am interested in developing algorithms for an adaptive perception system based on interactive environment exploration and open-ended learning, which enables robots to learn from past experiences and interact with human users. This is necessary for assistive robots, not only to perform manipulation tasks in a reasonable amount of time and in an appropriate manner, but also to robustly adapt to new environments by handling new objects. I investigate active perception, where robots use their mobility and manipulation capabilities not only to gain the most useful perceptual information to model the world, also to predict the next best view for improving object detection and manipulation performances. I have evaluated my work on different robotic platforms including PR2, robotic arms, and humanoid robots. My research is summarized by the following projects:
‡ Coping with Context Change in Open-Ended Object Recognition without Explicit Context Information
To deploy a robot in a human-centric environment, it is important that the robot is able to continuously acquire and update object categories while working in the environment. Therefore, autonomous robots must have the ability to continuously execute learning and recognition in a concurrent and interleaved fashion. One of the main challenges in unconstrained human environments is to cope with the effects of context change. This paper presents two main contributions: (i) an approach for evaluating open-ended object category learning and recognition methods in multi-context scenarios; (ii) evaluation of different object category learning and recognition approaches regarding their ability to cope with the effects of context change. Off-line evaluation approaches such as cross-validation do not comply with the simultaneous nature of learning and recognition. A teaching protocol, supporting context change, was therefore designed and used in this work for experimental evaluation. Seven learning and recognition approaches were evaluated and compared using the protocol. The best performance, in terms of the number of learned categories, was obtained with a recently proposed local variant of Latent Dirichlet Allocation (LDA), closely followed by a Bag-of-Words (BoW) approach. In terms of adaptability, i.e. coping with context change, the best result was obtained with BoW, immediately followed by the local LDA variant.
| |
‡ Perceiving, Learning, and Recognizing 3D Objects: An Approach to Cognitive Service Robots
There is a growing need for robots that can interact with people in everyday situations. For service robots, it is not reasonable to assume that one can pre-program all object categories. Instead, apart from learning from a batch of labeled training data, robots should continuously update and learn new object categories while working in the environment. This paper proposes a cognitive architecture designed to create a concurrent 3D object category learning and recognition in an interactive and open-ended manner. In particular, this cognitive architecture provides automatic perception capabilities that will allow robots to detect objects in highly crowded scenes and learn new object categories from the set of accumulated experiences in an incremental and open-ended way. Moreover, it supports constructing the full model of an unknown object in an on-line manner and predicting next best view for improving object detection and manipulation performance. We provide extensive experimental results demonstrating system performance in terms of recognition, scalability, next-best-view prediction and real-world robotic applications.
| |
‡ Hierarchical Object Representation for OpenEnded Object Category Learning and Recognition (Local Open-Ended LDA)
Service robots are expected to be more autonomous and work effectively in human-centric environments. This implies that robots should have special capabilities, such as learning from past experiences and real-time object category learning and recognition. This paper proposes an open-ended 3D object recognition system which concurrently learns both the object categories and the statistical features for encoding objects. In particular, we propose an extension of Latent Dirichlet Allocation to learn structural semantic features (i.e. topics), from low-level feature co-occurrences, for each category independently. Moreover, topics in each category are discovered in an unsupervised fashion and are updated incrementally using new object views. In this way, the advantage of both the local hand-crafted and the structural semantic features have been considered in an efficient way. A set of extensive experiments has been performed to assess the performance of the proposed Local Open-Ended LDA in terms of descriptiveness, scalability, and computational time. Experimental results show that the overall classification performance obtained with Local Open-Ended LDA is clearly better than the best performances obtained with the state-of-the-art approaches. Moreover, the best scalability, in terms of number of learned categories, was obtained with the proposed Local Open-Ended LDA approach, closely followed by a Bag-of-Words (BoW) approach. Concerning computational time, the best result was obtained with BoW, immediately followed by the Local Open-Ended LDA approach.
| |
‡ GOOD: A Global Orthographic Object Descriptor for 3D Object Recognition and Manipulation
The Global Orthographic Object Descriptor (GOOD) has been designed to be robust, descriptive and efficient to compute and use. GOOD descriptor has two outstanding characteristics: (1) Providing a good trade-off among: descriptiveness, robustness, computation time, memory usage; (2) Allowing concurrent object recognition and pose estimation for manipulation. The performance of the proposed object descriptor is compared with the main state-of-the-art descriptors. Experimental results show that the overall classification performance obtained with GOOD is comparable to the best performances obtained with the state-of-the-art descriptors. Concerning memory and computation time, GOOD clearly outperforms the other descriptors. Therefore, GOOD is especially suited for real-time applications. The current implementation of GOOD descriptor supports several functionalities for 3D Object Recognition and Object Manipulation.
| |
‡ Towards Lifelong Assistive Robotics: A Tight Coupling between Object Perception and Manipulation
In this work, we propose a cognitive architecture designed to create a tight coupling between perception and manipulation for assistive robots. This is necessary for assistive robots, not only to perform manipulation tasks in a reasonable amount of time and in an appropriate manner, but also to robustly adapt to new environments by handling new objects. In particular, this cognitive architecture provides perception capabilities that will allow robots to, incrementally learn object categories from the set of accumulated experiences and reason about how to perform complex tasks. To achieve these goals, it is critical to detect, track and recognize objects in the environment as well as to conceptualize experiences and learn novel object categories in an open-ended manner, based on human-robot interaction. Interaction capabilities were developed to enable human users to teach new object categories and instruct the robot to perform complex tasks. A Bayesian learning approach with a Bag-of-Words object representation are used to acquire and refine object category models. Perceptual memory is used to store object experiences, feature dictionary and object category models. Working memory is employed to support communication purposes between the different modules of the architecture. A reactive planning approach is used to carry out complex tasks. To examine the performance of the proposed architecture, a quantitative evaluation and a qualitative analysis are carried out. Experimental results show that the proposed system is able to interact with human users, learn new object categories over time, as well as perform complex tasks.
| |
‡ Interactive Open-Ended Learning for 3D Object Recognition: An Approach and Experiments
Nowadays robots already utilize 3D computer vision algorithms broadly to perform complex tasks such as object detection and manipulation. However, most 3D vision algorithms are not efficient for using in real-time and open-ended robotic domains. Many problems have already been understood and solved successfully; however, many issues remain open. Open-ended object recognition is one of these issues waiting for many improvements. This work presents an efficient approach capable of learning and recognizing object categories in an interactive and open-ended manner. In this work, open-ended implies that the set of object categories to be learned is not known in advance. The training instances are extracted from on-line experiences of a robot, and thus become gradually available over time, rather than at the beginning of the learning process. This paper focuses on two state-of-the-art questions: (1) How to automatically detect, conceptualize and recognize objects in 3D scenes in an open-ended manner? (2) How to acquire and use high-level knowledge obtained from the interaction with human users, namely when they provide category labels, in order to improve the system performance?
| |
‡ Learning to Grasp Familiar Objects using Object View Recognition and Template Matching
Robots are still not able to grasp all unforeseen objects. Finding a proper grasp configuration, i.e. the position and orientation of the arm relative to the object, is still challenging. One approach for grasping unforeseen objects is to recognize an appropriate grasp configuration from previous grasp demonstrations. The underlying assumption in this approach is that new objects that are similar to known ones (i.e. they are familiar) can be grasped in a similar way. However finding a grasp representation and a grasp similarity metric is still the main challenge in developing an approach for grasping familiar objects. In this work, interactive object view learning and recognition capabilities are integrated in the process of learning and recognizing grasps. The object view recognition module uses an interactive incremental learning approach to recognize object view labels. The grasp pose learning approach uses local and global visual features of a demonstrated grasp to learn a grasp template associated with the recognized object view. A grasp distance measure based on Mahalanobis distance is used in a grasp template matching approach to recognize an appropriate grasp pose. The experimental results demonstrate the high reliability of the developed template matching approach in recognizing the grasp poses. Experimental results also show how the robot can incrementally improve its performance in grasping familiar objects.
|
Publications
Articles in international journals listed in the ISI
- Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé. Local-LDA: Open-Ended Learning of Latent Topics for 3D Object Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 10, October 2020 [IF 2018: 17.730 (Q1, 1/266)]
- Hamidreza Kasaei, Miguel Riem Oliveira, Gi Hyun Lim, Luís Seabra Lopes, Ana Maria Tomé. Towards Lifelong Assistive Robotics: A Tight Coupling Between Object Perception and Manipulation. Neurocomputing, vol. 291, p. 151-166, May 2018 [IF 2018: 4.072 (Q1, 28/134)]
- Hamidreza Kasaei, Ana Maria Tomé, Luís Seabra Lopes, Miguel Riem Oliveira. GOOD: A Global Orthographic Object Descriptor for 3D Object Recognition and Manipulation. Pattern Recognition Letters, vol. 83, Part 3, p. 312–320, November 2016 [IF 2016: 1.995 (Q2, 58/133)] [IF 2018: 2.810 (Q2, 50/134)]
- Miguel Riem Oliveira, Luís Seabra Lopes, Gi Hyun Lim, Hamidreza Kasaei, Ana Maria Tomé, Aneesh Chauhan. 3D object perception and perceptual learning in the RACE project. Robotics and Autonomous Systems, vol. 75, p. 614–626, January 2016 [IF 2016: 1.950 (Q2, 60/133)] [IF 2018: 2.928 (Q2, 45/134)]
- Hamidreza Kasaei, Miguel Riem Oliveira, Gi Hyun Lim, Luís Seabra Lopes, Ana Maria Tomé. Interactive Open-Ended Learning for 3D Object Recognition: An Approach and Experiments. Journal of Intelligent & Robotic Systems, vol. 80, no. 3, p. 537-553, December 2015 [IF 2015: 0.932 (Q3, 16/25)] [IF 2018: 2.020 (Q3, 17/26)]
Other articles in journals
- Joachim Hertzberg, Jianwei Zhang, Liwei Zhang, Sebastian Rockel, Bernd Neumann, Jos Lehmann, Krishna S. R. Dubba, Anthony G. Cohn, Alessandro Saffiotti, Federico Pecora, Masoumeh Mansouri, Štefan Konečný, Martin Günther, Sebastian Stock, Luís Seabra Lopes, Miguel Riem Oliveira, Gi Hyun Lim, Hamidreza Kasaei, Vahid Mokhtari, Lothar Hotz, Wilfried Bohlken. The RACE Project: Robustness by Autonomous Competence Enhancement. Künstliche Intelligenz, vol. 28, no. 4, p. 297-304, November 2014
Chapters in books
- Hamidreza Kasaei, Nima Shafii, Luís Seabra Lopes, Ana Maria Tomé. Object Learning and Grasping Capabilities for Robotic Home Assistants. RoboCup-2016: Robot Soccer World Cup, Lecture Notes in Computer Science,, Springer, vol. 9776, July 2016
- Gi Hyun Lim, Miguel Riem Oliveira, Hamidreza Kasaei, Luís Seabra Lopes. Hierarchical Nearest Neighbor Graphs for Building Perceptual Hierarchies. Neural Information Processing: 22nd International Conference, ICONIP 2015, S. Arik, T. Huang, W.K. Lai, Q. Liu (Ed.), Springer, p. 646-655, 2015
Articles in conference proceedings
- Hamidreza Kasaei, Nima Shafii, Luís Seabra Lopes, Ana Maria Tomé. Interactive Open-Ended Object, Affordance and Grasp Learning. Proc. IEEE International Conference on Robotics and Automation (ICRA’2019), p. 3747-3753, May 2019
- Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé. Coping with Context Change in Open-Ended Object Recognition without Explicit Context Information. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), October 2018
- Hamidreza Kasaei, Juil Sock, Luís Seabra Lopes, Ana Maria Tomé, Tae-Kyun Kim. Perceiving, Learning, and Recognizing 3D Objects: An Approach to Cognitive Service Robots. Thirty-Second Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, February 2018
- Hamidreza Kasaei, Ana Maria Tomé, Luís Seabra Lopes. Hierarchical Object Representation for Open-Ended Object Category Learning and Recognition. Proc. Thirtieth Annual Conference on Neural Information Processing Systems (NIPS’2016), 2016
- Hamidreza Kasaei, Ana Maria Tomé, Luís Seabra Lopes, Miguel Riem Oliveira. An Orthographic Descriptor for 3D Object Learning and Recognition. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Seoul, Korea, October 2016
- Nima Shafii, Hamidreza Kasaei, Luís Seabra Lopes. Learning to Grasp Familiar Objects using Object View Recognition and Template Matching. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Seoul, Korea, October 2016
- Nima Shafii, Hamidreza Kasaei, Luís Seabra Lopes. A Learning Approach for Robotic Grasp Selection in Open-ended Domains. Proceedings of the 2016 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC 2016), Bragança, Portugal, May 2016
- Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé. Concurrent 3D Object Category Learning and Recognition based on Topic Modelling and Human Feedback. Proceedings of the 2016 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC 2016), Bragança, Portugal, April 2016, May 2016
- Mohammadreza Kasaei, Hamidreza Kasaei, Ehsan Shahri, Ali Ahmadi, Nuno Lau, Artur Pereira. How to Select A Suitable Action Against Strong Pushes in Adult--Size Humanoid Robot: Learning From Past Experiences. IEEE International Conference on Autonomous Robot Systems and Competitions - ICARSC, Bragança, May 2016
- Miguel Riem Oliveira, Luís Seabra Lopes, Gi Hyun Lim, Hamidreza Kasaei, Angel D. Sappa, Ana Maria Tomé. Concurrent Learning of Visual Codebooks and Object Categories in Open-ended Domains. 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, September 2015
- Hamidreza Kasaei, Miguel Riem Oliveira, Gi Hyun Lim, Luís Seabra Lopes, Ana Maria Tomé. An Adaptive Object Perception System Based on Environment Exploration and Bayesian Learning. Proceedings of the 2015 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC 2015), Vila Real, Portugal, April 2015
- Hamidreza Kasaei, Miguel Riem Oliveira, Gi Hyun Lim, Luís Seabra Lopes, Ana Maria Tomé. An Interactive Open-Ended Learning Approach for 3D Object Recognition. Proceedings of the 2014 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), 2014
- Krishna S.R. Dubba, Miguel Riem Oliveira, Gi Hyun Lim, Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé, Anthony G. Cohn. Grounding Language in Perception for Scene Conceptualization in Autonomous Robots. Proceedings of the 2014 AAAI Spring Symposium on Qualitative Representations for Robots, 2014
- Gi Hyun Lim, Miguel Riem Oliveira, Vahid Mokhtari, Hamidreza Kasaei, Luís Seabra Lopes, Ana Maria Tomé. Interactive Teaching and Experience Extraction for Learning about Objects and Robot Activities. Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN'2014), p. 153-160, 2014
- Miguel Riem Oliveira, Gi Hyun Lim, Luís Seabra Lopes, Hamidreza Kasaei, Ana Maria Tomé, Aneesh Chauhan. A Perceptual Memory System for Grounding Semantic Representations in Intelligent Service Robots. Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2014), September 2014
PhD thesis
- Hamidreza Kasaei. Interactive Open-Ended Learning for 3D Object Recognition. PhD Thesis, University of Aveiro, Aveiro, Portugal, April 2019