I. INTRODUCTION
Recently, high-performance processing device, and the interest in the object tracking algorithm for automated image analysis in the spread of high quality cameras is large [1]. In particular, the intelligent monitoring system, video compression and editing, robot vision, the importance in the computer vision field, because the spotlight in various fields such as the augmented reality, it is emphasized. However, the object tracking performance is significantly affected by changes in illumination, the pose of the object, object size, and etc. Also it is a difficult problem because of the accurate positioning objects with low resolution and low quality image. To solve this problem, various studies have recently been actively conducted.
Object tracking areas has been studied greatly progressed in three directions [1]. Object tracking is a method for tracking an object by using the first feature such as a characteristic of Gradients, Color, and Texture. Also the combination of multiple features can be employed. The representative methods are typically a Mean Shift, and the Cameshift [1], [2], [3].
Monte Carlo approach using the second statistical data tracks an object using a mathematical algorithm for simulating the behavior. Typically, the particle filter and the Kalman filter are similar. In the case of a particle filter using Monte Carlo method, it measures repeatedly the posterior probability of the state to track the strong object because it keeps the track of the object based on the measured posterior probability.
Over time through the study of learning based on the object-tracking method as the third, the more accurate data learning changes in the varying objects according to the movement. The object tracking method of an exemplary learning-based is Tracking Learning Detection (TLD) [4].
This paper is organized as the following: In the next Section, the proposed algorithm will be introduced. In Section 3, we will discuss the result of experiments briefly. Finally, we will make a concluding remark in Section 4.
II. PROPOSED ALGORITHM
TLD method and the proposed method are also known as PN Tracker that tracks the objects using optical flow-based tracker and Random Ferns detector [4]. It initializes both the tracker and detector after the first frame in video, and looks for a tracker to pursue a target. If you succeed in even trace detector to find the target, thereby it learns the detector by utilizing the training data. If there is failure to find an object tracking detector again until you stop tracking the target initialize the tracker to the detected position to have a way to back track.
Object detection capability to track moving objects with respect to the correct object is also to be correct. Because the shape change with the sensitive area in a strong local and scale change estimate for the shape change with the scale and proposes using the TLD data of existing data (HOG, Haar, LBP).
Figure 2 shows a system flow chart. In a typical structure for learning TLD it is to collect data in order to track a specific part without using the learned data. In this structure, it is learning how to detect features to existing pedestrians, vehicles, etc. and faces bond TLD module is tailored to the generally available modules. The most important point is usually the step of combining the training set and the training of the existing TLD made using feature detection.
The long-term tracking problem addressed by TLD can be constrained when applied to faces. In the original formulation, the entire detector was learned online, starting from a single frame. An efficient classifier (randomized forest) was used to represent the decision boundary between the object and its background. In the case of face tracking, building the entire detector is not necessary since a range of face detectors is readily available. The learning therefore consists of building a validator that decides whether a face patch corresponds to the target or not. The validation is significantly less time demanding than the face detection since only a fraction of candidates have to be verified. On the other hand, its precision has to be high in order to avoid confusing two different identities. Face recognition is in general very challenging and a large number of sophisticated methods have been designed already. Here we show that a very simple validator and a learning method works very well when tracking a face in cluttered background.
We adopt a frontal face detection which demonstrated state-of-the-art performance and runs at 20 frames per second (FPS). On the top of the detector we incorporated a validator, a module that analyzes a face patch and outputs a confidence that the patch corresponds to the specific face. The validator is realized by a collection of examples. This collection is initialized by a single example in one frame and it is extended during tracking by inserting more examples.
III. PERFORMANCE ANALYSIS
Figure 3 shows the experimental results by the sequence. From the results, we can see that it has advantage in terms of processing speed. Especially, when Tracker and Detector are integrated at the same time, the processing time is better performance than when the Detector is only operated. In case of the GPU platform, the processing time was good with only Detector.
In our experiment, the multi-tracking was applied in the CPU platform. In terms of processing time, about 4.88ms was observed on CPU and approximately 7.21ms was measured on the GPU. From the results, we can see up-to 29.69 FPS. It means this scheme can support a real-time processsing system.
IV. CONCULSIONS
In this paper, in conjunction with the training set, we made a general characteristic of the TLD framework which can be applied by different characteristics. The proposed scheme can be expected to be adopted for wide range of applications. In addition, except for the multi-tracking experiments, the proposed scheme confirmed real-time processing. This algorithm will be very useful to make a real-time gesture recognition and control in smart car technology.