SYSTEM AND METHOD FOR THE MEASUREMENT OF THE RELATIVE POSITION OF AN OBJECT WITH RESPECT TO A POINT OF REFERENCE |
|||||||
申请号 | EP02712268.8 | 申请日 | 2002-01-23 | 公开(公告)号 | EP1402476B1 | 公开(公告)日 | 2007-08-01 |
申请人 | CONSIGLIO NAZIONALE DELLE RICERCHE; | 发明人 | ANCONA, Nicola, Cons. Nazionale d. Ricerche; CICIRELLI, Grazia, Cons. Nazionale d. Ricerche; DISTANTE, Arcangelo, Cons. Nazionale d. Ricerche; ATTOLICO, Giovanni, Cons. Nazionale d. Ricerche; BRANCA, Antonella, Cons. Nazionale d. Ricerche; STELLA, Ettore, Cons. Nazionale d. Ricerche; MALAVASI, Marco, Cons. Nazionale d. Ricerche; | ||||
摘要 | System for the measurement of the relative position of an object with respect to a point of reference, comprising one or more image acquisition subsystems and a processing unit of said acquired images apt to analyze such images according to recognition and localization techniques. | ||||||
权利要求 | |||||||
说明书全文 | The present invention refers to a system and to a method for the measurement of the relative position of an object with respect to a predetermined point of reference. An exemplary application of the system and of the method according to the present invention is that of the measurement of the relative position of a ball with respect to a specific line of a field of play. In several sports, a referee acknowledges an event depending on the relative position of a ball with respect to a specific line of the field of play. E.g., in Soccer a referee awards a score to a team only when the ball has crossed over the goal line. However, the referee can autonomously note the scoring of a goal solely when the latter is apparent, e.g. due to the swelling out of the netting or to the ball remaining inside of the goal. However, quite frequently the ball crosses over the goal plane and, without touching the net, immediately exits therefrom due to an odd rebound onto the soccer field, a contact with the goal posts, or a player's clearance. When the ball speed is not overly high (it could even reach the 120 Km/h) the referee, in order to decide on the event at issue, can be aided by assistants, who however should have ideal observing conditions, i.e. be positioned on the goal line having the goal in sight. Otherwise the referee, having to decide one way or the other, runs the risk of awarding a non-existent (phantom) scored goal, or of not awarding an actually scored goal. To solve this problem, several known systems, all referable to the same category, provide the use of sensors, inserted inside of the goal structure, receiving a signal from transmitters applied into the playing ball when the latter crosses the goal mouth. However, such systems entail the remarkable disadvantage of being invasive, requiring electronic devices to be inserted in the playing structures (goals and ball). Hence, in order to use the former a general modification to the fields of play and the use of specific balls would be required. These devices are not always applicable, as the modification required to the playing structures could interfere with the laws of the game. Moreover, a known visual-type system relies instead on the observation of the field of play by suitably positioned cameras. This system determines the position of the vertical projection of the center of mass of the ball onto the plane of the field of play, exploiting the information (known a priori) on the dimensions of the various areas thereof. However, this system entails the disadvantage of exclusively detecting the ball crossing over a determined line, yet providing no indication about the height above ground of the ball during its crossing, an information crucial in order to confidently claim that the ball has crossed over the goal plane. A first system for determining the position of an object in a space is known by Another system is disclosed in Nevertheless, such system has several drawbacks. In particular the system is only capable of determining the position of an object that moves on a single plan, in a predetermined direction. Furthermore, the recognition of the object into the acquired images is simply performed by comparison with fixed tresholds, making the result dependent from the environmental conditions. Purpose of the present invention is to solve the abovecited problems of the known art providing a method for the measurement of the relative position of an object with respect to a point of reference, comprising the following steps:
characterised in that said step of processing each image further comprises a step of recognizing said object inside each of said images, said step of recognizing said object being performed by a classifier. The present invention further provides a system for the measurement of the relative position of an object with respect to a point of reference comprising:
characterised in that said processing unit comprises a classifier for recognising said object inside each of said image. Hence, a field of play comprising such system could advantageously be provided. For simplicity's sake, hereinafter reference will still be made to the application of the system in the case of Soccer. Of course, it is understood that the described system and method could be useful in any other application entailing the same technical problem. The main advantage of the method and of the system according to the present invention lies in that those entail no modification to any component of the field of play, or to the ball. A second advantage lies in the robustness and in the reliability of the detection of the scored goal event. Said method integrates a geometrical measurement of the position of the ball in the three-dimensional space (obtained by a binocular system) to a qualitative assessment of the observed event (obtained with a monocular system) thereby emulating the activity of a human observer enjoying the best observing conditions. The integration of the measurements provided by said systems ensures a high precision rate and a minimum error probability in any situation, including those of partial ball obstruction, e.g. by one or more players. A third advantage lies in the completeness of the information on the position of the object with respect to a point of reference, as the method provides the three-dimensional coordinates thereof. A fourth advantage lies in that the system according to the present invention uses, for image acquisition, high-speed digital cameras, whose performances surpass those of the common cameras as well as those of the human eye. A further advantage lies in that the system according to the present invention provides an objective digital recording of the event itself, obtained by an advantageous positioning of the cameras and the concomitant acquisition of the observed scene. This recording enable to subsequently review the scene at will, to validate the signaled event (goal/non-goal), as it typically happens in the case of a digital viewer having high time resolution and simultaneous multivision. Further advantages, features and operation modes of the present invention will be made apparent by the following detailed description of preferred embodiments thereof, given by way of a non-limiting example, making reference to the figures of the attached drawings, wherein:
With initial reference to figure 1, a first embodiment of the system according to the present invention is shown. According to this embodiment, the system 1 comprises one or more subsystems 2, 3 apt to concomitantly acquire images of the portion of the field of play at issue comprising the selected point of reference, in particular of the goal area. The subsystem 2 is of a monocular type, apt to process images acquired from a single position, whereas the subsystem 3 is of a binocular type, apt to process pairs of acquired images from two distinct positions. The operation of the two types of subsystems will be detailed hereinafter. The monocular subsystem 2 and the binocular subsystem 3 are independent therebetween, each one being apt to provide information data related to the position of the ball when the latter enters the respective visual field. These data are forwarded to a processing unit 4 comparing them therebetween and computing an end result 5. Next, figure 2 is a block diagram depicting a monocular-type subsystem 2. The subsystem 2 comprises an image transducer 11, e.g. a camera. This camera 11 should be selected from those enabling to acquire the greatest possible number of images per second, so as to minimize the ball translation Δs between two successive images. In fact, the camera can detect with certainty the goal-scoring event solely when the ball has crossed the goal line of a distance at least equal to Δs/2. By way of an example, using a 'Dalsa Motion® Vision progressive Scan Area' - type camera and in particular an CA-D6 0512 model, enabling the acquisition of 262 images/sec with a 536x516 pixel resolution, for a hypothetical ball speed of 120Km/h, the subsystem 2 can detect with certainty the goal scoring event when the ball has crossed the goal line of 6.5 cm. The format of the data outputted by the camera is digital, and it meets the standard EIA-644, i.e. it is apt to be interfaced with other electronic apparatuses for the storing and the processing of the acquired images. In particular, high-speed storing means 12, based e.g. on Ultra 2 SCSI technology with 10000 rpm disks (Seagate®) are provided. Thus, all the acquired images are stored in a database 13. The acquired images are processed by specific computing means 14. Such computing means 14 comprises dedicated systems, like e.g.:
Figure 3 shows a preferred positioning for a camera A with respect to the field of play 100 during the use of a monocular subsystem 2 and the image Va thereof acquired by the camera A. According to this first embodiment of the system 1, the monocular subsystem 2 comprises a camera A positioned with the optical axis lying on the goal line plane in the direction of the goal, apt to autonomously detect the goal scoring event represented by the ball crossing over the goal plane. Of course, other positionings could be provided in order to attain improved system performances. The use of a monocular subsystem 2 as the one hereto described enables to implement a qualitative-type method for the detection of the goal scoring event according to the visual information obtained by a camera with no measuring, merely referring the position of the ball observed on each image to several fixed points of reference. In this context, the latter are provided by the goal structure. Hence, hardware and software processing means implement a first qualitative-type decision-making system. The latter, upon being inputted an image, outputs in real-time a signal indicating whether the image be an instance of the considered event. According to a first embodiment of the monocular subsystem 2, the processing of the acquired images comprises two essential steps:
The recognizing of the object ball can be performed by object recognition technologies based on contour or area analysis. These methodologies are well known to those skilled in the art, hence a further detailing thereof will be omitted. In particular, according to this first embodiment of the monocular subsystem, a recognition technique based on example learning called SVM (Support Vector Machine) was used. Next, figures 4a and 4b show some sets of exemplary images used to this end. The ball-recognizing step is performed so as to solve various problems, which may be encountered in a likewise application. In particular, the difficulties related to the environmental conditions (intensity and direction of the lighting of the taken area, presence of shadows), to the indistinctness of the ball onto the background of the taken scene and to the measurement of 'false positives' and/or of 'false negatives' are taken into account and overcome. This problems are tackled and solved by a preprocessing of each acquired image, substantially performed by carrying out software-type procedures aimed at:
In order to solve the abovecited problems, a preprocessing module, acting on an analysis of the gray tones, on the correlation among consecutive images and on a windowing around the position of the preceding image, is provided. The method according to the present invention provides a training step with examples aimed at implementing a classifier, e.g. of the two-class (ball/non-ball) type, apt to catalogue each image in one of the two classes with the least possible error. This training step comprises the following steps:
Hence, the method for the detection of the considered event provides:
According to a second embodiment of the monocular subsystem 2, the processing of the acquired images provides a training step aimed at implementing a classifier, in particular of the two-class type, apt to catalogue with the least possible error each image in one of the two goal/non-goal classes. With respect to the previous embodiment, the classifier directly provides the end result, i.e. a decision on the occurrence of the considered event (goal/non-goal). This training step comprises the following steps:
In this case, the method for the detection of a goal scoring provides:
The neural network used consists of three levels: a first input level, a second intermediate level and a third output level. The number of nodes of the levels is determined according to the size of the problem at issue. The first level has a number of nodes, defined by the dimension of the input subimages, whereas the output level has a single node, as the problem requires a binary-type (goal/non-goal) response. The operation principles of a neural network are well known to those skilled in the art, hence will not be detailed hereinafter. Figures 5a and 5b show some subimage sets, respectively used as positive and negative examples, of goal scoring during the training step of the decision-making system. These examples substantially consist of real images depicting a side view of the goal with a ball taken in all the possible positions and under different visibility conditions. During the training step, the neural network classifier learns to recognize the occurrence of the goal-scoring event according to the information contained in the exemplary images provided. Each taken image is processed by the decision-making system, which returns a response on the actual detection of a goal scoring. The image sets employed during the classifier training step should cover most of the actual situations which might occur, thereby ensuring a correct operation of the system and reliable responses even under less-than-ideal conditions, e.g. of visibility. Of course, the classification techniques (SVM and neural network) adopted in the abovedescribed two different embodiments of the monocular subsystem 2 can indifferently be used in the two cases, without thereby changing the overall operation principle of the system. I.e., for the ball recognition also a neural network-based classifier or an SVM-type classifier for the identification of the goal-scoring event could be used. However, the abovedescribed classification techniques are not the only ones useful in tackling the problem at issue. Actually, any two-class classifier based on an example-learning technique could be used in both embodiments of the monocular subsystem 2 according to the present invention. Figure 6 is a block diagram of the binocular subsystem 3. A pair of image transducers 11, e.g. high-speed cameras of the abovedescribed type, acquires images of the area of the field of play at issue. These images are stored by storing means 12' on a file 13' and processed by computing means 14'. These computing means are alike those hereto described in connection with the monocular systems, hence will not be detailed anew hereinafter. With reference to figure 7, the binocular subsystem 3 is based onto the measuring of the three-dimensional position of the center of mass B of the ball taken as point of intersection of the lines of sight {Ri} generated by two cameras 11 taking the area from different points of sight: For each point of sight considered a line of sight Ri is generated by the optical center of the camera, intersecting the center of mass B of the ball in the three-dimensional space and the center bi of the projection of the ball on the image plane Ii. For each camera, the line of sight intersecting the ball center is determined by the two intersections thereof with the goal plane Π and the field of play plane Γ. These intersections are estimated from the position of the ball on the image plane and from information known a priori on the spatial position of the goalposts and of the lines delimiting the goal area. In particular, each line of sight is computed as follows:
The binocular subsystem requires the following information input:
Figure 8 is a top plan view of the field of play showing an advantageous positioning of the two cameras B, C of the binocular subsystem 3 near the goal, apt to minimize the error in the estimate of the distance of the ball from the goal plane. The cameras acquire images alike those indicated with Vb and Vc in the figure. Figure 9 shows the system 1 according to the disclosed first embodiment, providing the combined use of a monocular subsystem 2 (camera A) and of a binocular subsystem 3 (cameras B and C) and the respective views of the acquired images Va, Vb and Vc. In this case as well, other positions could be provided in order to better adjust the system operation to the specific application. The binocular subsystem 3 can autonomously detect the goal scoring event, intended as the crossing over of the goal line by the ball, via a second metric-type decision-making system using the measurement of the three-dimensional positioning, yet, generally speaking, it cannot provide a visually assessable confirmation thereof. The monocular subsystem 2 can autonomously determine only the position of the ball with respect to the goal plane, providing an objective confirmation thereof in the related recording. In fact, without additional sights, a ball having crossed over the goal line yet lying outside of the goal can appear to lie therein. According to the first embodiment, the system 1 integrates the two subsystems, overcoming the limitations that each one thereof would entail when individually used. As the monocular subsystem 2 enables to determine solely the position of the ball with respect to the goal plane, the addition of the sights of the binocular subsystem 3 allows to recognize also those crossings of the goal line taking place outside of the goal. The addition of the monocular subsystem to the binocular one, besides enhancing the reliability of the automated detection of the event, further enables an advantageous visual confirmation thereof. The binocular 3 and the monocular 2 subsystems independently assess the crossing of the goal line. The redundancy of the assessments provided by the two subsystems increases the reliability of the automated detection of the goal-scoring event. The integration of the results provided by the two hereto described subsystems enables to obtain a single final assessment of the event with the utmost rate of certainty available. Making reference to figure 10, a second embodiment of the system according to the present invention is shown. According to this embodiment, the system 1 comprises two monocular subsystems 2, 2' of the hereto-described type. Next, figure 11 is a plan view of the field of play 100 onto which the cameras A and D of the monocular subsystems 2 and 2' are positioned. The cameras A and D acquire images alike those shown in views Va and Vd, respectively. In particular, the camera D is positioned with the optical axis perpendicular to the goal plane and passing through the center thereof. The processing of the acquired images by the camera D consists in singling out the ball in the image, inside of the goal structure. This step is implemented using an SVM-type classifier, referring the position of the ball to the goalposts and crossbar. Then, this information is integrated to that provided by the monocular subsystem 2, i.e. to the information on the position of the ball with respect to the goal plane enabling the qualitative detection the crossing over of the goal line by the ball. Figure 12 shows a third embodiment of the system 1 according to the present invention. According to such third embodiment the system 1 comprises three monocular subsystems 2, 2' and 2". In this case the system 1 provides redundant information, however the redundancy enhances the reliability of the measurement of the scored goal event. Next, figure 13 shows a plan view of the field of play 100 and a preferred positioning of the three cameras A, E and D related to the three monocular subsystems. The views Va, Ve and Vd are examples of images acquired by the three cameras. The use of redundant information is advantageous in order to solve cases of ball obstruction with respect to one of the two symmetrical cameras A and E. Figures 14 and 15 show a fourth embodiment of the system 1 comprising two binocular subsystems 3, 3' of the abovedisclosed type. In this case, four cameras B, C, F and G are positioned onto the field of play 100, so as to provide respective images alike those shown in views Vb, Vc, Vf and Vg. The combined use of the four cameras and the processing of the images provided enhances the reliability of the detection of the scored goal event and reduces the instances of non-detection, caused e.g. by ball obstruction. The present invention has hereto been described according to preferred embodiments thereof given as non-limiting examples. It is understood that other embodiments may be provided, all however to be construed as falling within the protective scope thereof, as defined by the appended claims. |