First, the table is detected by combining a color mask for the table cloth with the color difference of the dark areas beside the cushions. Second, the balls are detected by a fine-tuned YOLOX model. Third, the direction of the cue stick is detected and used to calculate the path of the ball and whether it will go in or not. Last, the result is frozen when the two balls touch.
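
The path calculation in the third step is not spelled out here, so the following is only a minimal sketch of one way it could work, not the project's actual code. It assumes an idealized 2D model: both balls have the same radius, the cue ball travels in a straight line along the cue direction, the object ball leaves along the line of centers at contact, and cushions and spin are ignored. The function name `predict_shot` and all of its parameters are hypothetical.

```python
import math


def predict_shot(cue_ball, direction, object_ball, pockets, ball_radius, pocket_radius):
    """Sketch: predict whether the object ball is potted (idealized, no cushions).

    Returns (contact_center, object_direction, pocketed), or None if the cue
    ball misses the object ball. All names here are illustrative assumptions.
    """
    # Normalize the aiming direction taken from the cue stick.
    dx, dy = direction
    norm = math.hypot(dx, dy)
    if norm == 0:
        return None
    dx, dy = dx / norm, dy / norm

    # Vector from the cue ball to the object ball.
    rx, ry = object_ball[0] - cue_ball[0], object_ball[1] - cue_ball[1]
    along = rx * dx + ry * dy                   # distance along the aim line
    perp_sq = rx * rx + ry * ry - along * along  # squared offset from the aim line

    # The balls touch when their centers are two radii apart.
    contact_dist = 2 * ball_radius
    if along <= 0 or perp_sq > contact_dist ** 2:
        return None  # aiming away from the object ball, or passing beside it

    # Cue ball center at the moment of contact.
    travel = along - math.sqrt(contact_dist ** 2 - perp_sq)
    contact = (cue_ball[0] + travel * dx, cue_ball[1] + travel * dy)

    # The object ball departs along the line of centers (unit vector).
    ox = (object_ball[0] - contact[0]) / contact_dist
    oy = (object_ball[1] - contact[1]) / contact_dist

    # Potted if the departure ray passes within pocket_radius of a pocket center.
    pocketed = False
    for px, py in pockets:
        qx, qy = px - object_ball[0], py - object_ball[1]
        ahead = qx * ox + qy * oy          # pocket must lie in front of the ball
        offset = abs(qx * oy - qy * ox)    # perpendicular distance to the ray
        if ahead > 0 and offset <= pocket_radius:
            pocketed = True
            break

    return contact, (ox, oy), pocketed


# Straight shot: cue ball, object ball, and pocket on one line.
result = predict_shot((0.0, 0.0), (1.0, 0.0), (10.0, 0.0), [(20.0, 0.0)], 1.0, 2.0)
```

In this sketch the contact condition (centers two radii apart) is also the natural moment to freeze the prediction, since after contact the cue direction no longer describes the shot.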