Box 1: Yes -
Custom Vision functionality can be divided into two features. Image classification applies one or more labels to an image. Object detection is similar, but it also returns the coordinates in the image where the applied label(s) can be found.
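The difference between the two features can be illustrated with a minimal sketch of the result shapes. The class and function names below (`BoundingBox`, `Prediction`, `to_pixels`) are hypothetical and are not the Custom Vision SDK's types. The sketch assumes box coordinates are expressed as fractions of the image size:

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class BoundingBox:
    # Hypothetical type: coordinates assumed to be fractions (0..1) of image size.
    left: float
    top: float
    width: float
    height: float


@dataclass
class Prediction:
    # Hypothetical type: a label plus a confidence score.
    tag_name: str
    probability: float
    # Present only for object detection; None for image classification.
    bounding_box: Optional[BoundingBox] = None


def to_pixels(box: BoundingBox, image_width: int, image_height: int) -> Tuple[int, int, int, int]:
    """Convert a fractional box to (left, top, width, height) in pixels."""
    return (
        round(box.left * image_width),
        round(box.top * image_height),
        round(box.width * image_width),
        round(box.height * image_height),
    )


# Image classification: a label only, no location.
classification = Prediction("dog", 0.97)

# Object detection: the same kind of label, plus where it was found.
detection = Prediction("dog", 0.91, BoundingBox(0.25, 0.5, 0.5, 0.25))
```

For an 800 x 600 image, `to_pixels(detection.bounding_box, 800, 600)` gives the pixel rectangle for the detected label, while the classification result carries no box at all.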
Box 2: Yes -
The Custom Vision service uses a machine learning algorithm to analyze images. You, the developer, submit groups of images that feature and lack the characteristics in question. You label the images yourself at the time of submission. Then, the algorithm trains on this data and calculates its own accuracy by testing itself on those same images.
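The self-evaluation step can be sketched as comparing the developer's labels with the model's predictions for the same images. This is only an illustration of how per-label precision and recall are commonly computed, not the service's actual implementation. The function name `precision_recall` and the sample data are hypothetical:

```python
from typing import List, Tuple


def precision_recall(true_labels: List[str], predicted_labels: List[str], tag: str) -> Tuple[float, float]:
    """Compute precision and recall for one tag over a set of labeled images."""
    pairs = list(zip(true_labels, predicted_labels))
    true_pos = sum(1 for t, p in pairs if t == tag and p == tag)
    false_pos = sum(1 for t, p in pairs if t != tag and p == tag)
    false_neg = sum(1 for t, p in pairs if t == tag and p != tag)
    # Guard against division by zero when the tag is never predicted or never present.
    precision = true_pos / (true_pos + false_pos) if (true_pos + false_pos) else 0.0
    recall = true_pos / (true_pos + false_neg) if (true_pos + false_neg) else 0.0
    return precision, recall


# Hypothetical data: labels supplied at submission vs. labels the trained model predicts.
submitted = ["cat", "cat", "dog", "dog"]
predicted = ["cat", "dog", "dog", "dog"]

cat_precision, cat_recall = precision_recall(submitted, predicted, "cat")
```

Here one of the two cat images is predicted as a dog, so recall for "cat" drops while its precision stays perfect, since every "cat" prediction was correct.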
Box 3: No -
The Custom Vision service can be used only with image files.
Reference:
https://docs.microsoft.com/en-us/azure/cognitive-services/Custom-Vision-Service/overview