- Microsoft Kinect camera mounted on a corner of the wall.
- It is connected to an RaspberryPi which is in turn connected to Alexa.
I imagine the flow would be something like this:
In order for an endpoint to talk to Alexa, the endpoint needs to implements AVS. AVS is the voice interface between your endpoint and Alexa. On the other hand, your application logic lives in a skill.
If the endpoint you are using is an Echo, you do not need to implement AVS. But in any case, you still need a skill to handle your application logic.
1 Person is following this question.