You are looking for documentation on the SDKs, not the sensor itself.
- Microsoft Kinect SDK, on MSDN
- OpenKinect
- OpenNI
These SDKs provide the depth and skeleton data that you can parse to determine whether something is in front of the camera and whether it is a person.
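As a rough illustration of the kind of parsing involved, here is a minimal sketch in plain Python. It assumes you have already pulled one depth frame out of whichever SDK you pick, as rows of per-pixel distances in millimetres with 0 meaning "no reading". That layout, the function name `object_in_front`, and the threshold values are all assumptions for the example, not part of any of the SDKs above. It only answers "is something there"; deciding whether it is a person is what the SDK's skeleton tracking is for.

```python
def object_in_front(depth_mm, near_mm=500, far_mm=1500, min_fraction=0.05):
    """Return True if enough pixels fall inside the [near_mm, far_mm] band.

    depth_mm is a hypothetical frame layout: a sequence of rows, each a
    sequence of distances in millimetres, where 0 is treated as "no reading".
    """
    total = 0
    hits = 0
    for row in depth_mm:
        for distance in row:
            total += 1
            # Skip invalid readings, count pixels inside the distance band.
            if distance != 0 and near_mm <= distance <= far_mm:
                hits += 1
    # An empty frame never counts as a detection.
    return total > 0 and hits / total >= min_fraction


# Synthetic 4x4 frames standing in for real sensor data.
empty_room = [[3000] * 4 for _ in range(4)]
with_object = [row[:] for row in empty_room]
for r in (1, 2):
    for c in (1, 2):
        with_object[r][c] = 1000  # a 2x2 patch about 1 m away

print(object_in_front(empty_room))
print(object_in_front(with_object))
```

With real data you would tune the band and `min_fraction` to your setup, since the right values depend on the sensor's range and on how noisy the frames are.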
If you just want to use the device as a camera, without depth and skeleton analysis, you can use one of the many image processing libraries.