Projection of 3D Coordinates onto a 2D image with known points

Question 1

I was able to solve this problem using an implementation of the Tsai algorithm (http://en.wikipedia.org/wiki/Camera_resectioning#Algorithms) which can compute a projection matrix using a minimum of 4 known points. Basically, I allow the user to specify the world coordinates of a point and then click on the image to specify the image coordinates. The algorithm uses these mappings (the more mappings, the more accurate the solution is), along with the image width and height to calculate a projection matrix. This projection matrix can then be used to project additional points onto the image using world coordinates.

Question 2

EDIT: I got myself confused by mixing units on the schema. I thus reworked a bit my answer.

It sounds feasible to me, if we look at the maths behind the projection.

Here is a not-so-rigorous schema of the situation for the horizontal coordinate (I'm mixing real world coordinates and pixels one to try to illustrate your situation):

Schema of a projection

With:

D, one of the points given by the users, with (x,y,z) its projected position with respects to the relative coordinate system defined by the camera (so after applying its translation and rotation)
E the camera point - origin of the coordinate system described above.
B the resulting point in your picture plane, with u and v in pixels. The picture plane has for dimensions w x h pixels.
f the focal length ~~(same unit as for x, y, z...)~~ in pixels, F its value in the real-world unit, and α the horizontal half-angle of view - the values you want to evaluate

You can see that the triangles ECD and EBM are similar, so using the Side-Splitter Theorem, we get:

EM / EC = MB / CD <=> f / z = u / x (we are comparing ratio, so no problem if the left member of the equation ~~uses a real-world unit while the right one uses pixels~~ are real-world values divided by pixels one)

We thus get:

f = u / x * z

Now if you want ~~α~~ F, I think you'll need to know the dimensions r_x x r_y (real-world unit) of your camera's sensor, since:

~~tan(α) = (r_x / 2) / f~~ F = r_x / (w / 2) * f

But as for α, you can get it through:

tan(α) = (w / 2) / f

If you want to do the parallel with the Wikipedia article you're pointing out, we've been using:

Projection to screen

Where:

(d_x,d_y,d_z) = (x,y,z), position of the point in the camera system
(s_x,s_y) = (w,h), size of your printable surface
(r_x,r_y,r_z) = (r_x,r_y,f), characteristics of your recording surface
(b_x,b_y) = (u,v), position on your printable surface