To detect where the text is on the page, I would recommend using OpenCV, then sending each text region to Tesseract.
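To find the text:
1. Erode the image. With dark text on a light background, this thickens the strokes so neighbouring letters merge into solid blobs.
2. Find the contours of those blobs.
3. Get the bounding box of each contour.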
Each bounding box should then contain either text or a logo/picture.