Question

I am using xlrd to process Excel files. I am running a script on a folder that contains many files, and I am printing messages related to the files. However, for each file I run, I get the following xlrd-generated error message as well:

WARNING *** OLE2 inconsistency: SSCS size is 0 but SSAT size is non-zero

Is there a way to suppress this error message, so the CLI will only print the message I want it to?

Was it helpful?

Solution

Check out the relevant part of the xlrd docs. The 2nd arg of the open_workbook function is logfile which should be an open file object or act-alike. All it needs to support is a write method. It defaults to sys.stdout.

So, something like this (untested) should do the job:

class MyFilter(object):
    def __init__(self, mylogfile=sys.stdout):
        self.f = mylogfile
    def write(self, data):
        if "WARNING *** OLE2 inconsistency" not in data:
            self.f.write(data)

#start up
log = open("the_log_file.txt", "w")
log_filter = MyFilter(log)
book = xlrd.open_workbook("foo.xls", logfile=log_filter)

# shut down
log.close()
# or use a "with" statement

Update in response to answer by @DaniloBargen:

It's not xlrd that's writing the newline separately, it's the Python print statement/function. This script:

class FakeFile(object):
    def write(self, data):
        print repr(data)

ff = FakeFile()
for x in "foo bar baz".split():
    print >> ff, x

produces this output for all Pythons 2.2 to 2.7 both inclusive:

'foo'
'\n'
'bar'
'\n'
'baz'
'\n'

A suitably modernised script (print as a function instead of a statement) produces identical output for 2.6, 2.7, 3.1, 3.2, and 3.3. You can work around this with a more complicated filter class. The following example additionally allows a sequence of phrases to be checked for:

import sys, glob, xlrd

class MyFilter(object):
    def __init__(self, mylogfile=sys.stdout, skip_list=()):
        self.f = mylogfile
        self.state = 0
        self.skip_list = skip_list
    def write(self, data):
        if self.state == 0:
            found = any(x in data for x in self.skip_list)
            if not found:
                self.f.write(data)
                return
            if data[-1] != '\n':
                self.state = 1
        else:
            if data != '\n':
                self.f.write(data)
            self.state = 0

logf = open("the_log_file.txt", "w")
skip_these = (
    "WARNING *** OLE2 inconsistency",
    )
try:        
    log_filter = MyFilter(logf, skip_these)
    for fname in glob.glob(sys.argv[1]):
        logf.write("=== %s ===\n" % fname)
        book = xlrd.open_workbook(fname, logfile=log_filter)
finally:
    logf.close()

OTHER TIPS

The answer by John works, but has a small problem:

xlrd writes that warning message and the following newline character separately to the logfile. Therefore you will get an empty line in your stdout instead of the message, if you use the filter class proposed by John. You shouldn't simply filter out all newlines from the log output either though, because there could be "real" warnings that would then be missing the newlines.

If you want to simply ignore all log output by xlrd, this is probably the simplest solution:

book = xlrd.open_workbook("foo.xls", logfile=open(os.devnull, 'w'))
import warnings

def fxn():
    warnings.warn("deprecated", DeprecationWarning)

with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    fxn()

-> http://docs.python.org/library/warnings.html#temporarily-suppressing-warnings

For what it's worth I had the same warning message; when I deleted the first row (which was empty) the warning disappeared.

Based on John Machin's answer, here is some working code (tested with Python 3.6) that works with sys.stdout:

import io
import sys
import xlrd

path = "somefile.xls" # path to an XLS file to load or a file-like object for one
encoding_override = None

# Mute xlrd warnings for OLE inconsistencies
class LogFilter(io.TextIOWrapper):
    def __init__(self, buffer=sys.stdout, *args, **kwargs):
        self.buffer = buffer
        super(LogFilter, self).__init__(buffer, *args, **kwargs)
    def write(self, data):
        if isinstance(data, str):
            if not data.startswith("WARNING *** OLE2 inconsistency: "):
                super(LogFilter, self).write(data)
        elif isinstance(data, bytes):
            super(LogFilter, self).write(data.decode(self.buffer.encoding))
        else:
            super(LogFilter, self).write(data)

def open_workbook(file_contents, encoding_override):
    logfilter = LogFilter()
    return xlrd.open_workbook(file_contents=file_contents, logfile=logfilter,
                              encoding_override=encoding_override)

if hasattr(path, 'read'):
    book = open_workbook(file_contents=path.read(), encoding_override=encoding_override)
else:
    with open(path, 'rb') as f:
        book = open_workbook(file_contents=f.read(), encoding_override=encoding_override)
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top