negative lookbehind stopping at quantified whitespace?

Question 1

Because the regex is not anchored in any way, it is free to be as loose as it likes.

In this case, let's look at how your string can be broken down. the square brackets indicate the attempted match.

... </p>[   </li>] // Fails, lookbehind assertion denies match
... </p> [  </li>] // Succeeds, lookbehind sees a space, not </p>

So you see the match succeeds simply by matching one less space, which is why you see a space between the two </p> in the result.

There's no easy fix for this in Regex. THE PONY HE COMES. So instead try using a parser.

$dom = new DOMDocument();
$dom->loadHTML($html);
$lis = $dom->getElementsByTagName('li');
foreach($lis as $li) {
    if( !$li->getElementsByTagName('p')->length) {
        $p = $dom->createElement("p");
        while($li->firstChild) $p->appendChild($li->firstChild);
        $li->appendChild($p);
    }
}
$output = $dom->saveHTML($dom->getElementsByTagName('body')->item(0));
$output = substr($output,strlen("<body>"),-strlen("</body>")); // strip body tag

Question 2

You have this:

</p>   </li>

And your regex doesn't match here:

</p>   </li>
    ^

because there's a </p> immediately preceding. But it DOES match here:

</p>   </li>
     ^

because the preceding text is not </p>, but .

You want an HTML parser. PHP comes with several, but I'm not much of a PHP dev so I can't recommend any in particular. See this question for some recommendations.

Question 3

This might help.

$html = preg_replace('@(<li[^>]*>)([^</li>]+)(?!\s*<p)@i', '$1<p>$2</p>', $html);