Question

How would you use Beautiful Soup to get a URL's base name in Python? Given the URL as a string, what would you do?

Solution 2

I'd use urlparse over BeautifulSoup for extracting pieces of a URL. Here's an example:

# Python 3; in Python 2 the import was `from urlparse import urlparse`
from urllib.parse import urlparse

parsed_url = urlparse('http://example.com/filename.txt')
print(parsed_url.path)

The output will be:

/filename.txt
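
The path still carries its leading slash and any parent directories, so getting only the base name takes one more step. Here's a minimal sketch, assuming Python 3; url_basename is a hypothetical helper name, not part of any library. It hands the parsed path to posixpath.basename:

```python
import posixpath
from urllib.parse import urlparse


def url_basename(url):
    """Return the last path segment of a URL (hypothetical helper)."""
    # urlparse separates the path from the query string and fragment,
    # so neither ends up in the result.
    path = urlparse(url).path
    # URL paths always use forward slashes, so posixpath is used here
    # instead of the platform-dependent os.path.
    return posixpath.basename(path)


print(url_basename('http://example.com/filename.txt'))          # filename.txt
print(url_basename('http://example.com/dir/file.txt?x=1#top'))  # file.txt
```

A URL whose path ends in a slash, such as http://example.com/, gives an empty string, so callers may want to check for that case.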

OTHER TIPS

If by base name you mean that, given http://example.com/file.txt, you want file.txt, then you do not need Beautiful Soup at all. Simple string manipulation would work.
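
One way to do that string manipulation is to split at the last slash. This is only a sketch; basename_by_split is a hypothetical helper name chosen for illustration:

```python
def basename_by_split(url):
    """Return everything after the last '/' in the URL (hypothetical helper)."""
    # maxsplit=1 makes rsplit cut only at the final slash;
    # [-1] selects the piece that follows it.
    return url.rsplit('/', 1)[-1]


print(basename_by_split('http://example.com/file.txt'))  # file.txt
```

This approach does not understand URL structure: a query string stays attached (file.txt?x=1), and a URL with no path, such as http://example.com, returns example.com.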

os.path.basename('http://example.com/file.txt') would also give you file.txt, although it treats the URL as an ordinary file path, so any query string or fragment stays in the result.

Licensed under: CC-BY-SA with attribution