Python fetching &lt;title&gt; </h1> <p id="fontePrincipale" class=""> <a class="text-dark" href="https://stackoverflow.com/questions/1660302" target="_blank"> <img src="https://cdn.sstatic.net/Sites/stackoverflow/img/favicon.ico" alt="StackOverflow" style="width:32px;height:32px;vertical-align:middle;" /> https://stackoverflow.com/questions/1660302 <i class="fas fa-hand-point-right"></i> </a> </p> </div> <div id="boxInfo" class="col-md-12"> <div class="row" style="margin:0px;"> <ul class="col-md-12"> <li> <i class="fas fa-tags"></i>  </li> <li class="tag badge"> <a class="nome" title="python" href="/de/tag/5732/python"> python </a> </li> <li class="tag badge"> <a class="nome" title="urllib2" href="/de/tag/10267/urllib2"> urllib2 </a> </li> </ul> <ul class="col-md-12 mt-2"> <li class="date"><i class="fas fa-clock"></i> <time>11-09-2019</time></li> <li> | </li> <li class="timeToRead"><i class="fas fa-book-reader"></i> <span class="eta"></span></li> </ul> </div> </div> <div class="hidden"> <div id="detailMultilanguage" class="dropdown language-selection"> <button class="btn btn-light dropdown-toggle nav-link" type="button" id="dropdownMenuButtonDetails" data-bs-toggle="dropdown" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false" > Deutsch <img src="/img/icone/bandiere/de.svg" alt="de" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" /> </button> <div class="dropdown-menu dropdown-menu-right" aria-labelledby="dropdownMenuButtonDetails"> <a class="dropdown-item" href="https://www.generacodice.com/it/articolo/243175/python-recupero-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/it.svg" alt="italian" /> italiano</a> <a class="dropdown-item" href="https://www.generacodice.com/en/articolo/243175/python-fetching-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/gb.svg" 
alt="english" /> english</a> <a class="dropdown-item" href="https://www.generacodice.com/fr/articolo/243175/fetching-python-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/fr.svg" alt="french" /> français</a> <a class="dropdown-item" href="https://www.generacodice.com/es/articolo/243175/ir-a-buscar-python-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/es.svg" alt="spanish" /> española</a> <a class="dropdown-item" href="https://www.generacodice.com/cn/articolo/243175/%e8%9f%92%e5%8f%96-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/cn.svg" alt="chinese" /> 中国</a> <a class="dropdown-item" href="https://www.generacodice.com/jp/articolo/243175/python%e3%81%ae%e3%83%95%e3%82%a7%e3%83%83%e3%83%81-%e3%82%bf%e3%82%a4%e3%83%88%e3%83%ab"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/jp.svg" alt="japanese" /> 日本の</a> <a class="dropdown-item" href="https://www.generacodice.com/ar/articolo/243175/%d8%a8%d9%8a%d8%ab%d9%88%d9%86-%d8%ac%d9%84%d8%a8-%d8%a7%d9%84%d8%b9%d9%86%d9%88%d8%a7%d9%86"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/ar.svg" alt="arabic" /> العربية</a> <a class="dropdown-item" href="https://www.generacodice.com/de/articolo/243175/python-fetching-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/de.svg" alt="german" /> Deutsch</a> <a class="dropdown-item" href="https://www.generacodice.com/ko/articolo/243175/%ed%8c%8c%ec%9d%b4%ec%8d%ac-%ea%b0%80%ec%a0%b8-%ec%98%a4%ea%b8%b0-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid 
#fff;border-radius:160px;" src="/img/icone/bandiere/ko.svg" alt="korean" /> 한국어</a> <a class="dropdown-item" href="https://www.generacodice.com/pt/articolo/243175/python-atraente-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/pt.svg" alt="portuguese" /> Português</a> <a class="dropdown-item" href="https://www.generacodice.com/ru/articolo/243175/%d0%b2%d1%8b%d0%b1%d0%be%d1%80%d0%ba%d0%b0-%d0%bd%d0%b0-python-title"><img class="" loading="lazy" style="width:22px;height:22px;border:1px solid #fff;border-radius:160px;" src="/img/icone/bandiere/ru.svg" alt="russian" /> Russian</a> </div> </div> </div> </div> <div class="row"> <div id="boxLeft" class="col-xl-6" style="position:relative;"> <div id="boxArgomentoDescrizione" class="boxArticolo domanda"> <p class="title text-uppercase" style="background:linear-gradient(90deg,#EE5B32,#FB7928,#EE5B32) !important;margin-top:0px;"> <i class="fas fa-question"></i> Question </p> <div class="testo"> <P> I want to fetch the title of a web page that I open with urllib2. What is the best way to parse the HTML and find what I need (for now only the <code>&lt;title&gt;</code> tag, but possibly more in the future)? </P> <P> Is there a good parsing library for this purpose? 
</P> </div> </div> </div> <div id="boxRight" class="tab-content col-xl-6"> <div id="boxSoluzioneDescrizione" class="boxArticolo"> <div class="row"> <div class="col-md-6"> <div class="row justify-content-start"> <div class="col-md-12"> <form id="feedback" action="/de/articolo/feedback" method="post"> <input type="hidden" name="_csrf" value="nGBBwA18mcyma62QA-20A3bti_AP7PTLmmqE31v7v0H4WAv4Wg-vnugg3vRinOt2Gt3bpE6HnfKjPL2-aoHcKg=="> <div class="hidden" style="display:none;"> <div class="form-group field-feedbackform-pagina required"> <input type="hidden" id="feedbackform-pagina" class="pagina" name="FeedbackForm[pagina]" value="/articolo/details"> <p class="help-block help-block-error"></p> </div> <div class="form-group field-feedbackform-idargomento"> <input type="hidden" id="feedbackform-idargomento" class="idArgomento" name="FeedbackForm[idArgomento]" value="243175"> <p class="help-block help-block-error"></p> </div> </div> <div> War es hilfreich? <div class="example-block text-center"> <label class="radio-inline" for="happy" style="font-size:1.5em;cursor:pointer;color:green;"> <i class="far fa-thumbs-up" title="This answer is useful"></i> <!-- fas per effetto mano piena --> <!--<img class="votoImg" src="https://img.icons8.com/color/100/000000/bored.png" width="84" height="84" loading="lazy" fetchpriority="high"> --> </label> <input type="radio" id="happy" class="voto input-hidden" name="FeedbackForm[voto]" value="10"> </div> <div class="example-block text-center"> <label class="radio-inline" for="sad" style="font-size:1.5em;cursor:pointer;color:red;"> <i class="far fa-thumbs-down" title="This answer is not useful"></i> <!-- fas per effetto mano piena --> <!--<img class="votoImg" src="https://img.icons8.com/color/100/000000/boring.png" width="84" height="84" loading="lazy" fetchpriority="high">--> </label> <input type="radio" id="sad" class="voto input-hidden" name="FeedbackForm[voto]" value="0"> </div> <!--<div class="col-auto example-block text-center"> <label 
class="radio-inline"> <input type="radio" name="voto" id="exicetd" class="input-hidden" /> <img class="votoImg" src="https://img.icons8.com/color/100/000000/smiling.png " width="84" height="84" loading="lazy" fetchpriority="high"> </label> </div>--> </div> <div class="row footer justify-content-between"> <div class="col"> <button type="button" class="btn btn-primary" data-dismiss="modal">Submit</button> </div> </div> </form> </div> </div> </div> <div class="col-md-6"> </div> </div> <div class="row "> <div class="col-md-12"> <p class="title" style="background-color:green;"> <i class="far fa-thumbs-up"></i> Solution </p> <div class="testo"> <P> Yes, I would recommend <a href="http://www.crummy.com/software/BeautifulSoup/" rel="noreferrer"> BeautifulSoup </a> </p> <P> If you are getting the title, it is simple: </p> <pre><code># html is the page source, e.g. urllib2.urlopen(url).read()
soup = BeautifulSoup(html)
myTitle = soup.html.head.title
</code></pre> <P> or </p> <pre><code># Calling the soup is shorthand for find_all (findAll in BeautifulSoup 3); it returns a list of matching tags
myTitle = soup('title')
</code></pre> <P> Taken from <a href="http://www.crummy.com/software/BeautifulSoup/documentation.html" rel="noreferrer"> the documentation </a> </p> <P> It is very robust and will parse the HTML no matter how messy it is. 
</P> </div> </div> </div> </div> </div> </div> <div class="row mt-4 adv"> <div class="col-12 text-center"> <ins class="adsbygoogle" style="display:block; text-align:center;" data-ad-layout="in-article" data-ad-format="fluid" data-ad-client="ca-pub-5108424997424987" data-ad-slot="1879801491"></ins> <script defer async crossorigin="anonymous"> (adsbygoogle = window.adsbygoogle || []).push({}); </script> </div> </div> <div class="row mt-4 adv"> <div class="col-12 text-center"> </div> </div> <div class="row mt-4"> <div class="col-12"> <div id="boxSoluzioniAvanzate" class="boxArticolo soluzioni"> <p class="title" style="background-color:black;"><i class="fas fa-file-alt"></i> Other tips</p> <div class="testo"> <div id="alt1082344" class="boxBorderTop row noMargin pt-4"> <div class="col-md-12 text-left"> <i class="far fa-newspaper fa-2x mb-2" style="display:block;color:gray;"></i> </div> <div class="col-md-12"> <P> Try <a href="http://www.crummy.com/software/BeautifulSoup/" rel="noreferrer"> Beautiful Soup </a>: </p> <pre><code>import urllib2
from BeautifulSoup import BeautifulSoup

url = 'http://www.example.com'
response = urllib2.urlopen(url)
html = response.read()
soup = BeautifulSoup(html)
title = soup.html.head.title
print title.contents  # contents is a list of the tag's children
</code></pre> </div> </div> <div id="alt1082345" class="boxBorderTop row noMargin pt-4"> <div class="col-md-12 text-left"> <i class="far fa-newspaper fa-2x mb-2" style="display:block;color:gray;"></i> </div> <div class="col-md-12"> <P> Why import a whole extra library for one task? No regular expressions? Wasn't the request about urllib, not BS4 or mech, which are third-party? Use the standard library to parse the HTML and match the string, then split on <code>'&gt;'</code> and <code>'&lt;'</code> with re or whatever. 
</p> <pre><code># html is the page source as a string; scan it line by line
for a in html.splitlines():
    if '&lt;title&gt;' in a:
        Title = str(a)
</code></pre> <P> This is Python 2. I think you can then strip the tags off. </p> </div> </div> <div id="alt1082346" class="boxBorderTop row noMargin pt-4"> <div class="col-md-12 text-left"> <i class="far fa-newspaper fa-2x mb-2" style="display:block;color:gray;"></i> </div> <div class="col-md-12"> <P> Use <a href="http://www.crummy.com/software/BeautifulSoup/" rel="nofollow noreferrer"> Beautiful Soup </a>. </P> <pre><code>import urllib2
from BeautifulSoup import BeautifulSoup

html = urllib2.urlopen("...").read()
soup = BeautifulSoup(html)
print soup.title.string
</code></pre> </div> </div> </div> </div> </div> </div> <div class="row mt-4"> <div class="col-12"> <div class="attribution"> <div>Licensed under: <a href="https://creativecommons.org/licenses/by-sa/3.0/" target="_blank">CC-BY-SA</a> with <a href="https://stackoverflow.blog/2009/06/25/attribution-required/" target="_blank">attribution</a></div> <div>Not affiliated with <a href="https://stackoverflow.com/" target="_blank">StackOverflow</a></div> </div> </div> </div> <div id="share"></div> </div> <div class="row mb-4 adv"> <div class="col-md-12 text-center"> <!-- GeneraCodice - Footer pagina --> <ins class="adsbygoogle" style="display:block" data-ad-client="ca-pub-5108424997424987" data-ad-slot="5412049179" data-ad-format="auto" data-full-width-responsive="true"></ins> <script defer async crossorigin="anonymous"> (adsbygoogle = window.adsbygoogle || []).push({}); </script> </div> </div> </div> </div> <aside id="bannerRight" class="col-xs-12 col-md-4 col-lg-3 text-center" > <div class="container mt-4"> <div class="row mb-4 adv"> <div class="col-md-12"> <!-- GeneraCodice - Barra laterale --> <ins class="adsbygoogle" style="display:block" data-ad-client="ca-pub-5108424997424987" data-ad-slot="1592207755" data-ad-format="auto" data-full-width-responsive="true"></ins> <script defer async crossorigin="anonymous"> (adsbygoogle = 
window.adsbygoogle || []).push({}); </script> </div> </div> <div class="row adv"> <div class="col-md-12"> <!-- GeneraCodice - Barra laterale 2 --> <ins class="adsbygoogle" style="display:block" data-ad-client="ca-pub-5108424997424987" data-ad-slot="8889943968" data-ad-format="auto" data-full-width-responsive="true"></ins> <script defer async crossorigin="anonymous"> (adsbygoogle = window.adsbygoogle || []).push({}); </script> </div> </div> <div class="row topArticoli justify-content-center"> <div class="col-md-12 col-lg-10 pt-4"> </div> </div> </div> </aside> </div> </section> <!-- DA INSERIRE COLLEGAMENTO A GENERANEWS E GRATISFORGRATIS.COM --> <footer class="site-footer"> <div class="section-free d-block d-md-flex"> <div class="section-newsletter col"> </div> <div class="col content-free-projects mb-2"> <div> <p class="my-3">Nützliche Links</p> </div> <div class="d-flex justify-content-around"> <div></div> <div> <a class="nav-link" href="https://www.generacodice.com/de/tag">Stichworte</a> <a class="nav-link" href="https://www.generacodice.com/de/site/aboutus">Über uns</a> <a class="nav-link" href="https://www.generacodice.com/de/site/contacts">Kontakte</a> <a class="nav-link" href="https://www.generacodice.com/de/site/privacy">Privatsphäre</a> </div> <div> <a class="nav-link social fb" href="https://www.facebook.com/generacodice" target="_blank"><i class="fab fa-facebook"></i> Facebook</a> <a class="nav-link social instagram" href="https://www.instagram.com/genera_codice" target="_blank"><i class="fab fa-instagram"></i> Instagram</a> </div> <div></div> </div> <div class="small-footer-link d-flex align-items-center justify-content-center"> <form action="https://www.paypal.com/cgi-bin/webscr" method="post" target="_top"> <input type="hidden" name="cmd" value="_s-xclick" /> <input type="hidden" name="hosted_button_id" value="42ZKUPRLM66J2" /> <input type="image" src="https://www.paypalobjects.com/en_US/i/btn/btn_donate_SM.gif" border="0" name="submit" title="PayPal - 
The safer, easier way to pay online!" alt="Donate with PayPal button" /> </form> </div> </div> </div> <div class="row m-0 justify-content-center text-center p-2"> <div class="col-md-5"> <p>Der Inhalt ist unter Creative Commons lizenziert.</p> <p class="mb-0">Wenn Sie Urheberrechtsverletzungen finden, können Sie uns unter uns kontaktieren <a href="mailto:info@generacodice.com"> info@generacodice.com </a> um die Entfernung des Inhalts zu beantragen.</p> </div> </div> </footer> <div id="scroll-to-top" style="display: block;background:none;"> <img src="https://www.generacodice.com/img/icone/scroll-top.svg" alt="scroll top" style="width:48px;height:48px;background-color:#fff;" /> </div> <!-- Google Analytics --> <!-- Google tag (gtag.js) --> <script src="https://www.googletagmanager.com/gtag/js?id=G-PNYLV6VWJG" async crossorigin="anonymous"></script> <script crossorigin="anonymous" defer> window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', 'G-PNYLV6VWJG'); </script> <!-- Visualizzare barra ricerca su google --> <script type="application/ld+json" crossorigin="anonymous"> { "@context": "https://schema.org", "@type": "WebSite", "url": "https://www.generacodice.com/", "potentialAction": { "@type": "SearchAction", "target": "https://www.generacodice.com/articolo?ricerca={search_term_string}", "query-input": "required name=search_term_string" } } </script> <!-- Yandex.Metrika counter <script type="text/javascript" defer crossorigin="anonymous"> (function(m,e,t,r,i,k,a){m[i]=m[i]||function(){(m[i].a=m[i].a||[]).push(arguments)}; m[i].l=1*new Date();k=e.createElement(t),a=e.getElementsByTagName(t)[0],k.async=1,k.src=r,a.parentNode.insertBefore(k,a)}) (window, document, "script", "https://mc.yandex.ru/metrika/tag.js", "ym"); ym(79291009, "init", { clickmap:true, trackLinks:true, accurateTrackBounce:true, webvisor:true }); </script> <noscript><div><img src="https://mc.yandex.ru/watch/79291009" 
style="position:absolute; left:-9999px;" alt="" /></div></noscript> <!-- /Yandex.Metrika counter --> <script>var lingua = "https://www.generacodice.com/de";</script> <script src="/lib/wow.min.js" preload></script> <script src="/lib/js.cookie.min.js" preload></script> <script src="https://cdn.jsdelivr.net/npm/cookie-bar/cookiebar-latest.min.js?customize=1&tracking=1&thirdparty=1&always=1&noGeoIp=1&showNoConsent=1&showPolicyLink=1&privacyPage=https%3A%2F%2Fwww.generacodice.com%2Fsite%2Fprivacy" preload></script> <script src="/js/form_ricerca.js" preload></script> <script src="https://kit.fontawesome.com/99a60a9345.js" preload></script> <script src="/js/ads.js" defer="defer" preload></script> <script src="/js/main.js?timestamp=20221207" defer="defer" preload></script> <script src="/assets/44258436/yii.js"></script> <script src="/assets/44258436/yii.validation.js"></script> <script src="/assets/44258436/yii.activeForm.js"></script> <script src="/js/feedback.js" defer></script> <script src="/js/articulate.min.js"></script> <script src="/js/playerTesto.js?202310021220"></script> <script src="/lib/jssocials/jssocials.min.js"></script> <script src="/js/sceditor/minified/sceditor.min.js"></script> <script src="/js/sceditor/minified/formats/xhtml.js"></script> <script src="/js/articolo/details.js?202309292139"></script> <script>jQuery(function ($) { jQuery('#feedback').yiiActiveForm([{"id":"feedbackform-pagina","name":"pagina","container":".field-feedbackform-pagina","input":"#feedbackform-pagina","error":".help-block.help-block-error","validate":function (attribute, value, messages, deferred, $form) {yii.validation.required(value, messages, {"message":"Pagina darf nicht leer sein."});}},{"id":"feedbackform-idargomento","name":"idArgomento","container":".field-feedbackform-idargomento","input":"#feedbackform-idargomento","error":".help-block.help-block-error","validate":function (attribute, value, messages, deferred, $form) {yii.validation.number(value, messages, 
{"pattern":/^[+-]?\d+$/,"message":"Id Argomento muss eine Ganzzahl sein.","skipOnEmpty":1});}}], []); jQuery('#w0').yiiActiveForm([], []); });</script> <script> window.addEventListener('load', function() { var is_adsense_load = 0 window.addEventListener('scroll', function() { if (is_adsense_load == 0) { is_adsense_load = 1; var ele = document.createElement('script'); ele.async = true; ele.src = 'https://pagead2.googlesyndication.com/pagead/js/adsbygoogle.js' var sc = document.getElementsByTagName('script')[0] sc.parentNode.insertBefore(ele, sc); (adsbygoogle = window.adsbygoogle || []).push({ google_ad_client: "ca-pub-5108424997424987", enable_page_level_ads: true }); } }) }) </script> </body> </html>