{"id":4221,"date":"2020-09-25T20:58:00","date_gmt":"2020-09-25T23:58:00","guid":{"rendered":"https:\/\/www.kadunew.com\/blog\/?p=4221"},"modified":"2021-09-02T17:30:57","modified_gmt":"2021-09-02T20:30:57","slug":"xpath-screaming-frog-seo-extracao-de-conteudo","status":"publish","type":"post","link":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo","title":{"rendered":"XPath Screaming Frog SEO &#8211; extra\u00e7\u00e3o de conte\u00fado"},"content":{"rendered":"<p>Este artigo explica como podemos usar o <strong>recurso de extra\u00e7\u00e3o personalizado<\/strong> do <a href=\"https:\/\/www.screamingfrog.co.uk\/seo-spider\/\" target=\"_blank\" rel=\"noopener noreferrer\">Screaming Frog SEO<\/a> para extrair informa\u00e7\u00f5es de um site. O recurso de extra\u00e7\u00e3o personalizado permite que voc\u00ea extraia praticamente qualquer conte\u00fado do c\u00f3digo fonte HTML.<\/p>\n<p><!--more--><\/p>\n<p>Como exemplo apresento diversas extra\u00e7\u00e3o personalizadas. Voc\u00ea pode copiar os exemplos e modificar as express\u00f5es para se adaptarem ao seu cen\u00e1rio.<\/p>\n<p>Por que usar a extra\u00e7\u00e3o personalizada? Screaming Frog, por padr\u00e3o, coleta muitas <strong>informa\u00e7\u00f5es relevantes que auxilia a an\u00e1lise de SEO<\/strong>, como: t\u00edtulos de p\u00e1gina, elementos H1 e h2, tags can\u00f4nicas etc. Mas e se voc\u00ea quiser extrair algumas informa\u00e7\u00f5es como H3 e H4, ou contar o n\u00famero de ocorr\u00eancia de um determinado elemento. Isso pode se necess\u00e1rio para lhe ajudar na reestrutura\u00e7\u00e3o da arquitetura da informa\u00e7\u00f5es do site, por exemplo.<\/p>\n<h2>O que \u00e9 o XPath?<\/h2>\n<p><strong>Linguagem de consulta<\/strong> utilizada para localizar e processar itens em documentos XML. As express\u00f5es XPath podem ser usadas em HTML tamb\u00e9m, j\u00e1 que possui uma estrutura hier\u00e1rquica semelhante ao XML. \u00c9 uma ferramenta <strong>vers\u00e1til para navegar pelos elementos e atributos<\/strong> de um documento HTML e extrair seu conte\u00fado.<\/p>\n<p>Al\u00e9m do XPath temos no Screaming Frog as op\u00e7\u00f5es de CSSPath e Regex. Eu particularmente tenho prefer\u00eancia em usar o XPath e uso com mais frequ\u00eancia que os outros m\u00e9todos de extra\u00e7\u00e3o.<\/p>\n<h2>Como usar a extra\u00e7\u00e3o personalizada do Screaming Frog<\/h2>\n<p>Para acessar o recurso Extra\u00e7\u00e3o personalizada Clique em <em>Configuration <\/em>&gt; <em>Custom <\/em>&gt; <em>Extraction<\/em>.<\/p>\n<figure id=\"attachment_4231\" aria-describedby=\"caption-attachment-4231\" style=\"width: 383px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-4231\" src=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg\" alt=\"Acessando as op\u00e7\u00f5es de extra\u00e7\u00e3o - screaming frog\" width=\"383\" height=\"509\" srcset=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg 383w, https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog-226x300.jpg 226w\" sizes=\"(max-width: 383px) 100vw, 383px\" \/><figcaption id=\"caption-attachment-4231\" class=\"wp-caption-text\">Acessando as op\u00e7\u00f5es de extra\u00e7\u00e3o<\/figcaption><\/figure>\n<p>Temos 10 campos que podem ser personalizados para <strong>extrair informa\u00e7\u00f5es de p\u00e1ginas HTML<\/strong>. Abaixo apresento parte dessa janela onde configuramos as regras de extra\u00e7\u00e3o:<\/p>\n<p>A janela abaixo pode variar de acordo com a vers\u00e3o do seu programa<\/p>\n<figure id=\"attachment_4236\" aria-describedby=\"caption-attachment-4236\" style=\"width: 1082px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-4236\" src=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/custom-extraction-screaming-frog-1.jpg\" alt=\"Configurando extra\u00e7\u00e3o personalizada\" width=\"1082\" height=\"279\" srcset=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/custom-extraction-screaming-frog-1.jpg 1082w, https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/custom-extraction-screaming-frog-1-300x77.jpg 300w, https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/custom-extraction-screaming-frog-1-1024x264.jpg 1024w\" sizes=\"(max-width: 1082px) 100vw, 1082px\" \/><figcaption id=\"caption-attachment-4236\" class=\"wp-caption-text\">Janela de extra\u00e7\u00e3o screaming frog<\/figcaption><\/figure>\n<ul>\n<li><strong>Nome da extra\u00e7\u00e3o<\/strong>: Voc\u00ea pode digitar um nome para a pesquisa. Nome que aparecer\u00e1 nas extra\u00e7\u00f5es personalizadas e no arquivo de exporta\u00e7\u00e3o do Excel;<\/li>\n<li><strong>M\u00e9todo de extra\u00e7\u00e3o<\/strong>: Escolha a op\u00e7\u00e3o XPath;<\/li>\n<li><strong>Regra<\/strong>: Inserir a sintaxe XPath. Screaming Frog incluir um indicador de valida\u00e7\u00e3o de sintaxe. Um X vermelho indica que a sintaxe \u00e9 inv\u00e1lida, J\u00e1 um V verde significa que est\u00e1 correto;<\/li>\n<li><strong>Tipo de extra\u00e7\u00e3o<\/strong>: Escolha <em>Extract Inner HTML<\/em> (Extrair HTML interno), <em>Extract HTML Element<\/em> (Extrair elemento do HTML), <em>Extract Text<\/em> (Extrair texto) ou <em>Function Value<\/em> (valor da fun\u00e7\u00e3o).<\/li>\n<\/ul>\n<h3>Tipos de extra\u00e7\u00e3o<\/h3>\n<ul>\n<li><strong><em>Extract Inner HTML<\/em><\/strong>: o conte\u00fado HTML interno do elemento selecionado. Caso o elemento selecionado tenha outros elementos HTML, eles ser\u00e3o inclu\u00eddos na extra\u00e7\u00e3o;<\/li>\n<li><strong><em>Extract HTML Element<\/em><\/strong>: o elemento selecionado e seu conte\u00fado HTML interno s\u00e3o extra\u00eddos;<\/li>\n<li><strong><em>Extract Text<\/em><\/strong>: o conte\u00fado do elemento alvo na regra XPath e o texto de qualquer elemento interno;<\/li>\n<li><strong><em>Function Value<\/em><\/strong>: utilizado para uma escrever uma fun\u00e7\u00e3o de extra\u00e7\u00e3o.<\/li>\n<\/ul>\n<div class=\"obs\">Perceba que cada op\u00e7\u00e3o extrai diferentes partes do HTML. Use a op\u00e7\u00e3o que atenda melhor sua necessidade.<\/div>\n<h3>Exemplos extra\u00e7\u00e3o<\/h3>\n<table class=\"tab\" summary=\"Exemplos extra\u00e7\u00e3o\"><caption>Tabela 1: Exemplo com os diferentes tipos de extra\u00e7\u00e3o<\/caption>\n<thead>\n<tr>\n<th scope=\"col\">XPath<\/th>\n<th scope=\"col\">Resultado<\/th>\n<th scope=\"col\">Extra\u00e7\u00e3o<\/th>\n<\/tr>\n<\/thead>\n<tfoot>\n<tr>\n<td colspan=\"2\">Op\u00e7\u00f5es de extra\u00e7\u00e3o Screaming Frog<\/td>\n<\/tr>\n<\/tfoot>\n<tbody>\n<tr>\n<td>\/descendant::h1[1]<\/td>\n<td>KADUNEW<\/td>\n<td>Extract Text<\/td>\n<\/tr>\n<tr>\n<td>\/descendant::h1[1]<\/td>\n<td>&lt;h1&gt;&lt;a href=&#8221;https:\/\/www.kadunew.com\/blog\/&#8221; title=&#8221;KADUNEW&#8221;&gt;KADUNEW&lt;\/a&gt;&lt;\/h1&gt;<\/td>\n<td>Extract HTML Element<\/td>\n<\/tr>\n<tr>\n<td>\/descendant::h1[1]<\/td>\n<td>&lt;a href=&#8221;https:\/\/www.kadunew.com\/blog\/&#8221; title=&#8221;KADUNEW&#8221;&gt;KADUNEW&lt;\/a&gt;<\/td>\n<td>Extract Inner HTML<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Sintaxe B\u00e1sica de Extra\u00e7\u00e3o<\/h2>\n<table class=\"tab\" summary=\"Sintaxe B\u00e1sica de Extra\u00e7\u00e3o&lt;\"><caption>Tabela 2: Sintaxe b\u00e1sica XPath<\/caption>\n<thead>\n<tr>\n<th scope=\"col\">Exemplo<\/th>\n<th scope=\"col\">Descri\u00e7\u00e3o<\/th>\n<\/tr>\n<\/thead>\n<tfoot>\n<tr>\n<td colspan=\"2\">Extra\u00e7\u00f5es b\u00e1sicas para XPath screaming frog<\/td>\n<\/tr>\n<\/tfoot>\n<tbody>\n<tr>\n<td>\/\/<\/td>\n<td>Pesquise em qualquer lugar do documento<\/td>\n<\/tr>\n<tr>\n<td>\/<\/td>\n<td>Pesquisar na raiz<\/td>\n<\/tr>\n<tr>\n<td>@<\/td>\n<td>Selecione um atributo espec\u00edfico de um elemento<\/td>\n<\/tr>\n<tr>\n<td>*<\/td>\n<td>Curinga, usado para selecionar qualquer elemento<\/td>\n<\/tr>\n<tr>\n<td>[ ]<\/td>\n<td>Encontre um elemento espec\u00edfico<\/td>\n<\/tr>\n<tr>\n<td>.<\/td>\n<td>Especifica o elemento atual<\/td>\n<\/tr>\n<tr>\n<td>..<\/td>\n<td>Especifica o elemento pai<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Exemplo de extra\u00e7\u00e3o personalizada<\/h2>\n<p>Nas tabelas abaixo, voc\u00ea pode copiar a sintaxe na coluna exemplo e col\u00e1-la no Screaming Frog para executar a extra\u00e7\u00e3o descrita na coluna descri\u00e7\u00e3o. Atente-se para <strong>ajustar a sintaxe<\/strong> como desejar, <strong>personalizando a extra\u00e7\u00e3o<\/strong> de acordo com suas necessidades.<\/p>\n<h3>Extra\u00e7\u00e3o de Elementos HTML<\/h3>\n<table class=\"tab\" summary=\"Extra\u00e7\u00e3o de elementos HTML\"><caption>Tabela 3: Exemplos de extra\u00e7\u00f5es personalizadas para elementos HTML<\/caption>\n<thead>\n<tr>\n<th scope=\"col\">Exemplo<\/th>\n<th scope=\"col\">Descri\u00e7\u00e3o<\/th>\n<\/tr>\n<\/thead>\n<tfoot>\n<tr>\n<td colspan=\"2\">Extra\u00e7\u00e3o de elementos HTML Screaming Frog<\/td>\n<\/tr>\n<\/tfoot>\n<tbody>\n<tr>\n<td>\/\/h1<\/td>\n<td>Extrair todas tags H1\u00a0<\/td>\n<\/tr>\n<tr>\n<td>\/\/h2[1]<\/td>\n<td>Extrair a primeira tag H2<\/td>\n<\/tr>\n<tr>\n<td>\/\/h3[2]<\/td>\n<td>Extrair a segunda tag H3<\/td>\n<\/tr>\n<tr>\n<td>\/\/div\/p<\/td>\n<td>Extrair qualquer &lt;p&gt; que seja filho de &lt;div&gt;<\/td>\n<\/tr>\n<tr>\n<td>\/\/div[@class=&#8221;author&#8221;]<\/td>\n<td>Extrair qualquer &lt;div&gt; com class &#8220;author&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/p[@class=&#8221;bio&#8221;]<\/td>\n<td>Extrair qualquer &lt;p&gt; com class &#8220;bio&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/*[@class=&#8221;bio&#8221;]<\/td>\n<td>Extrair qualquer elemento HTML com class &#8220;bio&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/ul\/li[last()]<\/td>\n<td>Extrair o \u00faltimo &lt;li&gt; de um &lt;ul&gt;<\/td>\n<\/tr>\n<tr>\n<td>\/\/ol[@class=&#8221;cat&#8221;]\/li[1]<\/td>\n<td>Extrair o primeiro &lt;li&gt; de um &lt;ol&gt; com a class &#8220;cat&#8221;<\/td>\n<\/tr>\n<tr>\n<td>count(\/\/h2)<\/td>\n<td>Conta o n\u00famero de H2\u2019s (definir filtro Extrairion para \u201cFunction Value\u201d)<\/td>\n<\/tr>\n<tr>\n<td>\/\/a[contains(.,&#8221;SEO&#8221;)]\/@href<\/td>\n<td>Extrair todos links com o texto texto \u00e2ncora &#8220;SEO&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/a[contains(translate(., &#8216;ABCDEFGHIJKLMNOPQRSTUVWXYZ&#8217;, &#8216;abcdefghijklmnopqrstuvwxyz&#8217;),&#8217;seo spider&#8217;)]\/@href<\/td>\n<td>Extrair todos links com o texto texto \u00e2ncora &#8220;SEO&#8221;. Por ser Case-sensitive a regra converte tudo para min\u00fasculo<\/td>\n<\/tr>\n<tr>\n<td>\/\/a[starts-with(@title,&#8221;Written by&#8221;)]<\/td>\n<td>Extrair qualquer link com atributo title iniciando com \u201cWritten by\u201d<\/td>\n<\/tr>\n<tr>\n<td>\/\/p[contains(text() ,&#8221;your search query here&#8221;)]<\/td>\n<td>Extrair um texto espec\u00edfico dentro de um par\u00e1grafo<\/td>\n<\/tr>\n<tr>\n<td>\/descendant::h3[1]<\/td>\n<td>Extrai o conte\u00fado do primeiro H3 rastreado<\/td>\n<\/tr>\n<tr>\n<td>\/descendant::h3[position() &gt;= 0 and position() &lt;= 10]<\/td>\n<td>Extrai os 10 primeiros H3s rastreados<\/td>\n<\/tr>\n<tr>\n<td>\/\/h3[contains(text(), \u201cexemplo\u201d)]<\/td>\n<td>Extrai conte\u00fado &#8220;exemplo&#8221; de qualquer H3<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Extra\u00e7\u00e3o de Atributos HTML<\/h3>\n<table class=\"tab\" summary=\"Extra\u00e7\u00e3o de atributos HTML\"><caption>Tabela 4: Exemplos de extra\u00e7\u00f5es personalizadas para atributos HTML<\/caption>\n<thead>\n<tr>\n<th scope=\"col\">Exemplo<\/th>\n<th scope=\"col\">Descri\u00e7\u00e3o<\/th>\n<\/tr>\n<\/thead>\n<tfoot>\n<tr>\n<td colspan=\"2\">Extraindo conte\u00fado de atributo HTML Screaming Frog<\/td>\n<\/tr>\n<\/tfoot>\n<tbody>\n<tr>\n<td>\/\/@href<\/td>\n<td>Extrair todos links<\/td>\n<\/tr>\n<tr>\n<td>\/\/a[starts-with(@href,&#8221;mailto&#8221;)]\/@href<\/td>\n<td>Extrair link que iniciam &#8220;mailto&#8221; (endere\u00e7o de e-mail)<\/td>\n<\/tr>\n<tr>\n<td>\/\/img\/@src<\/td>\n<td>Extrair URLs de todas imagens<\/td>\n<\/tr>\n<tr>\n<td>\/\/img[contains(@class,&#8221;aligncenter&#8221;)]\/@src<\/td>\n<td>Extrair URLs das imagens contendo a classe de nome &#8220;aligncenter&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/link[@rel=&#8221;alternate&#8221;]<\/td>\n<td>Extrair conte\u00fado de elementos contendo o atributo &#8220;alternate&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/@hreflang<\/td>\n<td>Extrair todos valores de hreflang<\/td>\n<\/tr>\n<tr>\n<td>\/\/head\/link[@rel=&#8221;amphtml&#8221;]\/@href<\/td>\n<td>Extrair URL AMP de uma p\u00e1gina<\/td>\n<\/tr>\n<tr>\n<td>\/\/head\/link[@rel=&#8221;alternate&#8221;]\/@href<\/td>\n<td>Extrair URL do valor alternate<\/td>\n<\/tr>\n<tr>\n<td>\/\/link[contains(@media, &#8216;640&#8217;) and @href]\/@href<\/td>\n<td>Estrai href contendo media no elemento<\/td>\n<\/tr>\n<tr>\n<td>\/\/*[@hreflang]\/@hreflang<\/td>\n<td>Extrai o valor do\u00a0hreflang<\/td>\n<\/tr>\n<tr>\n<td>\/\/iframe\/@src<\/td>\n<td>Extrai URL do iframe<\/td>\n<\/tr>\n<tr>\n<td>\/\/iframe[contains(@src ,&#8217;www.youtube.com\/embed\/&#8217;)]<\/td>\n<td>Extrai URL de v\u00eddeos do Youtube incorporado \u00e0 p\u00e1gina<\/td>\n<\/tr>\n<tr>\n<td>\/\/iframe[not(contains(@src, &#8216;https:\/\/www.googletagmanager.com\/&#8217;))]\/@src<\/td>\n<td>Extrai URL que n\u00e3o seja iframe espec\u00edfico<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[@name=&#8217;news_keywords&#8217;]\/@content<\/td>\n<td>Extrai conte\u00fado meta &#8220;news_keywords&#8221;<\/td>\n<\/tr>\n<tr>\n<td>(\/\/iframe\/@src)[1]<\/td>\n<td>Extrai URL da primeira ocorr\u00eancia de iframe<\/td>\n<\/tr>\n<tr>\n<td>\/\/div[@class=&#8221;posts&#8221;]\/\/a<\/td>\n<td>Extrai o texto \u00e2ncora dentro de uma div de classe posts. Usar &#8220;Extract Inner HTML&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/div[@class=&#8221;posts&#8221;]\/\/a\/@href<\/td>\n<td>Extrai o URL dentro de uma div de classe posts. Usar &#8220;Extract Inner HTML&#8221;<\/td>\n<\/tr>\n<tr>\n<td>\/\/div[@class=&#8221;posts&#8221;]\/\/a<\/td>\n<td>Extrai c\u00f3digo completo do link dentro de uma div com a classe posts. Usar Extract HTML Element<\/td>\n<\/tr>\n<tr>\n<td>\/\/html \/@lang<\/td>\n<td>Extrai o idioma da p\u00e1gina declarado no elemento HTML<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Extrair as tags de media social de texto como Open Graph ou Twitter Cards<\/h3>\n<table class=\"tab\" summary=\"\"><caption>Tabela 5: Extra\u00e7\u00e3o de tags media social<\/caption>\n<thead>\n<tr>\n<th scope=\"col\">Exemplo<\/th>\n<th scope=\"col\">Descri\u00e7\u00e3o<\/th>\n<\/tr>\n<\/thead>\n<tfoot>\n<tr>\n<td colspan=\"2\">Exemplos para extra\u00e7\u00e3o de tags media social Screaming Frog<\/td>\n<\/tr>\n<\/tfoot>\n<tbody>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:title&#8221;)][1]\/@content<\/td>\n<td>Extrair t\u00edtulo<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:description&#8221;)][1]\/@content<\/td>\n<td>Extrair a descri\u00e7\u00e3o<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:type&#8221;)][1]\/@content<\/td>\n<td>Extrair o tipo do Open Graph ou Twitter Cards,<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:site_name&#8221;)][1]\/@content<\/td>\n<td>Extrair o valor do nome do site<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:locale&#8221;)][1]\/@content<\/td>\n<td>Extrair valor da localidade<\/td>\n<\/tr>\n<tr>\n<td>\/\/meta[starts-with(@property, &#8220;og:image&#8221;)][1]\/@content<\/td>\n<td>Extrair URL da imagem<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Resultado da Extra\u00e7\u00e3o<\/h2>\n<p>Para acessar o conte\u00fado extra\u00eddo pelo Screaming Frog, acesse a aba <em>Custom<\/em> ou no painel \u00e0 direita acesse <em>Extraction<\/em> dentro da se\u00e7\u00e3o <em>Custom<\/em>.<\/p>\n<figure id=\"attachment_4263\" aria-describedby=\"caption-attachment-4263\" style=\"width: 663px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-4263\" src=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/screaming-frog-resultado-extracao.jpg\" alt=\"Conte\u00fado extra\u00eddo Screaming Frog\" width=\"663\" height=\"459\" srcset=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/screaming-frog-resultado-extracao.jpg 663w, https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/screaming-frog-resultado-extracao-300x208.jpg 300w\" sizes=\"(max-width: 663px) 100vw, 663px\" \/><figcaption id=\"caption-attachment-4263\" class=\"wp-caption-text\">Visualizando o resultado da extra\u00e7\u00e3o Screaming Frog<\/figcaption><\/figure>\n<h2>Copiando XPath pelo Navegador<\/h2>\n<p>O navegador Google Chrome tem um recurso que <strong>facilita a escrita da regra XPath<\/strong>. Voc\u00ea pode usar o recurso atrav\u00e9s da ferramenta dev Tools para gerar express\u00f5es XPath:<\/p>\n<ul>\n<li>Abra a Dev tools pressionando a tecla F12 (ou bot\u00e3o direito &gt; inspecionar);<\/li>\n<li>Clique com o bot\u00e3o direito do mouse sobre o elemento desejado;<\/li>\n<li>V\u00e1 em copy &gt; copy XPath;<\/li>\n<li>Talvez seja necess\u00e1rio adaptar a express\u00e3o XPath que Chrome oferece antes de usar no Screaming Frog. Por\u00e9m, voc\u00ea j\u00e1 tem uma ideia inicial da regra;<\/li>\n<li>Exemplo de uma regra XPath copiada <strong>\/\/*[@id=&#8221;post-4158&#8243;]\/header\/h2<\/strong> (exemplo abaixo).<\/li>\n<\/ul>\n<figure id=\"attachment_4226\" aria-describedby=\"caption-attachment-4226\" style=\"width: 589px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-4226\" src=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extrair-xpath-google-chrome.jpg\" alt=\"extra\u00e7\u00e3o XPath Chrome\" width=\"589\" height=\"546\" srcset=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extrair-xpath-google-chrome.jpg 589w, https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extrair-xpath-google-chrome-300x278.jpg 300w\" sizes=\"(max-width: 589px) 100vw, 589px\" \/><figcaption id=\"caption-attachment-4226\" class=\"wp-caption-text\">Exemplo de extra\u00e7\u00e3o XPath pelo Google Chrome<\/figcaption><\/figure>\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Este artigo explica como podemos usar o recurso de extra\u00e7\u00e3o personalizado do Screaming Frog SEO para extrair informa\u00e7\u00f5es de um site. O recurso de extra\u00e7\u00e3o personalizado permite que voc\u00ea extraia praticamente qualquer conte\u00fado do c\u00f3digo fonte HTML.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[56],"tags":[],"class_list":["post-4221","post","type-post","status-publish","format-standard","hentry","category-seo"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado<\/title>\n<meta name=\"description\" content=\"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado\" \/>\n<meta property=\"og:description\" content=\"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo\" \/>\n<meta property=\"og:site_name\" content=\"KADUNEW\" \/>\n<meta property=\"article:published_time\" content=\"2020-09-25T23:58:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-09-02T20:30:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg\" \/>\n<meta name=\"author\" content=\"Kadu Oliveira\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kadu Oliveira\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. tempo de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo\",\"url\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo\",\"name\":\"XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado\",\"isPartOf\":{\"@id\":\"https:\/\/www.kadunew.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg\",\"datePublished\":\"2020-09-25T23:58:00+00:00\",\"dateModified\":\"2021-09-02T20:30:57+00:00\",\"author\":{\"@id\":\"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/07b2297c4825efbd1e9f2a1018926b05\"},\"description\":\"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage\",\"url\":\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg\",\"contentUrl\":\"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg\",\"width\":383,\"height\":509,\"caption\":\"Acessando as op\u00e7\u00f5es de extra\u00e7\u00e3o\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.kadunew.com\/blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"SEO\",\"item\":\"https:\/\/www.kadunew.com\/blog\/category\/seo\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"XPath Screaming Frog SEO &#8211; extra\u00e7\u00e3o de conte\u00fado\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/#website\",\"url\":\"https:\/\/www.kadunew.com\/blog\/\",\"name\":\"KADUNEW\",\"description\":\"Artigos sobre Front-End e Programa\u00e7\u00e3o web\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.kadunew.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/07b2297c4825efbd1e9f2a1018926b05\",\"name\":\"Kadu Oliveira\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/780660fded589936b30467c54c99d51a?s=96&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/780660fded589936b30467c54c99d51a?s=96&r=g\",\"caption\":\"Kadu Oliveira\"},\"sameAs\":[\"https:\/\/www.kadunew.com\/blog\"],\"url\":\"https:\/\/www.kadunew.com\/blog\/author\/admin\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado","description":"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo","og_locale":"pt_BR","og_type":"article","og_title":"XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado","og_description":"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.","og_url":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo","og_site_name":"KADUNEW","article_published_time":"2020-09-25T23:58:00+00:00","article_modified_time":"2021-09-02T20:30:57+00:00","og_image":[{"url":"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg"}],"author":"Kadu Oliveira","twitter_misc":{"Escrito por":"Kadu Oliveira","Est. tempo de leitura":"7 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo","url":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo","name":"XPath Screaming Frog SEO - extra\u00e7\u00e3o de conte\u00fado","isPartOf":{"@id":"https:\/\/www.kadunew.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage"},"image":{"@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage"},"thumbnailUrl":"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg","datePublished":"2020-09-25T23:58:00+00:00","dateModified":"2021-09-02T20:30:57+00:00","author":{"@id":"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/07b2297c4825efbd1e9f2a1018926b05"},"description":"Artigo que mostra como usar o recurso de extra\u00e7\u00e3o personalizada do Screaming Frog para extrair informa\u00e7\u00f5es do HTML de sites.","breadcrumb":{"@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#primaryimage","url":"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg","contentUrl":"https:\/\/www.kadunew.com\/blog\/wp-content\/uploads\/2020\/05\/extracao-xpath-screaming-frog.jpg","width":383,"height":509,"caption":"Acessando as op\u00e7\u00f5es de extra\u00e7\u00e3o"},{"@type":"BreadcrumbList","@id":"https:\/\/www.kadunew.com\/blog\/seo\/xpath-screaming-frog-seo-extracao-de-conteudo#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.kadunew.com\/blog"},{"@type":"ListItem","position":2,"name":"SEO","item":"https:\/\/www.kadunew.com\/blog\/category\/seo"},{"@type":"ListItem","position":3,"name":"XPath Screaming Frog SEO &#8211; extra\u00e7\u00e3o de conte\u00fado"}]},{"@type":"WebSite","@id":"https:\/\/www.kadunew.com\/blog\/#website","url":"https:\/\/www.kadunew.com\/blog\/","name":"KADUNEW","description":"Artigos sobre Front-End e Programa\u00e7\u00e3o web","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.kadunew.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Person","@id":"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/07b2297c4825efbd1e9f2a1018926b05","name":"Kadu Oliveira","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.kadunew.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/780660fded589936b30467c54c99d51a?s=96&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/780660fded589936b30467c54c99d51a?s=96&r=g","caption":"Kadu Oliveira"},"sameAs":["https:\/\/www.kadunew.com\/blog"],"url":"https:\/\/www.kadunew.com\/blog\/author\/admin"}]}},"_links":{"self":[{"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/posts\/4221"}],"collection":[{"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/comments?post=4221"}],"version-history":[{"count":42,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/posts\/4221\/revisions"}],"predecessor-version":[{"id":4479,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/posts\/4221\/revisions\/4479"}],"wp:attachment":[{"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/media?parent=4221"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/categories?post=4221"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.kadunew.com\/blog\/wp-json\/wp\/v2\/tags?post=4221"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}