Cheerio_草庐IT

javascript - 使用 cheerio 检索 href

我下载了一个看起来像这样的html文件.........FeelsGood(Carrots&Beets)(1982)[Single]我需要到达每个和表的条目user_list并检索href。这是我现在的代码，它只返回一个空数组。我无法掌握的一件事是cheerio是否能够找到每个class="list_album"像这样，否则您必须使用一堆$()沿着文件的层次结构向下移动的。varcheerio=require("cheerio");varfile="...path...";varlinks=[];var$=cheerio.load(file);$('list_album').each(

javascript cheerio 34 code gt jquery node.js

javascript - 如何使用 cheerio 访问 meta OpenGraph？

我正在尝试用cheerio做一些html抓取(不能使用jsdon我有一个依赖问题:contextifyecc的错误。)但我无法获得元“og:type”，“og:标题"...request(Url,function(error,response,body){var$=cheerio.load(body);$('meta').each(function(){console.log($('meta').attr('content'));});});我只得到第一个元文本/html；charset=UTF-8"。你知道如何访问og吗？？最佳答案

javascript OpenGraph section cheerio 39 node.js

javascript - 使用 cheerio 获取页面标题

我正在尝试使用cheerio获取url的标题标签。但是，我得到的是空字符串值。这是我的代码:app.get('/scrape',function(req,res){url='http://nrabinowitz.github.io/pjscrape/';request(url,function(error,response,html){if(!error){var$=cheerio.load(html);vartitle,release,rating;varjson={title:"",release:"",rating:""};$('title').filter(function()

javascript cheerio section title function node.js express

javascript - 带有选择器的 Node.js 和 Cheerio 解析表

我正在尝试使用Node.js和Cheerio解析HTML表格并得到了一些结果，但不幸的是我得到了太多数据并且我不确定如何进一步解析它以仅获取我需要的数据.这是我目前使用的一小段代码..varrequest=require("request");varcheerio=require("cheerio");request('http://www.myURL.com',function(error,response,body){var$=cheerio.load(body);$('td').each(function(){console.log($(this).text());});});使

javascript Cheerio section gt lt node.js html-parsing

javascript - 如何使用 Cheerio js 删除 <div> 和 <br>？

我有以下html，我想通过Cheerios进行解析。var$=cheerio.load('Thisworkswell.SoIhavebeendoingthisforseveralhours.Howcomethespacedoesnotsplit?Thinkingthatthiscouldbeanissue.Testingnextparagraph.Imtestingwithanotherpost.Thispostshouldwork.Thisisfortestserver.',{normalizeWhitespace:true,});//tryingtoparsethehtml//th

amp javascript 39 code div node.js cheerio

javascript - Scrapy 之类的 Nodejs 工具？

关闭。这个问题不符合StackOverflowguidelines.它目前不接受答案。我们不允许提问寻求书籍、工具、软件库等的推荐。您可以编辑问题，以便用事实和引用来回答。关闭2年前。Improvethisquestion我想知道是否有类似Scrapyfornodejs的东西？。如果不是，您如何看待使用简单的页面下载并使用cheerio对其进行解析？有没有更好的办法。

javascript Scrapy section class notice node.js web-scraping cheerio

javascript - Async/Await with Request-Promise 返回 Undefined

我有两个文件；server.js和scrape.js，下面是它们当前的代码片段。服务器.js:constscrape=require("./scrape");asyncfunctionstart(){constresponse=awaitscrape.start();console.log(response);}start();和scrape.js:constcheerio=require("cheerio");constrequest=require("request-promise");go=async()=>{constoptions={uri:"http://www.somew

Request-Promise javascript code scrapeTitleArray const node.js asynchronous async-await cheerio

javascript - nodejs 使用 cheerio 解析 xml 返回空 CDATA

我在nodejs中使用cheerio来解析一些rss提要。我正在抓取所有将它们放入数组中的项目。我正在使用3个测试提要，它们的每个“项目”元素都有一个“描述”子元素。在其中一个提要中，整个“描述”被包装为CDATA，我无法获得它的值(value)。这是一个简短的代码片段//Openthexmldocumentwithcheerio$=cheerio.load(arrXmlDocs[i],{ignoreWhitespace:true,xmlMode:true});//Loopthrougheveryitem$('item').each(function(i,xmlItem){//arra

javascript cheerio code section xml node.js cdata

javascript - Cheerio:如何按文本内容选择元素？

我有一些这样的HTML:Code:12345Category:faucets我想获取类别名称(“水龙头”)。这是我的试用版:var$=cheerio.load(html.contents);varcategory=$('span[innerHTML="Category:"]').next().text();但这不起作用(innerHTML修饰符不选择任何内容)。有什么线索吗？最佳答案你的代码不工作的原因是因为[innerHTML]是一个属性选择器，而innerHTML不是元素的属性(这意味着什么都没有被选中)。您可以根据文本过滤

javascript Cheerio code span section jquery-selectors

还在用Python爬虫？教你一招，摆脱选择元素位置的烦恼！使用Node.js大杀器，并且无需使用cheerio 库~

咱们以豆瓣历史250最佳电影为例。豆瓣说，>_第一步：打开网页源代码第二步：选择你想要爬虫的元素，右键复制获取JS路径document.querySelector("#content>div>div.article>ol>li:nth-child(3)>div>div.pic>a>img")第三步：将这个路径复制到代码//引入模块consthttps=require('https')//不使用cheerio完成爬虫constcheerio=require('cheerio')constfs=require('fs')constjsdom=require('jsdom')const{JSDOM}

使用爬虫 gt div const AI综合