Puppeteer - Getting Element Text

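We can get the text of an element in Puppeteer by reading its textContent property. The property name is passed as an argument to the getProperty method of the element handle. getProperty returns a handle to the property value, and calling jsonValue on that handle converts it to a plain string.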

Syntax

The syntax to get the text of an element is as follows:

const n = await page.$("#txt")
const t = await (await n.getProperty('textContent')).jsonValue()
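The syntax above assumes that the selector matches an element. page.$ resolves to null when nothing matches, and calling getProperty on null then throws a TypeError. A minimal sketch of a guard for that case is shown below. The helper name getElementText is hypothetical and not part of Puppeteer, and the page argument is assumed to be a Puppeteer Page object created elsewhere.

```javascript
// Hypothetical helper (not part of Puppeteer): returns the textContent of the
// first element matching `selector`, or null when nothing matches.
async function getElementText(page, selector) {
   // page.$ resolves to null when no element matches the selector
   const handle = await page.$(selector)
   if (handle === null) {
      return null
   }
   // getProperty returns a handle to the value; jsonValue converts it to a string
   const property = await handle.getProperty('textContent')
   const text = await property.jsonValue()
   // release the element handle once the value has been read
   await handle.dispose()
   return text
}

// Usage, inside an async function that already has a `page` object:
//    const t = await getElementText(page, '#txt')
//    console.log(t === null ? 'Element not found' : 'Text is: ' + t)
```

Returning null rather than throwing is a design choice for this sketch. It lets the caller decide whether a missing element is an error.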

In the image below, let us get the text of the highlighted element, About Tutorialspoint:

[Image: About Tutorialspoint, the highlighted element]

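To begin, follow Steps 1 to 2 from the chapter Basic Test on Puppeteer, which are as follows:

Step 1 - Create a new file in the directory that contains the node_modules folder (the location where Puppeteer and Puppeteer core have been installed).

The details of Puppeteer installation are discussed in the chapter Puppeteer Installation.

Right-click the folder that contains the node_modules folder, then click the New File button.

Step 2 - Enter a filename, for example testcase1.js.

Step 3 - Add the following code to the testcase1.js file you created.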

//Puppeteer library
const pt = require('puppeteer')
async function getText(){
   //launch browser in headless mode
   const browser = await pt.launch()
   //browser new page
   const page = await browser.newPage()
   //launch URL
   await page.goto('https://tutorialspoint.com/about/about_careers.htm')
   //identify element
   const f = await page.$("[class='heading']")
   //obtain text
   const text = await (await f.getProperty('textContent')).jsonValue()
   console.log("Text is: " + text)
   //close browser so that the node process can exit
   await browser.close()
}
getText()

Step 4 - Execute the code with the following command:

node <filename>

So, in our example, we run the following command:

node testcase1.js

After the command has executed successfully, the text of the element, About Tutorialspoint, is printed to the console.
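As an alternative to getProperty, the same text can also be read with page.$eval, which runs a function against the first matching element inside the page. page.$$eval does the same for every matching element. The sketch below is illustrative only. The helper names getTextWithEval and getAllTexts are hypothetical, and the page argument is assumed to be a Puppeteer Page object such as the one created in testcase1.js.

```javascript
// Hypothetical helpers built on page.$eval and page.$$eval.

// Text of the first element matching `selector`.
// page.$eval rejects if no element matches, so the caller should handle that.
async function getTextWithEval(page, selector) {
   return page.$eval(selector, (el) => el.textContent)
}

// Text of every element matching `selector`, as an array of strings.
// The array is empty when nothing matches.
async function getAllTexts(page, selector) {
   return page.$$eval(selector, (els) => els.map((el) => el.textContent))
}

// Usage, inside an async function that already has a `page` object:
//    const text = await getTextWithEval(page, "[class='heading']")
//    console.log("Text is: " + text)
```

This form avoids the intermediate element handle, at the cost of throwing when the selector does not match anything.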
