nodejs用curl导入数据到Mailchimp时遇到的问题

在工作中,遇到用node.js spawn调用curl方法向Mailchimp导入用户数据的情形。

一、child.stdout.on的机制

一开始的代码如下:

const spawn = require('child_process').spawn

const args = `curl --request POST \
     --url '${BASE_URL}/3.0/lists/${LIST_ID}' \
     --user 'user:${API_KEY}' \
     --header 'content-type:application/json' \
     --data '${postData}' \
     --include`

const child = spawn('curl', [args], {
    shell: true
})

child.stdout.on('data', (data) => {
    console.log(data)
});

console.log(data)的结果是:

HTTP/1.1 200 OK
Server: openresty
Content-Type: application/json; charset=utf-8
Content-Length: 4530
Vary: Accept-Encoding
X-Request-Id: 19ae5ea8-1f1a-4cb8-a142-074887140753
Link: <https://us19.api.mailchimp.com/schema/3.0/Lists/Instance.json>; rel="describedBy", <https://us19.admin.mailchimp.com/lists/members/?id=16769>; rel="dashboard"
Date: Thu, 06 Sep 2018 15:25:49 GMT
Connection: keep-alive
Set-Cookie: _AVESTA_ENVIRONMENT=prod; path=/
Set-Cookie: _mcid=1.03c96f827b0fce334cb3f36bf2ec5667; expires=Fri, 06-Sep-2019 15:25:49 GMT; Max-Age=31536000; path=/; domain=.mailchimp.com

{"new_members":[],"updated_members":[],"errors":[{"email_address":"info@edicomex.com.mx_cancel","error":"Please provide a valid email address."}],"total_created":0,"total_updated":0,"error_count":1,"_links":[{"rel":"self","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Response.json"},{"rel":"parent","href":"https://us19.api.mailchimp.com/3.0/lists","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists.json"},{"rel":"update","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0","method":"PATCH","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Response.json","schema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/PATCH.json"},{"rel":"batch-sub-unsub-members","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0","method":"POST","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/BatchPOST-Response.json","schema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/BatchPOST.json"},{"rel":"delete","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0","method":"DELETE"},{"rel":"abuse-reports","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/abuse-reports","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Abuse/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Abuse.json"},{"rel":"activity","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/activity","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Activity/Response.json"},{"rel":"clients","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/clients","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Clients/Response.json"},{"rel":"growth-history","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/growth-history","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Growth/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Growth.json"},{"rel":"interest-categories","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/interest-categories","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/InterestCategories/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/InterestCategories.json"},{"rel":"members","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/members","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Members/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Members.json"},{"rel":"merge-fields","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/merge-fields","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/MergeFields/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/MergeFields.json"},{"rel":"segments","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/segments","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Segments/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Segments.json"},{"rel":"webhooks","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/webhooks","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Webhooks/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Webhooks.json"},{"rel":"signup-forms","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/signup-forms","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/SignupForms/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/SignupForms.json"},{"rel":"locations","href":"https://us19.api.mailchimp.com/3.0/lists/5b524f3ac0/locations","method":"GET","targetSchema":"https://us19.api.mailchimp.com/schema/3.0/Definitions/Lists/Locations/CollectionResponse.json","schema":"https://us19.api.mailchimp.com/schema/3.0/CollectionLinks/Lists/Locations.json"}]}

我其实想直接拿到{"new_members":以后的东西,所以我做了如下操作(其实是同事帮忙哈哈):

const spawn = require('child_process').spawn

const args = `curl --request POST \
     --url '${BASE_URL}/3.0/lists/${LIST_ID}' \
     --user 'user:${API_KEY}' \
     --header 'content-type:application/json' \
     --data '${postData}' \
     --include`

const child = spawn('curl', [args], {
    shell: true
})

child.stdout.on('data', (data) => {
    const rData = data.toString('utf8').split('\n')
    const da = (rData[rData.length - 1])
    let chunksObject = JSON.parse(da)
    console.log(chunksObject)
});

child.on('close', (code) => {
    console.log(`child process close code:${code}`);
});

可以看到,我先通过const rData = data.toString('utf8').split('\n')把stdout出来的buffer转成utf-8格式的字符串,再通过\n即换行符切分字符串,最后取最后一位const da = (rData[rData.length - 1])为我想要的{"new_members": 部分。最后通过JSON.parse(da)得到我想要的JSON格式数据。

一开始数据量小的时候还好,等数据量一大,就发现JSON.parse(da)会报错,说da不是一个JSON格式的字符串。

查询资料发现:

node.js官网里说:

stdout <Buffer> | <string> output[1] 的内容。

原来,stdout吐出来的东西是一个buffer,所以它应该是一段一段吐出来的。为了验证猜想,我在child.stdout.on里加了一段console.log('------------')

child.stdout.on('data', (data) => {
    console.log('------------')
    const rData = data.toString('utf8').split('\n')
    const da = (rData[rData.length - 1])
    let chunksObject = JSON.parse(da)
    console.log(chunksObject)
});

运行脚本,一次curl请求打印了4到5次------------,验证了其是buffer。

于是我想,不能在每次stdout处解析data了,因为它此时data是不完整的,应该在child.on('close')时,即整个buffer输出完毕后再解析。

所以我定义了一个全局变量allChunksString,然后将每次stdout出来的chunk拼接起来,最后在child.on('close')的时候去解析allChunksString

const spawn = require('child_process').spawn

const args = `curl --request POST \
     --url '${BASE_URL}/3.0/lists/${LIST_ID}' \
     --user 'user:${API_KEY}' \
     --header 'content-type:application/json' \
     --data '${postData}' \
     --include`

const child = spawn('curl', [args], {
    shell: true
})

let allChunksString = '' // a global variable to concat all the stdout chunks

child.stdout.on('data', (data) => {
    const rData = data.toString('utf8').split('\n')
    const da = (rData[rData.length - 1])
    allChunksString += da
});

child.on('close', (code) => {
    let chunksObject = JSON.parse(allChunksString)
    console.log(chunksObject)
    console.log(`child process close code:${code}`);
});

果然,现在JSON.parse就没有再报错了。

总结child.stdout.on里的结果是一个buffer,一段一段的结果,所以不能出来一次解析一次,应该将其组装起来,最后到整个buffer输出完毕后在close事件里一起解析。

二、向Mailchimp API里发送的数据格式有问题

完成上述工作后,发现还是有400多条用户数据无法导入导Mailchimp中。报错shema expect object but got NULL。最后和同事千辛万苦找出来坑爹的结果,是因为有用户填入的某项merge_field字段带有'(英文单引号)。这个有开始符却没有结束符的'问题导致整个curl的data部分数据紊乱,因为curl认为你这里的data不是正确的JSON格式,所以直接给你报错结束进程。一条数据的错误,导致一起的200条用户都没法导入(我是每200条用户导入一次。)

所以,用正则将'替换成了空格,

postData = postData.replace(/\'/g, " ") // in case some user's field has ' which will mess the shell script

然后发现可以全部导入了。

当然这不是最优解,因为将用户的'改成了空格。

坑!

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 224,289评论 6 522
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 95,968评论 3 402
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 171,336评论 0 366
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 60,718评论 1 300
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 69,734评论 6 399
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 53,240评论 1 314
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 41,631评论 3 428
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 40,599评论 0 279
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 47,139评论 1 324
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 39,166评论 3 345
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 41,286评论 1 354
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 36,917评论 5 350
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 42,604评论 3 336
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 33,075评论 0 25
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 34,205评论 1 275
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 49,814评论 3 381
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 46,351评论 2 365

推荐阅读更多精彩内容