前一段時(shí)間開始搞優(yōu)化后端 API 性能的事,發(fā)現(xiàn)根本無從下手,Kibana 中雖然有記一些簡陋且零散的日志,但也瞅不出啥眉目來??偨Y(jié)了下是日志太少了,所以決定先搞搞日志的事,目標(biāo)是記錄 API 請求相關(guān)的日志,找出哪個(gè)接口慢,最好具體到哪個(gè)函數(shù)慢。
在利通等地區(qū),都構(gòu)建了全面的區(qū)域性戰(zhàn)略布局,加強(qiáng)發(fā)展的系統(tǒng)性、市場前瞻性、產(chǎn)品創(chuàng)新能力,以專注、極致的服務(wù)理念,為客戶提供成都網(wǎng)站設(shè)計(jì)、成都網(wǎng)站制作 網(wǎng)站設(shè)計(jì)制作按需策劃,公司網(wǎng)站建設(shè),企業(yè)網(wǎng)站建設(shè),品牌網(wǎng)站制作,全網(wǎng)營銷推廣,成都外貿(mào)網(wǎng)站制作,利通網(wǎng)站建設(shè)費(fèi)用合理。
記日志必然要涉及日志打點(diǎn),怎么做日志打點(diǎn)是個(gè)問題。如果直接在代碼中插入日志打點(diǎn)代碼不僅侵入性強(qiáng)而且工作量大,也不夠靈活,于是考慮如何做智能的自動(dòng)打點(diǎn)。我們石墨后端使用的 bay 框架(基于 Koa@1 二次開發(fā),基本上你可以認(rèn)為就是用的 Koa),controller 里用了大量的 yield,所以考慮是不是可以在 yield 表達(dá)式前后打點(diǎn),調(diào)研了下覺得可以用 esprima 和 escodegen 做。
奮戰(zhàn)幾天,koa-yield-breakpoint 誕生了。
koa-yield-breakpoint 就是一個(gè)在 controllers 或者 routes 里的 yield 表達(dá)式前后打點(diǎn)的工具,不用插入一行日志打點(diǎn)代碼,只需要在引入的時(shí)候配置一下,可以記錄每個(gè)請求到來時(shí) yield 表達(dá)式前后的現(xiàn)場,如:
yield 表達(dá)式所在的行列號(filename)
yield 表達(dá)式是執(zhí)行的第幾步(step)
yield 表達(dá)式字符串形式(fn)
執(zhí)行 yield 表達(dá)式所花費(fèi)的時(shí)間(take)
執(zhí)行 yield 表達(dá)式的結(jié)果(result)
yield 表達(dá)式當(dāng)時(shí)的現(xiàn)場(this),通過 this 你也可以添加自定義字段
原理:
重載 Module.prototype._compile,相當(dāng)于 hack 了 require,如果發(fā)現(xiàn)是 require 了配置里指定的的文件,則進(jìn)行下一步,否則返回原始代碼內(nèi)容,相關(guān)源代碼如下:
shimmer.wrap(Module.prototype, '_compile', function (__compile) { return function koaBreakpointCompile(content, filename) { if (!_.includes(filenames, filename)) { return __compile.call(this, content, filename); } ... };});
用 esprima 解析代碼,生成 AST。如:
'use strict';const Mongolass = require('mongolass');const mongolass = new Mongolass();mongolass.connect('MongoDB://localhost:27017/test');exports.getUsers = function* getUsers() { yield mongolass.model('users').create({ name: 'xx', age: 18 }); const users = yield mongolass.model('users').find(); this.body = users;};
會生成如下 AST:
Program { type: 'Program', body: [ Directive { type: 'ExpressionStatement', expression: Literal { type: 'Literal', value: 'use strict', raw: '\'use strict\'', loc: { start: { line: 1, column: 0 }, end: { line: 1, column: 12 } } }, directive: 'use strict', loc: { start: { line: 1, column: 0 }, end: { line: 1, column: 13 } } }, VariableDeclaration { type: 'VariableDeclaration', declarations: [ VariableDeclarator { type: 'VariableDeclarator', id: Identifier { type: 'Identifier', name: 'Mongolass', loc: { start: { line: 3, column: 6 }, end: { line: 3, column: 15 } } }, init: CallExpression { type: 'CallExpression', callee: Identifier { type: 'Identifier', name: 'require', loc: { start: { line: 3, column: 18 }, end: { line: 3, column: 25 } } }, arguments: [ Literal { type: 'Literal', value: 'mongolass', raw: '\'mongolass\'', loc: { start: { line: 3, column: 26 }, end: { line: 3, column: 37 } } } ], loc: { start: { line: 3, column: 18 }, end: { line: 3, column: 38 } } }, loc: { start: { line: 3, column: 6 }, end: { line: 3, column: 38 } } } ], kind: 'const', loc: { start: { line: 3, column: 0 }, end: { line: 3, column: 39 } } }, VariableDeclaration { type: 'VariableDeclaration', declarations: [ VariableDeclarator { type: 'VariableDeclarator', id: Identifier { type: 'Identifier', name: 'mongolass', loc: { start: { line: 4, column: 6 }, end: { line: 4, column: 15 } } }, init: NewExpression { type: 'NewExpression', callee: Identifier { type: 'Identifier', name: 'Mongolass', loc: { start: { line: 4, column: 22 }, end: { line: 4, column: 31 } } }, arguments: [], loc: { start: { line: 4, column: 18 }, end: { line: 4, column: 33 } } }, loc: { start: { line: 4, column: 6 }, end: { line: 4, column: 33 } } } ], kind: 'const', loc: { start: { line: 4, column: 0 }, end: { line: 4, column: 34 } } }, ExpressionStatement { type: 'ExpressionStatement', expression: CallExpression { type: 'CallExpression', callee: StaticMemberExpression { type: 'MemberExpression', computed: false, object: Identifier { type: 'Identifier', name: 'mongolass', loc: { start: { line: 5, column: 0 }, end: { line: 5, column: 9 } } }, property: Identifier { type: 'Identifier', name: 'connect', loc: { start: { line: 5, column: 10 }, end: { line: 5, column: 17 } } }, loc: { start: { line: 5, column: 0 }, end: { line: 5, column: 17 } } }, arguments: [ Literal { type: 'Literal', value: 'mongodb://localhost:27017/test', raw: '\'mongodb://localhost:27017/test\'', loc: { start: { line: 5, column: 18 }, end: { line: 5, column: 50 } } } ], loc: { start: { line: 5, column: 0 }, end: { line: 5, column: 51 } } }, loc: { start: { line: 5, column: 0 }, end: { line: 5, column: 52 } } }, ExpressionStatement { type: 'ExpressionStatement', expression: AssignmentExpression { type: 'AssignmentExpression', operator: '=', left: StaticMemberExpression { type: 'MemberExpression', computed: false, object: Identifier { type: 'Identifier', name: 'exports', loc: { start: { line: 7, column: 0 }, end: { line: 7, column: 7 } } }, property: Identifier { type: 'Identifier', name: 'getUsers', loc: { start: { line: 7, column: 8 }, end: { line: 7, column: 16 } } }, loc: { start: { line: 7, column: 0 }, end: { line: 7, column: 16 } } }, right: FunctionExpression { type: 'FunctionExpression', id: Identifier { type: 'Identifier', name: 'getUsers', loc: { start: { line: 7, column: 29 }, end: { line: 7, column: 37 } } }, params: [], body: BlockStatement { type: 'BlockStatement', body: [ ExpressionStatement { type: 'ExpressionStatement', expression: YieldExpression { type: 'YieldExpression', argument: CallExpression { type: 'CallExpression', callee: StaticMemberExpression { type: 'MemberExpression', computed: false, object: [Object], property: [Object], loc: [Object] }, arguments: [ [Object] ], loc: { start: [Object], end: [Object] } }, delegate: false, loc: { start: { line: 8, column: 2 }, end: { line: 11, column: 4 } } }, loc: { start: { line: 8, column: 2 }, end: { line: 11, column: 5 } } }, VariableDeclaration { type: 'VariableDeclaration', declarations: [ VariableDeclarator { type: 'VariableDeclarator', id: Identifier { type: 'Identifier', name: 'users', loc: [Object] }, init: YieldExpression { type: 'YieldExpression', argument: [Object], delegate: false, loc: [Object] }, loc: { start: [Object], end: [Object] } } ], kind: 'const', loc: { start: { line: 13, column: 2 }, end: { line: 13, column: 54 } } }, ExpressionStatement { type: 'ExpressionStatement', expression: AssignmentExpression { type: 'AssignmentExpression', operator: '=', left: StaticMemberExpression { type: 'MemberExpression', computed: false, object: ThisExpression { type: 'ThisExpression', loc: [Object] }, property: Identifier { type: 'Identifier', name: 'body', loc: [Object] }, loc: { start: [Object], end: [Object] } }, right: Identifier { type: 'Identifier', name: 'users', loc: { start: [Object], end: [Object] } }, loc: { start: { line: 14, column: 2 }, end: { line: 14, column: 19 } } }, loc: { start: { line: 14, column: 2 }, end: { line: 14, column: 20 } } } ], loc: { start: { line: 7, column: 40 }, end: { line: 15, column: 1 } } }, generator: true, expression: false, loc: { start: { line: 7, column: 19 }, end: { line: 15, column: 1 } } }, loc: { start: { line: 7, column: 0 }, end: { line: 15, column: 1 } } }, loc: { start: { line: 7, column: 0 }, end: { line: 15, column: 2 } } } ], sourceType: 'script', loc: { start: { line: 1, column: 0 }, end: { line: 15, column: 2 } } }
相關(guān)源代碼如下:
let parsedCodes;try { parsedCodes = esprima.parse(content, { loc: true });} catch (e) { console.error('cannot parse file: %s', filename); console.error(e.stack); process.exit(1);}
遍歷找到 YieldExpression 節(jié)點(diǎn),進(jìn)行以下包裝后生成 AST 替換掉原來的節(jié)點(diǎn)。
global.logger( this, function*(){ return YieldExpression }, YieldExpressionString, filename);
相關(guān)源代碼如下:
findYieldAndWrapLogger(parsedCodes);try { content = escodegen.generate(parsedCodes, { format: { indent: { style: ' ' } }, sourceMap: filename, sourceMapWithCode: true });} catch (e) { console.error('cannot generate code for file: %s', filename); console.error(e.stack); process.exit(1);}debug('file %s regenerate codes:\n%s', filename, content.code);
findYieldAndWrapLogger 作用就是遍歷 AST 將 YieldExpression 替換成用日志函數(shù)包裹后新的 YieldExpression 的 AST。
最后用 escodegen 將 AST 生成代碼(支持 soucemap,所以錯(cuò)誤棧對應(yīng)的行數(shù)是對的)。
核心:每個(gè)請求到來時(shí),生成(項(xiàng)目中本來就有 requestId 的話,可通過配置指定)一個(gè) requestId(uuid 類型)掛載到 this 上,這樣就可以通過 requestId 將日志串起來了。
特點(diǎn):可以記錄每個(gè)請求的流程(甚至可以查到某個(gè)用戶某個(gè)時(shí)間段的請求),及每一次請求的每一步(step)的現(xiàn)場及返回值,方便調(diào)試或查 bug,當(dāng)然只針對 yield 表達(dá)式。
app.js
'use strict';const koaYieldBreakpoint = require('koa-yield-breakpoint')({ files: ['./routes/*.js']});const koa = require('koa');const routes = require('./routes');const app = koa();app.use(koaYieldBreakpoint);routes(app);app.listen(3000, () => { console.log('listening on 3000');});
routes/index.js
'use strict';const route = require('koa-route');const users = require('./users');module.exports = function (app) { app.use(route.get('/users', users.getUsers));};
routes/users.js
'use strict';const Mongolass = require('mongolass');const mongolass = new Mongolass();mongolass.connect('mongodb://localhost:27017/test');exports.getUsers = function* getUsers() { yield mongolass.model('users').create({ name: 'xx', age: 18 }); const users = yield mongolass.model('users').find(); this.body = users;};
運(yùn)行:
DEBUG=koa-yield-breakpoint node app.js
終端打印如下:
example git:(master) DEBUG=koa-yield-breakpoint node app.jskoa-yield-breakpoint options: {"files":["./routes/*.js"],"nodir":true,"absolute":true,"filter":{"ctx":["state","params"],"request":["method","path","header","query","body"],"response":["status","body"]},"loggerName":"logger","requestIdPath":"requestId"} +0mskoa-yield-breakpoint matched files: ["/Users/nswbmw/node/koa-yield-breakpoint/example/routes/index.js","/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js"] +9mskoa-yield-breakpoint file /Users/nswbmw/node/koa-yield-breakpoint/example/routes/index.js regenerate codes:'use strict';const route = require('koa-route');const users = require('./users');module.exports = function (app) { app.use(route.get('/users', users.getUsers));}; +73mskoa-yield-breakpoint file /Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js regenerate codes:'use strict';const Mongolass = require('mongolass');const mongolass = new Mongolass();mongolass.connect('mongodb://localhost:27017/test');exports.getUsers = function* getUsers() { yield global.logger(this, function* () { return mongolass.model('users').create({ name: 'xx', age: 18 }); }, 'mongolass.model(\'users\').create({\n name: \'xx\',\n age: 18\n})', '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:8:2'); const users = yield global.logger(this, function* () { return mongolass.model('users').find(); }, 'mongolass.model(\'users\').find()', '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:13:16'); this.body = users;}; +15mslistening on 3000
可以看出,routes/users.js 被替換成了:
'use strict';const Mongolass = require('mongolass');const mongolass = new Mongolass();mongolass.connect('mongodb://localhost:27017/test');exports.getUsers = function* getUsers() { yield global.logger(this, function* () { return mongolass.model('users').create({ name: 'xx', age: 18 }); }, 'mongolass.model(\'users\').create({\n name: \'xx\',\n age: 18\n})', '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:8:2'); const users = yield global.logger(this, function* () { return mongolass.model('users').find(); }, 'mongolass.model(\'users\').find()', '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:13:16'); this.body = users;};
為什么要將 yield 表達(dá)式包裹成這樣呢?
global.logger( this, function*(){ return YieldExpression }, YieldExpressionString, filename);
而不是:
global.logger( this, function(){ return YieldExpression }, YieldExpressionString, filename);
或者:
global.logger( this, yield YieldExpression, YieldExpressionString, filename);
因?yàn)榭紤]到 YieldExpression 可能是以下情況導(dǎo)致 esprima 不能解析:
User.find(yield xxx)
User.find(http://this.xxx)
所以綜合考慮只能放到 GeneratorFunction 里,另外,不用:
global.logger( this, function*(){ return yield YieldExpression }, YieldExpressionString, filename );
是因?yàn)檫@樣會導(dǎo)致 esprima 解析死循環(huán),所以去掉了里面的 yield,在 logger 里 yield。
訪問 localhost:3000/users,終端打印出:
{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 404, body: undefined } }, type: 'start', step: 1, take: 0 }{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', step: 2, filename: '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:8:2', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 404, body: undefined } }, type: 'before', fn: 'mongolass.model(\'users\').create({\n name: \'xx\',\n age: 18\n})', result: undefined, take: 1 }{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', step: 3, filename: '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:8:2', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 404, body: undefined } }, type: 'after', fn: 'mongolass.model(\'users\').create({\n name: \'xx\',\n age: 18\n})', result: { result: { ok: 1, n: 1 }, ops: [ [Object] ], insertedCount: 1, insertedIds: [ undefined, 5810856182d0eea2f12030fd ] }, take: 7 }{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', step: 4, filename: '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:13:16', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 404, body: undefined } }, type: 'before', fn: 'mongolass.model(\'users\').find()', result: undefined, take: 1 }{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', step: 5, filename: '/Users/nswbmw/node/koa-yield-breakpoint/example/routes/users.js:13:16', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 404, body: undefined } }, type: 'after', fn: 'mongolass.model(\'users\').find()', result: [ { _id: 5810856182d0eea2f12030fd, name: 'xx', age: 18 } ], take: 4 }{ requestId: 'dad593c0-c4a1-4640-a00e-9ba0349cfd2f', timestamp: Wed Oct 26 2016 18:28:49 GMT+0800 (CST), this: { state: {}, request: { method: 'GET', path: '/users', header: [Object], query: [Object] }, response: { status: 200, body: [Object] } }, type: 'end', step: 6, take: 1 }
注意:type 是 ['start', 'beforeYield', 'afterYield', 'error', 'end'] 其中一種,take 單位是 ms。
start: 請求到來時(shí)第一次打點(diǎn)
beforeYield: 上一個(gè) yieldExpression 之后到這一個(gè) yieldExpression 之前
afterYield: 這個(gè) yieldExpression 開始到結(jié)束
error: 錯(cuò)誤日志,koa-yield-breakpoint 會記錄下來,然后再 throw 出去
end: 請求結(jié)束時(shí)打點(diǎn)
const koaYieldBreakpoint = require('koa-yield-breakpoint')({ files: ['./controllers/*.js', './middlewares/*.js'], store: require('./lib/koa_yield_breakpoint_logstash'), loggerName: '__logger', requestIdPath: 'header.reqid', yieldCondition: (filename, yieldExpression) => { // yield next 不包裹 logger if (yieldExpression === 'next') { return { wrapYield: false, deep: false }; } if (yieldExpression.match(/^db\.sequelize\.transaction/)) { return { wrapYield: true, deep: false }; } }});...app.use(koaYieldBreakpoint);
koa-yield-breakpoint 主要用在 Koa 的 routes、controllers 或 middlewares 文件。co-yield-breakpoint 主要用來給普通 yield 表達(dá)式(如 lib 或 models 下的文件)打點(diǎn)(沒有 requestId,所以 log 是零散的,這里只用來計(jì)算平均耗時(shí)),實(shí)現(xiàn)原理跟 koa-yield-breakpoint 類似,這里不再贅述。
foo.js
'use strict';const co = require('co');const Mongolass = require('mongolass');const mongolass = new Mongolass();mongolass.connect('mongodb://localhost:27017/test');co(function* () { yield mongolass.model('users').create({ name: 'xx', age: 18 }); const users = yield mongolass.model('users').find(); console.log('users: %j', users);}).catch(e => console.error(e.stack));
終端打印如下:
{ "filename": "/Users/nswbmw/node/co-yield-breakpoint/example/foo.js:9:2", "timestamp": "2016-11-10T13:53:33.053Z", "fn": "mongolass.model('users').create({\n name: 'xx',\n age: 18\n})", "result": { "result": { "ok": 1, "n": 1 }, "ops": [{ "name": "xx", "age": 18, "_id": "58247bdd3fb205dbad5418a0" }], "insertedCount": 1, "insertedIds": [null, "58247bdd3fb205dbad5418a0"] }, "take": 55}{ "filename": "/Users/nswbmw/node/co-yield-breakpoint/example/foo.js:14:16", "timestamp": "2016-11-10T13:53:33.060Z", "fn": "mongolass.model('users').find()", "result": [{ "_id": "58247bdd3fb205dbad5418a0", "name": "xx", "age": 18 }], "take": 7}users: [{ "_id": "58247bdd3fb205dbad5418a0", "name": "xx", "age": 18}]
真實(shí)用例:
require('co-yield-breakpoint')({ files: ['./lib/**/*.js', './models/**/*.js'], store: require('./lib/koa_yield_breakpoint_logstash'), loggerName: '__coLogger'});
store 參數(shù)最好自己定義,可以是一個(gè) object 可以是一個(gè)實(shí)例,只需要有一個(gè) save 方法即可。在 save 方法內(nèi)可做一些自定義,比如我在 koa_yield_breakpoint_logstash.js 里添加了一些字段(如:url),同時(shí)對正常請求日志過濾了一些字段,對錯(cuò)誤請求日志添加了一些字段方便 bug 追蹤。如下:
if (ctx.request) { record.request = _.pick(ctx.request, 'headers', 'body', 'query', 'ips');}
線上 API 在上了 koa-yield-breakpoint + co-yield-breakpoint 這個(gè)組合后,我又讓運(yùn)維在 Kibana 作了以下 3 種圖表:
fn 的平均耗時(shí),由高到低(根據(jù) filename 聚合, 因?yàn)?filename 中包含行列數(shù),唯一)
每個(gè)接口的平均耗時(shí),由高到低(根據(jù) url 聚合)
所有 requestId 耗時(shí)排行(對應(yīng)這個(gè) requestId 所有日志 take 值累加),由高到低
看起來效果良好,但后來運(yùn)維告訴我日志打的太多了,想想也是,一個(gè)邏輯復(fù)雜的 api 的請求從開始到結(jié)束,可能執(zhí)行了幾十次 yield。。討論了下,決定只采樣 10% 日志。如下:
exports.save = function save(record, ctx) { ... if (Math.random() > 0.1 && !record.error) {// co-yield-breakpoint 的 record.error 總為 undefined return; } ...};
注意:這里 koa-yield-breakpoint 和 co-yield-breakpoint 的日志都隨機(jī)取樣了 10%, co-yield-breakpoint 的日志倒無所謂,但 koa-yield-breakpoint 的日志則被打散了,也就是說你基本上不可能通過一個(gè) requestId 查到完整的請求日志鏈了(step 是連續(xù)的)。另一種方案是在 koa-yield-breakpoint 中間件之前給 this 添加一個(gè)屬性:
function *(next) { this.shouldSaveLog = Math.random() < 0.1; yield next;};
然后再 save 方法內(nèi)判斷一下 shouldSaveLog 決定是否記錄這條日志,上面代碼改為:
exports.save = function save(record, ctx) { ... if (!ctx.shouldSaveLog && !record.error) { return; } ...};
這樣也實(shí)現(xiàn)了只記錄 10% 的請求日志,但這種方案是:這個(gè)請求的所有日志要么都記,要么都不記(錯(cuò)誤日志除外,錯(cuò)誤日志一定記)。
我們正在招聘!
[北京/武漢] 石墨文檔 做最美產(chǎn)品 - 尋找中國最有才華的工程師加入