Just relax, take it easy..

When I switched from Android to iOS, I could not find a lightweight but handy habit-tracking app, so I decided to make one myself :)

The project's name comes from a game called "Against the Storm" (I have spent over 100 hours in it; highly recommended). My favorite species in this city builder is the beaver, and I hope this web app works like a beaver to save the precious moments of your fleeting life.

Recently, we discovered some unexpected autoscaling EC2_INSTANCE_TERMINATE events in our Scala system: "instance was taken out of service in response to an ELB system health check failure".

The error logs pointed to "Too many open files" as the cause. It led to failed DNS resolution, which in turn caused errors when accessing the AWS endpoint and finally made the server hang.
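To make the "Too many open files" condition concrete, here is a minimal sketch of how a process could compare its open file descriptors against its soft limit. It is written in Python rather than in the Scala service itself, it assumes a Linux or macOS host, and the names `open_fd_count`, `fd_headroom`, and `warn_ratio` are hypothetical, chosen only for illustration.

```python
import os
import resource


def open_fd_count() -> int:
    """Count file descriptors currently open in this process.

    Assumes a Linux (/proc/self/fd) or macOS (/dev/fd) host. Listing the
    directory briefly opens one extra descriptor, so the count is approximate.
    """
    for fd_dir in ("/proc/self/fd", "/dev/fd"):
        if os.path.isdir(fd_dir):
            return len(os.listdir(fd_dir))
    raise RuntimeError("no fd directory available on this platform")


def fd_headroom(warn_ratio: float = 0.8):
    """Return (used, soft_limit, near_limit) for the current process.

    warn_ratio is an illustrative threshold, not a recommended value.
    """
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    used = open_fd_count()
    unlimited = soft == resource.RLIM_INFINITY
    near_limit = (not unlimited) and used >= warn_ratio * soft
    return used, soft, near_limit


if __name__ == "__main__":
    used, soft, near_limit = fd_headroom()
    print(f"open fds: {used}, soft limit: {soft}, near limit: {near_limit}")
```

Once the soft limit is reached, every new socket fails to open, which is why DNS lookups and outbound HTTP calls start failing together.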

A system never goes down without "changes", e.g. a code release, a traffic overload, or an external dependency outage, ...

Among all the kinds of change above, human changes are responsible for over 80% of incidents, since humans are not machines and make mistakes all the time :)

So when planning a change request for production operation (without a perfect and automated pipeline), how can we leverage strategies to minimize the risk and impact on our customers?

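Home network quality, like water, electricity, and gas, is essential to everyday happiness. On a whim, I wrote a Prometheus exporter for the HTTP proxy that serves as my home network hub, and set up an observability dashboard for it.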

I took the opportunity to learn how to write a custom exporter, and how the rate/irate functions in PromQL work under the hood.
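As a rough illustration of the difference between the two PromQL functions, here is a simplified sketch in Python. It only models the core idea: `rate` averages the counter increase over the whole window, while `irate` looks at the last two samples. It deliberately omits the extrapolation to window boundaries that Prometheus performs, and the names `simple_rate` and `simple_irate` are hypothetical.

```python
from typing import List, Tuple

# (timestamp in seconds, counter value)
Sample = Tuple[float, float]


def _increase(samples: List[Sample]) -> float:
    """Sum the counter increase across samples, treating a drop as a reset."""
    total = 0.0
    for (_, prev), (_, cur) in zip(samples, samples[1:]):
        # After a reset the counter restarts from zero, so the new value
        # itself is counted as the increase for that step.
        total += cur - prev if cur >= prev else cur
    return total


def simple_rate(samples: List[Sample]) -> float:
    """Average per-second increase over all samples in the window."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    elapsed = samples[-1][0] - samples[0][0]
    return _increase(samples) / elapsed


def simple_irate(samples: List[Sample]) -> float:
    """Per-second increase computed from the last two samples only."""
    return simple_rate(samples[-2:])


if __name__ == "__main__":
    window = [(0, 0), (15, 30), (30, 60), (45, 150), (60, 180)]
    print(simple_rate(window))   # 3.0
    print(simple_irate(window))  # 2.0
```

With the same window, the averaged rate smooths over the burst between 30s and 45s, while the instant rate only reflects the most recent interval.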

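I remember that when I was learning to program in college, even when the book provided a complete solution, I would still type it out on my own computer, touching every line of code to gain the truest and most thorough understanding.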

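By a twist of fate I joined Autodesk in Shanghai and got to experience a foreign company first-hand for three months, so here are some of my honest impressions. I hope they offer a little reference and help, whether for SRE work or for career choices :)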

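Friends who know me may be aware that I recently changed jobs by a twist of fate (going from "大牙" at an internet company to "Henry" at a foreign company :))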

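It happens to be my new company's week-long Christmas break (Recharge Days), so let's take this chance to look back on 2023, a year as calm as the sea on the surface yet full of undercurrents.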

I have recently been researching an internal tool to support resiliency testing, e.g. injecting network delay on dependencies such as redis/db/.. This helps validate a service's ability to handle and recover from unexpected network disruptions or delays when accessing external dependencies.

TCP layer network traffic can easily be classified and shaped using a Linux built-in tool called TC (traffic control).

But how can we hijack and manipulate encrypted outbound HTTPS traffic?

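After buying the titanium iPhone Pro in September, I could not wait to unbox it, and went through my well-practiced iCloud backup and restore. However, when I saw the home screen and layout faithfully restored, a feeling of blandness welled up. For a moment, it felt as if I had never changed phones at all! So.. in the end I decided to return it.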

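Before I knew it, I owned more than 10 Apple devices, not counting my family's.. With its powerful ecosystem, Apple has successfully locked its users in. For example, a phone or tablet can act as a TV remote, HomeKit can trigger automation scenes when no one is home, a watch can automatically unlock a computer, and so on. These details are indeed very convenient, but I increasingly dislike the feeling of being tied down and held hostage.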