This is a guest repost by Leandro Moreira.
When we’re interested in scalability, we usually look for links, explanations, books, and references. This mini article links to the references I think might help you on this journey.
DISCLAIMER:
You don’t need N physical machines to build and test a cluster or a highly scalable system; nowadays you can use Vagrant and spin up N virtual machines easily.
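For instance, a minimal Vagrantfile sketch like the one below brings up three small Linux VMs on a private network. The box name, IP range, and memory size are just assumptions; adjust them to your setup.

```ruby
# Vagrantfile -- a minimal sketch: a few small VMs on a private network
Vagrant.configure("2") do |config|
  (1..3).each do |i|
    config.vm.define "node#{i}" do |node|
      node.vm.box = "ubuntu/trusty64"                      # any box you like
      node.vm.hostname = "node#{i}"
      node.vm.network "private_network", ip: "192.168.50.#{10 + i}"
      node.vm.provider "virtualbox" do |vb|
        vb.memory = 512                                    # keep them small so N of them fit on a laptop
      end
    end
  end
end
```

Run `vagrant up` and you have a tiny cluster to practice on.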
THE REFERENCES:
Now that you know you can empower yourself with virtual servers, I challenge you to not only read these links but put them into practice.
- First of all, motivate yourself by watching this tutorial that uses Node.js + nginx with static caching, load balancing, and testing, all in 7 minutes (a rough nginx sketch of the same idea appears right after this list).
- Add these words and their meanings to your vocabulary: scalability, failover, single point of failure (SPOF), sharding, replication, and load balancing, even if you don’t understand them completely yet.
- For a general overview of scalable systems and the reasons behind them, I strongly recommend reading Scalable Web Architecture and Distributed Systems. It is a great introduction.
- After you get the general idea, you can move on to understanding how to use a load balancer and what decisions and problems you will face. Then you can try to run HAProxy and make sure it is not a single point of failure either (see the HAProxy sketch after this list).
- Dare yourself to serve 3 million requests per second, but for this task you’ll need to generate 3 million requests, fine-tune your web server, and finally scale and test it.
- Your application is now scalable; next you need to scale your databases. They are a very important part of your application. Here I recommend you read at least how MongoDB scales with sharding and replication, and how Cassandra achieves almost linear scalability and makes it easy to add nodes to the cluster.
- Now that your application and database are scalable and fault tolerant, it’s good to spare your servers unnecessary workload and also make responses to the user faster. Learn that a good request is the one that never reaches the “real server” at all.
- If we deploy the whole infrastructure within a single data center, we have yet another SPOF: since all the servers are in the same place, a natural disaster or even a simple power outage can take everything down. The good news is that Cassandra supports multiple data centers out of the box (see the CQL sketch after this list), and you can see how Google faces this issue. If your user is in Brazil, don’t make their requests travel farther than they need to, and remember that even in the best case we still have latency.
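To make the first item above concrete, here is a rough nginx sketch of load balancing two application servers and caching static responses. The ports, paths, and cache sizes are assumptions for illustration, not values taken from the tutorial.

```nginx
# nginx.conf (http block) -- a sketch, not a tuned production config
proxy_cache_path /var/cache/nginx levels=1:2 keys_zone=static_cache:10m
                 max_size=100m inactive=60m;

upstream app_servers {
    least_conn;                  # send each request to the backend with the fewest connections
    server 127.0.0.1:8080;       # e.g. two Node.js instances
    server 127.0.0.1:8081;
}

server {
    listen 80;

    location / {
        proxy_pass http://app_servers;
        proxy_set_header Host $host;
    }

    location /static/ {
        proxy_pass http://app_servers;
        proxy_cache static_cache;                        # repeated requests are served from the cache
        proxy_cache_valid 200 10m;
        add_header X-Cache-Status $upstream_cache_status;
    }
}
```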
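Similarly, a minimal HAProxy sketch for the load-balancer item could look like the following; the server IPs and health-check path are assumptions. To keep the balancer itself from becoming a SPOF, one common approach is to run two HAProxy boxes sharing a floating IP with keepalived (VRRP).

```
# haproxy.cfg -- a minimal sketch: round-robin over two web servers with health checks
defaults
    mode http
    timeout connect 5s
    timeout client  30s
    timeout server  30s

frontend www
    bind *:80
    default_backend web_servers

backend web_servers
    balance roundrobin
    option httpchk GET /
    server web1 192.168.50.11:80 check
    server web2 192.168.50.12:80 check
```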
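And for the multi-data-center point, this is roughly what a Cassandra keyspace replicated across two data centers looks like in CQL. The keyspace name, data center names, and replication factors are only placeholders and must match what your cluster’s snitch reports.

```sql
-- CQL sketch: keep 3 replicas of every row in each of two data centers
CREATE KEYSPACE myapp
  WITH replication = {
    'class': 'NetworkTopologyStrategy',
    'dc1': 3,
    'dc2': 3
  };
```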
Good questions to test your knowledge:
- Why scale? How do people usually do it?
- How do you deal with user sessions kept in RAM when there are N servers? How does the load balancer know which servers are up? How does it decide which server to send a request to?
- Isn’t the load balancer itself another SPOF? How can we provide failover for it?
- Isn’t my OS limited to 64K ports? Is Linux capable of handling that load out of the box?
- How does MongoDB handle failover and high scalability? How about Cassandra? How does Cassandra reshard data when a new node joins the cluster?
- What is a cache lock? What caching policies should I use?
- How can a single domain have multiple IP addresses (e.g. $ host www.google.com)? What is BGP? How can we use DNS or BGP to serve users geographically?
Bonus round: sometimes simple things are enough to reach your goal, even for something like an A/B test.
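One simple way to do it with the nginx setup sketched earlier is the split_clients directive, which deterministically splits traffic between two versions of an application. The upstream names and the 50/50 split below are just placeholders.

```nginx
# nginx sketch: send ~50% of clients to version A, the rest to version B
split_clients "${remote_addr}" $ab_backend {
    50%     version_a;
    *       version_b;
}

upstream version_a { server 127.0.0.1:8080; }
upstream version_b { server 127.0.0.1:8081; }

server {
    listen 80;
    location / {
        proxy_pass http://$ab_backend;
    }
}
```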
Please let me know about any mistakes; I’ll be happy to fix them.