The form of performance testing strongly depends on the research scope. Testing application performance is only a general name for a process that should be narrowed down to a single parameter or a set of well-defined properties. Using the right tool matters just as much as making appropriate assumptions and choosing the right model, as otherwise the measured values may not match the real application characteristics.
The analyses done in this chapter follow the guidelines defined in~\cite{gae_best_practises_plus_load_tests} for testing applications that use Google App Engine:
\begin{itemize}
\item{Use the production system. Most web frameworks provide a development environment, including a server that can easily be run on the local machine. What speeds up development on the one hand cannot be used for production purposes on the other: this environment differs significantly from the deployment system and cannot give much information about the final system performance.}
\item{Gradual ramp-up. The goal of load tests is to probe the system reaction to certain sets of input parameters, therefore it is strongly desirable to make them resemble realistic cases. Besides, the resources of Google App Engine are granted only when the application needs them, which motivates a warm-up just before the proper test takes place.}
\item{Realistic load. It seems pointless to run tests for a situation that the system is unlikely to reach.}
\end{itemize}
The following part of this subsection presents a graphical illustration of selected performance metrics of the SmartNotes application running on Google App Engine. Taking into account parameters such as the rate of opened connections and the achieved request rate, the analysis will first focus on the variation of the response rate and will conclude with a comparison of the average response time between different techniques: an internationalized dynamic page, a cached page, and the same page served as static content. The first is the most classical case, where some parts of the page may differ depending on state; the second makes use of the highly popular caching technique; and the last uses special infrastructure devoted solely to serving static content. All of these will be discussed more widely when analysing the corresponding results.
The view of the dashboard of the SmartNotes application shown in Figure~\ref{fig:sn_dash_view} exposes a plot of the request rate timeline for two different scenarios. The first, marked as~\textit{(a)}, illustrates a regular usage case with no anomalies, whereas the second,~\textit{(b)}, presents a case where the outgoing bandwidth limit was exceeded. This is the most basic way of tracking the application performance and status. Along with the several other tools discussed in Section~\ref{sec:gae_general}, the admin interface is truly functional yet easy to use. It also provides a logs browser, where the user can place application-specific information on several logging levels.
\begin{figure}[ht]
\begin{center}
\subfigure[\textbf{Regular situation with average of about one\newline request per second}.]{\label{fig:dash_normal}\includegraphics[scale=0.18]{img/DASH_Regular_stats.png}}
\subfigure[\textbf{Application reaching the outgoing bandwidth limit on heavy traffic situation with the maximum of 170 requests per second}.]{\label{fig:dash_out_of_bandwith}\includegraphics[scale=0.2]{img/DASH_Out_of_bandwith.png}}
\end{center}
\caption{The view of the SmartNotes application dashboard showing the request rate timeline.}
\label{fig:sn_dash_view}
\end{figure}
The system of quotas introduced by Google provides two phases of limitations: daily limits and, for the most common resources, narrower per-minute quotas. The second is a protection against high load peaks and against any malicious or testing software that could otherwise leave the target application out of resources in just a couple of minutes. That is exactly the case encountered during the tests: responses with the \texttt{503 Service Unavailable} status code arrived when the short-term quota was crossed, and \texttt{403 Forbidden} responses after running over the daily limits.
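The distinction between the two quota phases can be mirrored in a small helper for a load-test client. This is an illustrative sketch based only on the status codes described above; the function name and messages are hypothetical, not part of any GAE API:

```python
# Hypothetical helper for a load-test client, based on the status codes
# observed during the tests: 503 marks a crossed per-minute quota,
# 403 marks an exhausted daily quota.
def classify_quota_response(status_code):
    if status_code == 503:
        return "short-term quota exceeded: back off and retry"
    if status_code == 403:
        return "daily quota exhausted: abort the test run"
    if status_code == 200:
        return "ok"
    return "unexpected status"
```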
The author chose Open Source tools, \texttt{httperf} from HP and the \texttt{autobench} wrapper, to carry out the tests on the main SmartNotes web page. These tools were chosen over others researched\footnote{That list includes tools like \texttt{siege} and \texttt{ab}, which are popular Open Source load testing programs.} due to their support for a gradual increase of the connection rate and their extended output. Each of the tests whose results are presented in Figures~\ref{fig:sm_benchmark_normal}, \ref{fig:sm_benchmark_cache}, \ref{fig:sm_benchmark_static} and \ref{fig:sm_resp_time_comapre} uses the same set of input parameters:
\begin{itemize}
\item{500 connections per test. Combined with the twenty-second connection timeout, this allowed reaching up to 400 concurrent connections.}
\item{3 requests per connection. This implies a total number of 1500 requests per test. This parameter was chosen with respect to the short-term quota limit of $7,400$ requests per minute.}
\item{The generated connection rate started at 25 connections per second and was gradually increased by 10 with each test, up to a rate of 145 connections per second. That allowed 12 tests in each test series.}
\item{Each test is identified by the product of the generated connection rate and the number of requests per connection, used as the horizontal axis label. These values correspond to the theoretical request rate, i.e. the client-side request rate.}
\end{itemize}
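The axis labels described in the last item reduce to a one-line calculation. The helper name below is illustrative; the numbers come from the parameter list above:

```python
# The client-side ("theoretical") request rate used as the x-axis label:
# generated connection rate multiplied by the requests per connection.
CALLS_PER_CONNECTION = 3

def theoretical_request_rate(conn_rate, calls=CALLS_PER_CONNECTION):
    return conn_rate * calls

print(theoretical_request_rate(25))   # first test in the series
print(theoretical_request_rate(145))  # last test in the series
```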
The first technique tested was a classical dynamic page, which used system resources to determine the language and present internationalized content. For greater flexibility, and to minimise the repetition of common parts, the author decided to use template inheritance. This basically allows using template blocks in a way reminiscent of object-oriented programming. However, both features cost CPU time and, in the case of a frequently requested page, seem not to be the most suitable solution. The most common approaches to this problem are represented by the next two techniques: using a cache or serving the page as static content. The most interesting part is to observe the differences in serving the same content using each of those techniques. One parameter that has a great impact on the others is the server-side connection rate, which in Figures~\ref{fig:sm_benchmark_normal}, \ref{fig:sm_benchmark_cache} and \ref{fig:sm_benchmark_static} is drawn using green lines. In the case of the dynamic page, the average connection rate became saturated at around 45 connections per second, which gives a request rate of 135 requests per second.
\begin{figure}[ht]
\caption{Response rate statistics for a dynamic page served using the Django framework.}
\label{fig:sm_benchmark_normal}
\end{figure}
This number is close to the previously mentioned short-term limit of $7,400$ requests per minute. The tests were repeated several times in order to come as close as possible to this quota limit while still receiving replies with the \texttt{200 OK} response code. Additionally, it is interesting to observe the distribution of the minimum, average and maximum response rate as the average request rate increases. In the ideal case, all of these factors would follow identical linear curves.
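The three response-rate statistics mentioned above are simple aggregates over the per-interval reply-rate samples that \texttt{httperf} reports. A sketch, with made-up placeholder sample values rather than measured data:

```python
# httperf reports a reply-rate sample for every measurement interval;
# the plotted minimum, average and maximum are aggregates over those
# samples. The values below are illustrative placeholders, not test data.
samples = [128.7, 130.2, 133.9, 134.8, 135.1]  # replies per second

rate_min = min(samples)
rate_avg = sum(samples) / len(samples)
rate_max = max(samples)

print(rate_min, round(rate_avg, 2), rate_max)
```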
The second technique is caching. It is a solution bundled into many web frameworks, including Django, which, as noted in Section~\ref{sec:gae_general}, is supported by GAE. The concept of this technique is based on storing data in a fast-accessible space called a cache, with the help of which repeated calls can shorten the request path. A simple usage of a caching system could look as presented in Listing~\ref{code:py_cache}. The function \texttt{get\_from\_cache} returns the data calculated by the function \texttt{some\_time\_consuming\_calculations}, which next becomes placed under the control of \texttt{memcache}\footnote{Memcached is a distributed, memory-based caching system. It is only one of the possible caching backends, alongside file-based or database-based storage engines; however, it is the most widely used thanks to its speed and support for a multi-machine work mode.} for a time defined by the \texttt{expire\_time} variable. Subsequent calls coming within one hour of the first calculation of \texttt{data} will not call the \texttt{some\_time\_consuming\_calculations} function but will return the value stored under the \texttt{key} name.
\lstset{language=Python,caption=Simple cache usage example in Python.,label=code:py_cache,
basicstyle=\scriptsize, % the size of the fonts that are used for the code
showspaces=false, % show spaces adding particular underscores
showstringspaces=false, % underline spaces within strings
showtabs=false, % show tabs within strings adding particular underscores
tabsize=2, % sets default tabsize to 2 spaces
captionpos=b, % sets the caption-position to bottom
breaklines=true, % sets automatic line breaking
breakatwhitespace=false, % sets if automatic breaks should only happen at whitespace
escapeinside={\%*}{*)} % if you want to add a comment within your code
}
\lstinputlisting{src/samples/py_cache.py}
When \texttt{some\_time\_consuming\_calculations} is truly resource consuming, this technique can bring substantial savings. In the case of the test results for the cached version of the page, the plot in Figure~\ref{fig:sm_benchmark_cache} only slightly differs from the curves of the dynamic version in Figure~\ref{fig:sm_benchmark_normal}. The HTTP headers shown in Listing~\ref{code:resp_headers} can help to avoid subsequent calls from a single user by using the browser cache mechanisms. The test client ignored them, as it was supposed to test the worst-case scenario in which all of the opened connections come from different users whose browsers ignore the cache-specific headers, including \texttt{Expires}, \texttt{Vary}, \texttt{Last-Modified}, \texttt{ETag} and \texttt{Cache-Control}.
\lstset{language=HTML,caption=Server response headers for cached content.,label=code:resp_headers,
basicstyle=\scriptsize, % the size of the fonts that are used for the code
showspaces=false, % show spaces adding particular underscores
showstringspaces=false, % underline spaces within strings
showtabs=false, % show tabs within strings adding particular underscores
tabsize=2, % sets default tabsize to 2 spaces
captionpos=b, % sets the caption-position to bottom
breaklines=true, % sets automatic line breaking
breakatwhitespace=false, % sets if automatic breaks should only happen at whitespace
escapeinside={\%*}{*)} % if you want to add a comment within your code
}
\lstinputlisting{src/samples/headers_cache.txt}
On the other hand, this does not mean that using caching did not bring anything. Because requests to the cache are normally much cheaper than requests that involve server-side operations, the binding quota limits are less restrictive and allow for $8,640,000$ API calls a day. The usage of \texttt{memcached} did help to reduce the CPU usage, which is a big advantage of this technique. However, it should be taken into account that memory caching consumes additional memory, makes the system a little more complex and is not by itself a fault-tolerant storage. Besides, one of the biggest problems regarding a cache is expiring its content. If some page component changes on some of the pages, the easiest way out is to flush the entire cache content. For an application whose profile has a similar number of read and write operations, caching will be much harder to implement; this solution suits best applications with a high read rate.
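The coarse invalidation strategy just described (flush everything whenever a shared component changes) can be sketched in a few lines. The class below is a self-contained illustration, not the GAE API; the \texttt{flush\_all} name mirrors the corresponding memcache operation:

```python
# Minimal sketch of the coarse invalidation strategy described above:
# when any shared page component changes, the whole cache is flushed.
# Cheap to implement, wasteful under write-heavy workloads, which is
# why this approach suits read-mostly applications best.
class SimpleCache:
    def __init__(self):
        self._store = {}

    def get(self, key):
        return self._store.get(key)

    def add(self, key, value):
        self._store[key] = value

    def flush_all(self):
        # simplest possible invalidation: drop every entry
        self._store.clear()

cache = SimpleCache()
cache.add("page:home", "<html>...</html>")
cache.flush_all()              # a shared component changed somewhere
print(cache.get("page:home"))  # None: the next request re-renders the page
```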
\begin{figure}[ht]
\caption{Response rate statistics for a cached page, realised using the Google memcache API.}
\label{fig:sm_benchmark_cache}
\end{figure}
The last approach tested was to serve the entire page as static content. It should be noted that the infrastructure used for this purpose differs strongly from the application server: it takes advantage of storing stateless content with minimum server overhead. The implementation details belong to the Google company; however, there exist Open Source projects like \texttt{lighttpd} or \texttt{nginx} which share the same idea. The differences can easily be observed in Figure~\ref{fig:sm_benchmark_static}. Until reaching the level of 235 requests per second, the response rate curve followed the server-side request rate curve with low variation, not crossing $5\%$ of the average value. In this case the CPU usage was even lower than with caching and did not require any additional memory, as \texttt{memcached} does. On the other hand, it requires much more work to integrate with the application in place of the prior dynamic content; that task might sometimes even be impossible, or result in a bad final experience when not done right. Thus it is definitely an interesting option for applications that are stateless or use lots of different media files.
\begin{figure}[ht]
\caption{Response rate statistics for a static page served by the static content server.}
\label{fig:sm_benchmark_static}
\end{figure}
An interesting comparison is presented in Figure~\ref{fig:sm_resp_time_comapre}, which collates the various techniques by focusing on the average response time. It is one of the parameters that have a huge impact on the user experience.
\caption{Comparison of average response times among various cases including serving dynamic content, caching response in memory or making use of static content server.}
\label{fig:sm_resp_time_comapre}
\end{figure}