0.0
No commit activity in last 3 years
No release in over 3 years
SamplingProf is a profiling tool that operates by sampling your running thread stacktrace. The result is statistical approximation, but it allows your code to run near full speed
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

>= 0.9.2, ~> 0.9
 Project Readme

SamplingProf

SamplingProf is a statistical profiler or sampling profiler that operates by sampling your running thread stacktrace. The result is statistical approximation, but it allows your code to run near full speed. It is optimized for JRuby.

Supports JRuby both 1.8 and 1.9 mode, and CRuby 1.9+.

Quick start

For single thread profiling, start with default options:

prof = SamplingProf.new
prof.profile do
  ... your code ...
end
at_exit { prof.terminate }

After profiling finished, the output will be write to file SamplingProf::DEFAULT_OUTPUT_FILE

Options

SamplingProf class initializer takes 1 argument:

  1. sampling interval: seconds

SamplingProf class also takes block as another option to overwrite default output handler, the default output handler will write output data to a local file defined by output_file attribute, which is default to SamplingProf::DEFAULT_OUTPUT_FILE

Notice, for the performance and thread-safe concerns, the output handler will not be called in the context of the thread start profiling.

When a SamplingProf is initialized, a thread will be started to handle sampling process. You need call SamplingProf#terminate to shutdown the sampling thread after everything is done.

Sampling interval

This is an interval to control how frequent SamplingProf should take sample of target thread stacktrace. The default value is 0.1 seconds, and is designed for general Rails web request profiling. Adjust this parameter for your case, so that sampling process does lowest overhead for your program.

Output data format

Output data is plain text, so that you can see the result and do analysis by yourself. Checkout sampling_prof.rb report method for how to generate report from output data file.

Output data is divided into a number of chunks. Chunks are separated by 2 "\n" chars.

List of chunks in order:

  1. runtime
  2. call element id map
  3. counts
  4. call graph

runtime chunk

A number represents time of collecting data. It should be only one line. The time unit is ms. Runtime can be used to compute an estimated runtime of a call element.

When it's multithreading mode, the runtime result is a little bit tricky. But think about we queuing up all threads, and put them into single thread, the time doing sampling in this one single thread is the runtime we output here.

call element id map

For storage efficiency, we build this map for counts and call graph chunks to use id instead of a call element string. One line one call element string and its id, separated by a comma.

call element format

A call element represents a line of code while program running. It has 3 components:

  1. file path
  2. line number
  3. method name

A call element string is a join of 3 components with char ":", for example: ./lib/sampling_prof.rb:5:initialize

counts

SamplingProf collects count of each call element at runtime. There are 2 type of count:

  1. self count
  2. total count

Same with call element id map, we use comma to separate data, and one line represents one call element data. Here we use call element id instead of call element string to reference the call element. The line looks like this: [call element id],[self count],[total count]

call graph

SamplingProf collects counts of function calls. One function call is A call element calls B call element, and B call element calls A call element is considered as another function call.

The call graph stores the counts of function calls. Every line is one function call data. the line looks like this: [call element id1],[call element id2],[count]

Direct recursive calls are recorded as: [call element id1],[call element id1],[count]

Indirect recursive calls are ignored as function call only have direct call info. Hence the data maybe odd when there are indirect recursive calls.