Java IoT Authors: Liz McMillan, Elizabeth White, Yeshim Deniz, Zakia Bouachraoui, Pat Romanski

Related Topics: Java IoT

Java IoT: Article

Know Your Worst Friend, the Garbage Collector

It can make or break performance

The first one is called incremental garbage collection. As its name suggests, this mode does a bit of major collecting every time a minor collection is run. Global performance suffers from this execution mode but you get rid of those nasty pauses caused by major collections. Incremental garbage collection also tends to leave the heap in a fragmented state. So it's a good idea to limit its use to applications with long-lived objects. It can help you in a subtle way to obtain more responsive Swing UIs.

The second mode is a parallel GC using many threads at once, at least one per processor, to do minor collections. If your target computer has less than three processors, it is unlikely you will ever notice a difference. This option is particularly well suited to heavy servers running applications spawning many short-lived objects like web servers.

Finally yet importantly, a concurrent GC can do incremental major collection without interrupting the application for long periods of time. A concurrent GC can also use parallel execution for minor collections. According to Sun Microsystems, this execution mode gets good results with interactive applications running on a single-processor computer. To choose among these three modes and set them up, you need to use the following command-line flags:

  • -Xincgc activates incremental garbage collection.
  • -XX:+UseParallelGC activates the parallel GC. The number of threads used to do minor collections is defined by the -XX:ParallelGCThreads=<a number> flag.
  • -XX:+UseConcMarkSweepGC activates the concurrent GC. Parallel minor collections can be activated as well by using the -XX:+UseParNewGC flag.
To measure the impact of your choice on your application, you'll want to use the –verbose:gc command-line flag. You'll then get a GC activity trace on the standard output. Redirecting the standard output to a file and running the following Python script will let you draw very useful charts as shown in Figure 2.

This script can be used by specifying a file name as the first parameter or by piping it directly to the Java command.

#!/usr/bin/env python
# -*- coding: ISO-8859-1 -*-

import fileinput
import re

print "%s\t%s\t%s\t%s" % ("Minor", "Major", "Alive", "Freed")

for line in fileinput.input():
match = re.match("\[(Full )?GC (\d+)K->(\d+)K\((\d+)K\), ([^ ]+) secs]", line)
if match is not None:
minor = match.group(1) == "Full " and "0.0" or match.group(5)
major = match.group(1) == "Full " and match.group(5) or "0.0"
alive = match.group(3)
freed = int(match.group(2)) - int(alive)
print "%s\t%s\t%s\t%s" % (minor, major, alive, freed)

The result is a CSV document using tabs as separators and containing four columns of data. The first one gives the time consumed by the GC to do a minor collection, the second one gives the same information for a major collection, the third one indicates the amount of memory used by the application after the execution of the GC and the last one gives the amount of memory reclaimed by the last collection. (This script was inspired by an AWK script written by Ken Gottry in the article Pick up performance with generational garbage collection published on javaworld.com.)

The chart presents the running time of each collection performed by the GC during the application life cycle. This chart was generated with the following command line and OpenOffice.org:

$ java -verbose:gc -Xms64m -Xmx128m -XX:NewRatio=2 -XX:+UseConcMarkSweepGC -jar ../lib/jext-5.0.jar > gclog
$ gclog2csv.py gclog > gclog.csv

Although simple, this small script can help you tune your application pretty quickly. As of J2SE 5.0, Sun Microsystems provides a more powerful tool called JConsole that will graphically monitor the state of the heap and the activity of the GC as shown in Figure 3.

Further Reading

  • Java HotSpot VM Options, Sun Microsystems, http://java.sun.com/docs/hotspot/VMOptions.html
  • A Collection of JVM Options, Joseph Mocker, http://blogs.sun.com/roller/resources/watt/jvm-options-list.html
  • Pick up performance with generational garbage collection, Ken Gottry, www.javaworld.com/javaworld/jw-01-2002/jw-0111-hotspotgc-p4.html
  • Garbage Collection in the Java HotSpot Virtual Machine, Tony Printezis, www.devx.com/Java/Article/21977?trk=DXRSS_JAVA
  • Garbage Collection Performance, Brian Goetz, www-106.ibm.com/developerworks/java/library/j-jtp01274.html
  • Using JConsole to Monitor Applications, Mandy Chung, http://java.sun.com/developer/technicalArticles/J2SE/jconsole.html
  • Comments (1) View Comments

    Share your thoughts on this story.

    Add your comment
    You must be signed in to add a comment. Sign-in | Register

    In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.

    Most Recent Comments
    Suresh 05/31/05 01:44:21 PM EDT

    cannot go to page2!!!!

    IoT & Smart Cities Stories
    The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-c...
    Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
    The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
    René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
    Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
    Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
    Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.
    Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...
    Enterprises have taken advantage of IoT to achieve important revenue and cost advantages. What is less apparent is how incumbent enterprises operating at scale have, following success with IoT, built analytic, operations management and software development capabilities - ranging from autonomous vehicles to manageable robotics installations. They have embraced these capabilities as if they were Silicon Valley startups.
    As IoT continues to increase momentum, so does the associated risk. Secure Device Lifecycle Management (DLM) is ranked as one of the most important technology areas of IoT. Driving this trend is the realization that secure support for IoT devices provides companies the ability to deliver high-quality, reliable, secure offerings faster, create new revenue streams, and reduce support costs, all while building a competitive advantage in their markets. In this session, we will use customer use cases...