How to Cache DTO Projections with Hibernate


Take your skills to the next level!

The Persistence Hub is the place to be for every Java developer. It gives you access to all my premium video courses, monthly Java Persistence News, monthly coding problems, and regular expert sessions.


The most efficient way to improve the performance of a database query is to avoid it by retrieving the data from a local cache. That’s why Hibernate offers 3 different caches:

  1. The 1st level cache contains all entities loaded and created within the current session.
  2. The 2nd level cache is a shared, session-independent cache for entities.
  3. The query cache is a shared, session-independent cache for the results of Criteria and JPQL queries.

In this article, I will focus on the query cache. It’s Hibernate-specific and the only one that doesn’t store entities. That makes it your only option if you want to cache DTO projections.

I explain all 3 caches in great details in my Hibernate Performance Tuning Online Training, which I will re-open for enrollment in June.

The Benefits of DTO Projections

DTO projections are one of the best and easiest way to improve the performance of your read operations. In contrast to entities, which are part of the domain model and are used by multiple use cases, you can define a DTO projection for a specific use case. That enables you to implement a query that only fetches the data that you need for your use case. Within that query, you can use all features supported by JPQL, like selecting attributes from multiple entities or using database functions to transform your data.

And that’s not the only reasons why you should use them for read operations. As I showed in one of my previous articles, even if your DTO projection contains the same information as your entity, the DTO projection is significantly faster.

The only downside of queries that return DTO projections is that the JPA specification doesn’t offer any option to cache them. But you can use Hibernate’s proprietary query cache for it.

How to Activate Hibernate’s Query Cache

To use Hibernate’s query cache you first need to activate it in your persistence.xml file. In the next step you need to explicitly activate caching for your query.

This 2-step activation is necessary because most of your queries are not good candidates for caching.

You should only cache the result of a query, which you often call with the same set of bind parameter values. In addition to that, the data on which you perform your query should only rarely change. Otherwise, your query cache will not be very effective. It will spend more time adding and removing entries than actually returning cached query results.

How to Activate Your Query Cache

You can activate the query cache by setting the hibernate.cache.use_query_cache parameter in your persistence.xml to true. And if you want to use the query cache for any query that returns entities, you should also make sure to configure the 2nd level cache for these entities. But that’s a topic for another article, and I explain it in great details in the Hibernate Performance Tuning Online Training.

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<persistence xmlns="http://xmlns.jcp.org/xml/ns/persistence" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="2.1" xsi:schemaLocation="http://xmlns.jcp.org/xml/ns/persistence http://xmlns.jcp.org/xml/ns/persistence/persistence_2_1.xsd">
    <persistence-unit name="my-persistence-unit">
        <description>Hibernate Performance Tuning</description>
        <provider>org.hibernate.jpa.HibernatePersistenceProvider</provider>
        <exclude-unlisted-classes>false</exclude-unlisted-classes>
        
        <!--  enable selective 2nd level cache -->
        <shared-cache-mode>ENABLE_SELECTIVE</shared-cache-mode>
        <properties>
            ...
            
            <!--  configure caching -->
            <property name="hibernate.cache.use_query_cache" value="true"/>	
        </properties>
    </persistence-unit>
</persistence>

How to Activate Caching for a Query

After you activated the query cache, you need to tell Hibernate to cache the result of the query. You can do that in 2 ways:

  1. If you’re using JPA’s Query interface, you can set the query hint QueryHints.CACHEABLE to true.
  2. If you’re using Hibernate’s Query interface, you can call the setCacheable method with true.

Both options create the same result. Hibernate checks the query cache before it executes the query. If the cache contains the result, Hibernate returns it without performing the query. If the result isn’t cached, Hibernate executes the query and stores the result in the query cache.

Let’s give it a try.

The following query uses a constructor expression and selects the title attribute of the Book entity and the name attribute of the Publisher entity. For each record returned by this query, Hibernate calls the constructor of the BookPublisherValue class. In this example, I use JPA’s Query interface and activate the query cache with a query hint.

TypedQuery<BookPublisherValue> q = em
		.createQuery(
				"SELECT new org.thoughts.on.java.model.BookPublisherValue(b.title, p.name) FROM Book b JOIN b.publisher p WHERE b.id = :id",
				BookPublisherValue.class);
q.setHint(QueryHints.CACHEABLE, true);
q.setParameter("id", 1L);
BookPublisherValue value = q.getSingleResult();

If you activate the Hibernate statistics and the logging of SQL statements, you can see that Hibernate executes the query and stores the result in the cache.

19:28:04,826 INFO  [org.hibernate.cache.internal.StandardQueryCache] - HHH000248: Starting query cache at region: org.hibernate.cache.internal.StandardQueryCache
19:28:04,831 DEBUG [org.hibernate.SQL] - select book0_.title as col_0_0_, publisher1_.name as col_1_0_ from Book book0_ inner join Publisher publisher1_ on book0_.publisherid=publisher1_.id where book0_.id=?
19:28:04,841 DEBUG [org.hibernate.stat.internal.ConcurrentStatisticsImpl] - HHH000117: HQL: SELECT new org.thoughts.on.java.model.BookPublisherValue(b.title, p.name) FROM Book b JOIN b.publisher p WHERE b.id = :id, time: 14ms, rows: 1
19:28:04,850 INFO  [org.thoughts.on.java.model.TestQueryCache] - BookPublisherValue [publisher=Addison Wesley, title=Effective Java]
19:28:04,853 INFO  [org.hibernate.engine.internal.StatisticalLoggingSessionEventListener] - Session Metrics {
    28300 nanoseconds spent acquiring 1 JDBC connections;
    27201 nanoseconds spent releasing 1 JDBC connections;
    307300 nanoseconds spent preparing 1 JDBC statements;
    1204200 nanoseconds spent executing 1 JDBC statements;
    0 nanoseconds spent executing 0 JDBC batches;
    3333200 nanoseconds spent performing 1 L2C puts;
    0 nanoseconds spent performing 0 L2C hits;
    1897000 nanoseconds spent performing 1 L2C misses;
    0 nanoseconds spent executing 0 flushes (flushing a total of 0 entities and 0 collections);
    78800 nanoseconds spent executing 1 partial-flushes (flushing a total of 0 entities and 0 collections)
}

If you rerun the same code, Hibernate gets the query result from the query cache without executing the query.

19:28:04,859 INFO  [org.thoughts.on.java.model.TestQueryCache] - BookPublisherValue [publisher=Addison Wesley, title=Effective Java]
19:28:04,860 INFO  [org.hibernate.engine.internal.StatisticalLoggingSessionEventListener] - Session Metrics {
    11401 nanoseconds spent acquiring 1 JDBC connections;
    10700 nanoseconds spent releasing 1 JDBC connections;
    0 nanoseconds spent preparing 0 JDBC statements;
    0 nanoseconds spent executing 0 JDBC statements;
    0 nanoseconds spent executing 0 JDBC batches;
    0 nanoseconds spent performing 0 L2C puts;
    234000 nanoseconds spent performing 1 L2C hits;
    267599 nanoseconds spent performing 2 L2C misses;
    0 nanoseconds spent executing 0 flushes (flushing a total of 0 entities and 0 collections);
    8500 nanoseconds spent executing 1 partial-flushes (flushing a total of 0 entities and 0 collections)
}

Conclusion

DTO projections perform much better than entities if you want to read data from the database. In this test, the DTO projection gave me a ~40% performance improvement. And your application will be even faster if Hibernate can skip the query and get the result from local memory.

You can achieve that with Hibernate’s query cache. You need to activate it in your persistence.xml configuration by setting the hibernate.cache.use_query_cache parameter to true. In the next step, you also need to activate the cache for a specific query by setting the query hint QueryHints.CACHEABLE on your JPA Query or by calling the setCacheable method on Hibernate’s Query interface.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.