Skip to main content
  1. Open Source/

dice

Dice #

A cryptographically secure pseudo-random number generator CLI tool which generates byte arrays with entropy from the best random source from your machine* optionally externally seeded by multiple true random number generators and supports various byte-to-text encodings like hex or base64 and for many programming languages. The output may be printed to the command line or to a file. This implementation uses the HMAC Deterministic Random Bit Generator (DRBG) schema as defined in NIST SP800-90Ar1.

* depending on the used provider

logo

Main features:

Example usage generating randoms with 24 byte-length (not char length) and default encoding:

java -jar dice.jar 24

More examples:

java -jar dice.jar 16 --count 100
java -jar dice.jar 16 --encoding "base64"
java -jar dice.jar 16 --encoding "java"
java -jar dice.jar 4096 --encoding "raw" --count 1024 --file "./rnd-4-MiB-outputfile.txt"
java -jar dice.jar 16 --seed "myBadRandomSeed"
java -jar dice.jar 16 --offline
java -jar dice.jar 32 --encoding "base85" --urlencode --padding --crc32

This should run on any Windows, Mac or Linux machine.

Requirements #

Download #

Grab jar from the latest Release

Using the *.exe Launcher: Launch4J is used to wrap the .jar into a Windows executable. It should automatically download the needed JRE if required.

Demo #

asciicast

Use Cases #

Creating Nonces, Tokens, Identifiers or Passwords #

I’ll recommend base32, base36 or base58 because these encodings are typically url-safe. 16 byte usually suffice for globally unique, infeasible to brute force number.

java -jar dice.jar 16 -e "base36"

If you require fixed char-sized output either use hex encoding or other encodings supporting paddings like base32 or base64. For passwords, high-density encodings are recommended like base85.

Creating static byte arrays for your application #

You can create static salts, or randoms to hardcode, in your code. Just pick your programming language to get the correct syntax (see below). E.g.:

java -jar dice.jar 16 -e "java"

Creating files of entropy #

Create a file 4MiB full of raw random bytes with this call:

java -jar dice.jar 4096 -c 1024 -e "raw" -f "./rnd-outfile.txt"

This will create random chunks of 4Kib (the maximum allowed size per chunk) repeated 1024 times. Currently, the tool is capped to 10 GiB of generated random data per call. Successive calls will append data, not overwrite it.

Command Line Interface #

   --anuquantum             Enable external, supposed true random generator ANU Quantum; note this service is known
                            to be slow (only when online).
-c,--count <number>         How many randoms should be generated. Automatically chosen if this argument is omitted.
   --crc32                  If this flag is set, 4 bytes of CRC32 checksum will be appended to every random value.
                            If you need to check the integrity of the data.
-d,--debug                  Prints additional info for debugging.
-e,--encoding <string>      Output byte-to-text encoding. Available encodings include:
                            binary, octal, dec, base16, BASE16, base26, base32, base36, base58, base64, base64-url,
                            base85, c, c#, java, go, kotlin, node, js, perl, php, python3, ruby, rust, swift, img,
                            raw, utf8
-f,--file <path>            Prints the random data to given file instead of the command line. Will create the file
                            if it does not exist or append the data if it does.
-h,--help                   Shows this page.
-o,--offline                Skips request to external random generators (random.org) for seeding (use when
                            offline).
-p,--padding                If this flag is set, byte-to-text output will be padded to full byte if needed.
-r,--robot                  If this flag is set, output will be more friendly for scripting (ie. no verbose text,
                            only the randoms 1 per line)
-s,--seed <string|number>   Uses either the 64-bit integer interpretation or the utf-8 byte representation of given
                            parameter to seed the internal random generator. Warns if entropy is low.
-u,--urlencode              Uses 'www-form-urlencoded' encoding scheme, also misleadingly known as URL encoding, on
                            the output strings
-v,--version                Prints application version.

Supported Encodings #

Byte-to-Text Encodings #

NameExampleEfficiencyPaddingDescription
binary11101101 10101111 00011110 11111111 11111101 10010100 0100101012.5 %falseA simple binary representation with ‘0’ and ‘1’ divided into 8 bit groups.
octal112451767770752775537.5 %trueThe octal numeral system, is the base-8 number system, and uses the digits 0 to 7.
dec2099296690442647741.5 %trueDecimal positive sign-magnitude representation representation in big-endian byte-order.
base164a94fdff1eafed50.0 %falseBase16 or hex stores each byte as a pair of hexadecimal digits. Lowercase (a-f) letters are used for digits greater than 9.
BASE164A94FDFF1EAFED50.0 %falseBase16 or hex stores each byte as a pair of hexadecimal digits. Uppercase (A-F) letters are used for digits greater than 9.
base26FSSLZZFNQZQZ58.8 %trueBase26 uses the twenty-six letters A-Z.
base32JKKP37Y6V7WQ62.5 %trueBase32 uses a 32-character subset of the twenty-six letters A-Z and the digits 2-7. Uses the alphabet defined in RFC 4648.
base365qpdvuwjvu564.6 %trueBase36 translating into a radix-36 (aka Hexatrigesimal) representation.
base583pvkeHJmHN73.2 %trueBase58 is similar to Base64 but has been modified to avoid both non-alphanumeric characters and letters which might look ambiguous when printed. This version uses the alphabet common for Bitcoin protocol.
base64SpT9/x6v7Q75.0 %trueBase64 represent binary data in an ASCII string format by translating it into a radix-64 representation.
base64-urlSpT9_x6v7Q75.0 %trueBase64 represent binary data in an ASCII string format by translating it into a radix-64 representation. Uses url safe mode
base858sK;S*j=r80.1 %trueBase85 uses an 85 character ASCII alphabet to encode. It’s main use is with the PDF format and GIT.

Programming Languages #

NameExample
c{0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11};
c#new byte[]{0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11};
javanew byte[]{(byte) 0xD0, 0x3A, 0x4A, (byte) 0xEE, 0x64, 0x11};
go[...]byte = {0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11}
kotlinbyteArrayOf(0xD0.toByte(), 0x3A, 0x4A, 0xEE.toByte(), 0x64, 0x11)
nodenew Buffer([0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11]);
javascriptUint8Array.from([0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11]);
perlpack 0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11;
phparray(208, 58, 74, 238, 100, 17);
python3bytes([0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11])
ruby[0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11]
rust[u8; 6] = [0xd0, 0x3a, 0x4a, 0xee, 0x64, 0x11];
swift[UInt8] = [0xD0, 0x3A, 0x4A, 0xEE, 0x64, 0x11]

Other #

NameExampleDescription
rawJ”ýÿ¯íPrints the raw byte array encoded in ISO_8859_1 which does not change the byte output. Most useful with file output.
utf8J�����Prints the byte array interpreted as UTF-8 encoded text. Only for testing purpose.
img▙▝▚░▓▃Prints a byte per character encoded in unicode block elements.

Digital Signatures #

Signed Jar #

The provided JARs in the GitHub release page are signed with my private key:

CN=Patrick Favre-Bulle, OU=Private, O=PF Github Open Source, L=Vienna, ST=Vienna, C=AT
Validity: Thu Sep 07 16:40:57 SGT 2017 to: Fri Feb 10 16:40:57 SGT 2034
SHA1: 06:DE:F2:C5:F7:BC:0C:11:ED:35:E2:0F:B1:9F:78:99:0F:BE:43:C4
SHA256: 2B:65:33:B0:1C:0D:2A:69:4E:2D:53:8F:29:D5:6C:D6:87:AF:06:42:1F:1A:EE:B3:3C:E0:6D:0B:65:A1:AA:88

Use the jarsigner tool (found in your $JAVA_HOME/bin folder) folder to verify.

Signed Commits #

All tags and commits by me are signed with git with my private key:

GPG key ID: 4FDF85343912A3AB
Fingerprint: 2FB392FB05158589B767960C4FDF85343912A3AB

Deterministic Random Bit Generation #

As cryptographically secure pseudorandom number generator, the NIST SP800-90Ar1 recommendation HMAC-DRBG is used in an implementation derived from the google/rappor project. HMAC-DRBG seems to be a better choice than the also recommended HASH-DRBG approach. Java 9 is expected to have its own provider for it. There is no known issue with Java’s current SHA1-PRNG implementation, but it is less studied thant the NIST recommendation.

This implementation uses HMAC-SHA512 internally and reseeds itself after 1 MiB of random data generation which is well below the maximum NIST recommendation.

References:

Output Test Results #

This PRNG was tested with Robert G. Brown’s dieharder test suite for random number generators, which among others, includes the NIST Statistical Test Suite.

The reports can be seen in /misc/reports/* in this repo.

DRBG Seeding & Input Sources #

A DRGB needs to be seeded by strong entropy sources, so it can safely be expanded to create unpredictable pseudo random output. SP800-90Ar1 defines different types of input for the DRGB. This implementation uses the following types:

Entropy Input #

This implementation uses multiple entropy sources to seed its random bit generator. All these sources are combined and a weak source will not weaken the overall output. This ensures that even if one source fails the output is still cryptographically strong. Below is a detailed description of the used sources:

Strong Secure Random Seed #

This is the main entropy source. This implementation uses the SecureRandom class with its getStrongInstance() constructor to get the best cryptographic random generator available. Internally SecureRandom chooses among providers available at runtime. The best of those access the OS own entropy pools (e.g. /dev/random in *nix systems) since the OS has better access to various random sources.

Further reading:

External Random Service Seeding #

Per default the tool tries to fetch a seed from an external (supposedly true) random source.

Because there are various opinions what technique delivers truly random data, this tool incorporates 3 different services backed by different hardware RNG. Also, to mitigate the fact that if one or more source is either compromised or produces predictable outcome, the other source will mitigate that flaw.

Using an external random might open a new attack vector if, for example, an attacker might read the seed send over the network. There are 2 measures against this:

  • The connections are encrypted with TLS (i.e. HTTPS) and the random is signed by the creator which will be verified by a local pinned certificate (only random.org).
  • The seed is only a part of the entropy source and the knowledge of it does not make it possible to guess the random bits. Therefore, there is no sole trust in an external service. Every generation of random data will see seeding from both local and external sources.

HMAC-based key derivation function (HKDF) described in RFC5869 is used to expand the external seed to the desired length.

Random.org #

Random.org is a website that produces “true random numbers” based on atmospheric noise captured by several radios tuned between stations. The service has existed since 1998 and was built by Dr. Mads Haahr of the School of Computer Science and Statistics at Trinity College, Dublin in Ireland. Random.org offers Transport Layer Security (TLS) encrypted access and signed random data with JSON-RPC 2.0

References:

Hotbits #

Hotbits is not supported anymore since 1st of Jan 2023 and was removed in dice version 1.8.0.

ANU Quantum Random Numbers Server #

A quantum random number generator offered by the Australian National University. The random numbers are generated in real-time by measuring the quantum fluctuations of the vacuum. The services provide a TLS encrypted JSON/REST API.

References:

Local seeding #

The caller may provide a string that additionally seeds the random bit generator. A seed provided by the user is seen as weak seed and will always be combined with the internal state of a strong SecureRandom instance.

Threaded Seed Generator #

A port of SUN’s threaded seed generator by Joshua Bloch. The seed is produced by counting the number of times the VM manages to loop in a given period. This number roughly reflects the machine load at that point in time. The samples are translated using a permutation (s-box) and then XORed together. This process is non-linear and should prevent the samples from “averaging out”. This implementation uses faster timings to produce values faster.

This generator is mainly used as a fallback if there is no external seed and the internal seeds fail.

Nonce Input #

The nonce is composed of:

  • Monotonically increasing sequence number with a starting value of current JVM startup time
  • System nanosecond time (which has arbitrary starting point)
  • JVM uptime in milliseconds
  • Current elapsed milliseconds since January 1, 1970, UTC.

The four 8 byte values will be hashed with HKDF.

Personalization String #

The goal of a personalization string is to gather as much information about e.g. runtime, machine identifiers and static identifiers to make the call as unique as possible for this particular machine/runtime/version/etc.

For this the following data will be gathered:

  • MAC address of all network adapters
  • Runtime & OS information (e.g. uptime, current cpu usage, processor count, classpath)
  • SCM information (e.g. commit hash, committer, etc.) & version name
  • Environmental variables and system properties
  • Content of the temp directory

The resulting data will be hashed with HKDF.

Example Data #

Under misc/example a couple of files of example random data can be found (e.g. 1MiB and 10MiB)

Build #

Jar Sign #

If you want to jar sign you need to provide a file keystore.jks in the root folder with the correct credentials set in environment variables ( OPENSOURCE_PROJECTS_KS_PW and OPENSOURCE_PROJECTS_KEY_PW); alias is set as pfopensource.

If you want to skip jar signing just change the skip configuration in the pom.xml jar sign plugin to true:

<skip>true</skip>

Build with Maven #

Use maven (3.1+) to create a jar including all dependencies

mvn clean install

Checkstyle Config File #

This project uses my common-parent which centralized a lot of the plugin versions as well as providing the checkstyle config rules. Specifically they are maintained in checkstyle-config. Locally the files will be copied after you mvnw install into your target folder and is called target/checkstyle-checker.xml. So if you use a plugin for your IDE, use this file as your local configuration.

Tech-Stack #

  • Java 8
  • Maven
  • rxjava2, apache-commons-codec, apache-commons-cli, Retrofit 2
  • Proguard, Launch4j, Jar Signing

Credits #

License #

Copyright 2017 Patrick Favre-Bulle

Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Patrick Favre
Author
Patrick Favre
Software Engineer currently working as architect, backend dev, cloud engineer, IT ops rookie. Cryptography and security are my passions.