Codex

Encoding and Decoding Genetic Information

Because Radiate has such a specific domain language, there needs to be a way to convert between the domain language or 'problem space' and the 'solution space'. For example, if you are trying to evolve a list of floating-point numbers, the GeneticEngine needs to know how to interact with that list of numbers, but from our perspective, we are only interested in the real-world problem that the list of numbers represents. This is where the Codex comes in.

When the GeneticEngine is evolving a population of individuals, it uses the Codex to encode the genetic information of the individuals into a Genotype, and then decode the Genotype back into the domain language when evaluating the fitness of the individuals. This process of encoding and decoding genetic information is what allows the GeneticEngine to operate on the genetic information of the individuals while still allowing us to work with the real-world problem that the genetic information represents. This allows the fitness_fn to accept the real-world representation of the individual we defined in the Codex.

In other words, the Codex is the bridge between the domain language of Radiate and the real-world problem that you are trying to solve.

Core library Codex implementations

BitCodexCharCodexFloatCodexIntCodexPermutationCodexSubsetCodexFnCodex

pub struct BitCodex {
    pub num_chromosomes: usize,
    pub num_genes: usize,
}

Encodes: Genotype of BitChromosomes with BitGenes
Decodes: Vec<Vec<bool>>

When people traditionally think of genetic algorithms, they often think of binary strings. The BitCodex is a simple way to encode and decode a Genotype of binary strings so in essence, it is the most basic Codex implementation.

classDiagram 
    class BitCodex {
        num_chromosomes: usize
        num_genes: usize

        encode() Genotype~BitChromosome~
        decode(&Genotype~BitChromosome~) Vec~Vec~bool~~
    }

pub struct CharCodex {
    num_chromosomes: usize,
    num_genes: usize,
    char_set: Arc<[char]>,
}

Encodes: Genotype of CharChromosomes with CharGenes
Decodes: Vec<Vec<char>>

classDiagram
    class CharCodex {
        num_chromosomes: usize
        num_genes: usize
        char_set: Arc~[char]~

        encode() Genotype~CharChromosome~
        decode(&Genotype~CharChromosome~) Vec~Vec~char~~
    }

pub struct FloatCodex {
    pub num_chromosomes: usize,
    pub num_genes: usize,
    pub min: f32,
    pub max: f32,
    pub lower_bound: f32,
    pub upper_bound: f32,
}

Encodes: Genotype of FloatChromosomes with FloatGenes
Decodes: Vec<Vec<f32>>

classDiagram
    class FloatCodex {
        num_chromosomes: usize
        num_genes: usize
        min: f32
        max: f32
        lower_bound: f32
        upper_bound: f32

        with_bounds(mut self, f32, f32) FloatCodex
        scalar(f32, f32) FloatCodex
        encode() Genotype~CharChromosome~
        decode(&Genotype~CharChromosome~) Vec~Vec~char~~
    }

pub struct IntCodex<T: Integer<T>> {
    pub num_chromosomes: usize,
    pub num_genes: usize,
    pub min: T,
    pub max: T,
    pub lower_bound: T,
    pub upper_bound: T,
}

Encodes: Genotype of IntChromosome<T> with IntGene<T>
Decodes: Vec<Vec<T>>

Note: T must implement the Integer trait. Integer is a trait in Radiate and is implemented for i8, i16, i32, i64, i128, u8, u16, u32, u64, u128.

classDiagram
    class IntCodex~T~ {
        num_chromosomes: usize
        num_genes: usize
        min: T
        max: T
        lower_bound: T
        upper_bound: T

        with_bounds(mut self, T, T) IntCodex~T~
        encode() Genotype~IntChromosome~
        decode(&Genotype~IntChromosome~T~~) Vec~Vec~char~~
    }

pub struct PermutationCodex<A: PartialEq + Clone> {
    pub alleles: Arc<Vec<A>>,
}

Encodes: Genotype of PermutationChromosomes<A> with PermutationGene<A>
Decodes: Vec<A>

Permutation problems are problems where the order of the elements in the solution is important. The Travelling Salesman Problem is a classic example of a permutation problem.

classDiagram
    class PermutationCodex~A~ {
        alleles: Arc~Vec~A~~

        encode() Genotype~PermutationChromosome~A~~
        decode(&Genotype~PermutationChromosome~A~~) Vec~A~~
    }

pub struct SubSetCodex<'a, T> {
    pub items: &'a Vec<T>,
}

Encodes: Genotype of BitChromosome with BitGene
Decodes: Vec<&'a T>

The SubsetCodex is a specialized Codex that is used for subset selection problems. In subset selection problems, the goal is to select a subset of items from a larger set of items that maximizes some objective function. The Knapsack Problem is a classic example of a subset selection problem.

classDiagram
    class SubSetCodex~'a, T~ {
        items: &'a Vec~T~

        encode() Genotype~BitChromosome~
        decode(&Genotype~BitChromosome~) Vec~&'a T~
    }

pub struct FnCodex<C: Chromosome, T> {
    pub encoder: Option<Box<dyn Fn() -> Genotype<C>>>,
    pub decoder: Option<Box<dyn Fn(&Genotype<C>) -> T>>,
}

Encodes: Genotype of C with C::GeneType
Decodes: T

The FnCodex is a generic Codex that allows you to define your own encoding and decoding functions. This is useful if you have a custom problem that doesn't fit into the other Codex implementations and don't want to create a new Codex implementation.

Build your own Codex

FloatCodex

Let's take a look at a simplified version of Raditate's built-in FloatCodex which encodes and decodes floating-point numbers. The FloatCodex takes in the number of chromosomes, number of genes per chromosome, the max allele value, and the min allele value. The encode method creates a new Genotype of FloatChromosomes with FloatGenes that have random alleles between the max and min. The decode method takes a Genotype and returns a Vec<Vec<f32>> of the gene values.

use std::ops::Range;
use radiate::*;

pub struct FloatCodex {
    pub num_chromosomes: usize,
    pub num_genes: usize,
    pub min: f32,
    pub max: f32,
    pub lower_bound: f32,
    pub upper_bound: f32,
}

impl FloatCodex {
    /// Create a new `FloatCodex` with the given number of chromosomes, genes, min, and max values.
    /// The f_32 values for each `FloatGene` will be randomly generated between the min and max values.
    pub fn new(num_chromosomes: usize, num_genes: usize, range: Range<f32>) -> Self {
        let (min, max) = (range.start, range.end);
        FloatCodex {
            num_chromosomes,
            num_genes,
            min,
            max,
            lower_bound: f32::MIN,
            upper_bound: f32::MAX,
        }
    }
}

impl Codex<FloatChromosome, Vec<Vec<f32>>> for FloatCodex {
    fn encode(&self) -> Genotype<FloatChromosome> {
        Genotype {
            chromosomes: (0..self.num_chromosomes)
                .map(|_| {
                    FloatChromosome {
                        genes: (0..self.num_genes)
                            .map(|_| {
                                FloatGene::from((self.min..self.max, self.lower_bound..self.upper_bound))
                            })
                            .collect::<Vec<FloatGene>>(),
                    }
                })
                .collect::<Vec<FloatChromosome>>(),
        }
    }

    fn decode(&self, genotype: &Genotype<FloatChromosome>) -> Vec<Vec<f32>> {
        genotype
            .iter()
            .map(|chromosome| {
                chromosome
                    .iter()
                    .map(|gene| *gene.allele())
                    .collect::<Vec<f32>>()
            })
            .collect::<Vec<Vec<f32>>>()
    }
}

Lets take a look at how we can use the FloatCodex to encode and decode a Genotype of FloatChromosomes with 2 chromosomes and 3 genes per chromosome:

fn main() {
    let codex = FloatCodex::new(2, 3, 0.0..1.0);

    let genotype: Genotype<FloatChromosome> = codex.encode();
    let decoded: Vec<Vec<f32>> = codex.decode(&genotype);
}

The genotype in this case will look something like this:

Genotype {
    chromosomes: [
        FloatChromosome {
            genes: [
                FloatGene { allele: 0.123, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX },
                FloatGene { allele: 0.456, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX }, 
                FloatGene { allele: 0.789, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX },
            ],
        },
        FloatChromosome {
            genes: [
                FloatGene { allele: 0.321, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX },
                FloatGene { allele: 0.654, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX },
                FloatGene { allele: 0.987, min: 0.0, max: 1.0, lower_bound: f32::MIN, upper_bound: f32::MAX },
            ],
        },
    ],
}

And the decoded is a Vec<Vec<f32>> of the gene values and is what will be passed to the fitness_fn:

[
    [0.123, 0.456, 0.789],
    [0.321, 0.654, 0.987],
]

FnCodex - NQueens

use radiate::*;

const N_QUEENS: usize = 8;

fn main() {
    // this is a simple example of the NQueens problem.
    // The resulting codex type will be FnCodex<IntChromosome<i8>, Vec<i8>>.
    let codex = FnCodex::new()
        .with_encoder(|| {
            Genotype::from_chromosomes(vec![IntChromosome {
                genes: (0..N_QUEENS)
                    .map(|_| IntGene::from(0..N_QUEENS as i8))
                    .collect(),
            }])
        })
        .with_decoder(|genotype| {
            genotype.chromosomes[0]
                .genes
                .iter()
                .map(|g| *g.allele())
                .collect::<Vec<i8>>()
        });
    // encode and decode
    let genotype = codex.encode(); // Genotype<IntChromosome<i8>>
    let decoded = codex.decode(&genotype); // Vec<i8>
}

NQueens

A simple struct to represent the NQueens problem.

use radiate::*;

#[derive(Clone, Debug, PartialEq)]
struct NQueens(Vec<i32>);

A Codex for the NQueens problem.

struct NQueensCodex {
    size: i32,
}

Implement the Codex trait for the NQueensCodex. The encode function creates a Genotype with a single chromosome of size genes. The decode function creates a NQueens from the Genotype.

impl Codex<IntChromosome<i32>, NQueens> for NQueensCodex {
    fn encode(&self) -> Genotype<IntChromosome<i32>> {
        let genes = (0..self.size).map(|_| IntGene::from(0..self.size)).collect();
        let chromosomes = vec![IntChromosome { genes }];
        Genotype::from_chromosomes(chromosomes)
    }

    fn decode(&self, genotype: &Genotype<IntChromosome<i32>>) -> NQueens {
        NQueens(genotype.chromosomes[0].iter().map(|g| *g.allele()).collect())
    }
}

Create a new NQueensCodex with a size of 5.

let codex = NQueensCodex { size: 5 };

encode a new Genotype of IntGenes with a size of 5. The result will be a genotype with a single chromosome with 5 genes. The genes will have a min value of 0, a max value of 5, an upper_bound of 5, and a lower_bound of 0 The alleles will be random values between 0 and 5. It will look something like:

Genotype {
    chromosomes: [
        IntChromosome<i32> {
            genes: [
                IntGene { allele: 3, min: 0, max: 5, ... },
                IntGene { allele: 7, min: 0, max: 5, ... },
                IntGene { allele: 1, min: 0, max: 5, ... },
                IntGene { allele: 5, min: 0, max: 5, ... },
                IntGene { allele: 2, min: 0, max: 5, ... },
            ]
        }
    ]
}

let genotype = codex.encode();

decode the genotype to a NQueens. The result will be a NQueens struct with a Vec of 8 random values between 0 and 8. It will look something like:

NQueens([3, 7, 1, 5, 2])

let nqueens = codex.decode(&genotype);