Which Kind Of Array Or Collection Should I Use?

Arrays and Collections, overview
This article explains the basic uses of - and differences between - some of the various types of array-like containers you can use in Unity. If you haven't heard of arrays before, they are basically a single variable with a series of "compartments", where each compartment can contain a value. They're useful if you want to deal with a collection of values or objects together. In a game programming situation, you might use an array to store a reference to every enemy in your scene, or as a container for collected items in the player's inventory. Arrays can be 'iterated over', which is a fancy way of saying that you can go through each item in your array in sequence, and inspect or perform functions on each item in turn.

There are many types of containers, and a few of them use the name "array", but there are other types too. The broader term "collection" can be used to describe all these types of containers. These are the types of collection that I'll be describing in this article:


 * Built-in array
 * Javascript Array
 * ArrayList
 * Hashtable
 * Generic List
 * Generic Dictionary
 * Hashset
 * 2D Array

There are other types for more specialised situations, but I have selected these as being some of the basic staples of programming.

Because Unity is built on Mono, which is an open-source implementation of .Net, you have access to most of .Net's collection types. All of the above types are standard .Net types, with the exception of the "Javascript Array". The "Javascript Array" is a type which is added into the Unity Engine, and is only available if you're using the Javascript syntax in Unity - it's not available if you're using C#. However, it is essentially just a wrapper for the ArrayList class - which is available in C# - with a different set of functions provided. If you're coming to Unity having used Javascript or a Javascript-like language (such as ActionScript) elsewhere outside of Unity, it's important to be aware of these underlying differences, and bear in mind that Unity's "Javascript" isn't the same Javascript commonly used in WWW - it's UnityScript, just .Net with a different syntax laid over the top!

All types of collections share a few common features:


 * You can fill them with objects, and read back the values that you put in.
 * You can 'iterate' through them, which means you can create a loop which runs the same piece of code against each item in the collection.
 * You can get the length of the collection.
 * For most collections, you can arbitrarily add and remove items at any position, and sort their contents.

By using LINQ, you can do a lot more:
 * mapping and filtering (Select and Where)
 * calculate sum or any aggregation you specify
 * basically replace a hand-rolled foreach-loop with a chain of LINQ functions
 * read more at MSDN page on LINQ

Below, I describe some of the most common types of collection, along with their pros and cons, and some (but not all) of their most useful properties and methods.

Built-in Arrays
The most basic type of array, available in both JS and C#, is the built-in array. These used be limited by a fixed-size, but they can now be resized with "System.Array.Resize.(arrayname, size);" They are the fastest array in unity. For this reason, built-in arrays are the best choice if you need the fastest performance possible from your code (for example, if you're targeting iPhone). If there is a fixed and known number of items that you want to store, this is the best choice.

It's also common to use this type of array if you have a varying number of items to store, but you can decide on a 'maximum' for the number of objects that you'll need. You can then leave some of the elements in the array null when they're not required, and design your code around this. For example, for the bullets in a shooting game, you may decide to use an array of size 50, allowing a maximum of 50 active bullets at any one time.

This type of array shows up in Unity's inspector window. This means that the contents of a built-in array can be populated in the Unity editor, by dragging and dropping references. Generic Lists also have this ability, whereas Arrays and ArrayLists do not.

It's also usually the type of array you get back from Unity functions, if you use a function which may return a number of objects, such as GetComponentsInChildren.

Built-in arrays are declared by specifying the type of object you want to store, followed by brackets. Eg:

Basic Declaration & Use:

C#

Javascript

Full MSDN Documentation for Built-in Array

Some direct links to useful Functions/Methods of Built-in Arrays:

IndexOf, LastIndexOf, Reverse, Sort, Clear, Clone

Javascript Arrays
The "Javascript Array" in unity is a special class that is provided as part of the Unity Engine in addition to the standard .net classes. You can only declare them if you are using a Javascript syntax script - you can't declare them in C# (but for C#, see "ArrayList" below).

Javascript arrays are dynamic in size, which means you don't have to specify a fixed size. You can add and remove items to the array, and the array will grow and shrink in size accordingly. You also don't have to specify the type of object you want to store. You can put objects of any type into a Javascript array, even mixed types in the same array.

Javascript arrays are therefore somewhat easier to use than built-in arrays, however they are more costly in terms of performance. Another potential downside is that there are certain situations where you need to use explicit casting when retrieving items because of their 'untyped' nature - despite Javascript's dynamic typing. In general, there's not really any reason to use Array, since other collection types do the same thing faster and with more features; it's left over from earlier in Unity's life when generic Lists weren't directly usable in Javascript. If you need a dynamic array of mixed types, a generic List of Object will do the same thing as Array, but faster.

Basic Declaration & Use: (Javascript Only)

Full Unity Documentation for Javascript Array

Some direct links to useful Functions/Methods of Unity's Javascript Arrays:

Concat, Join, Push, Add, Pop, Shift, RemoveAt, Unshift, Clear, Reverse, Sort

ArrayLists
The ArrayList is a .Net class, and is very similar to the Javascript Array mentioned previously, but this time available in both JS and C#. Like JS Arrays, ArrayLists are dynamic in size, so you can add and remove items, and the array will grow and shrink in size to fit. ArrayLists are also untyped, so you can add items of any kind, including a mixture of types in the same ArrayList. ArrayLists are also similarly more costly when compared to the blazingly fast performance of built-in arrays. ArrayLists have a wider set of features compared to JS Arrays, although neither of their feature sets completely overlaps the other.

Basic Declaration & Use:

Javascript

C#

Full MSDN Documentation for ArrayList

Some direct links to useful Functions/Methods of the ArrayList:

Add, Insert, Remove, RemoveAt, Clear, Clone, Contains, IndexOf, LastIndexOf, GetRange, SetRange, AddRange, InsertRange, RemoveRange, Reverse, Sort, ToArray

Hashtables
A Hashtable is a type of collection where each item is made up of a "Key and Value" pair. It's most commonly used in situations where you want to be able to do a quick look-up based on a certain single value. The piece of information that you use to perform the look-up is the 'key', and the object that is returned is the "Value".

If you are familiar with web development, it's similar to the type of data in a GET or POST request, where every value passed has a corresponding name. With a Hashtable however, both the keys and the values can be any type of object. For most practical applications, it's usually the case that your keys are going to be all the same type (eg, strings) and your values are likely to be all of the same type too (eg, GameObjects, or some other class instance). As with ArrayLists, because Hashtable keys and values are untyped, you usually have to deal with the type casting yourself when you retrieve values from the collection.

Hashtables are designed for situations where you want to be able to quickly pick out a certain item from your collection, using some unique identifying key - similar to the way you might select a record from a database using an index, or the way you might pick out the contact details of a person using their name as the 'unique identifier'.

Basic Declaration & Use:

Javascript

C#

Full MSDN Documentation for Hashtable Members

Some direct links to useful Functions/Methods of the HashTable:

Add, Remove, ContainsKey, ContainsValue, Clear, Clone

Generic List
The Generic List (also known as List) is similar to the JS Array and the ArrayList, in that they have a dynamic size, and support arbitrary adding, getting and removing of items. The significant difference with the Generic List (and all other 'Generic' type classes), is that you explicitly specify the type to be used when you declare it - in this case, the type of object that the List will contain.

Once you've declared it, you can only add objects of the correct type - and because of this restriction, you get two significant benefits:


 * no need to do any type casting of the values when you come to retrieve them.
 * performs significantly faster than ArrayList

This means that if you were going to create an ArrayList, but you know that you will only be putting objects of one specific type of object into it, (and you know that type in advance) you're generally better off using a Generic List. For me, this tends to be true pretty much all the time.

The generic collections are not part of the standard System.Collections namespace, so to use them conveniently, you need to add a line at the top of any script in which you want to use them in C#:

In Javascript:

Basic Declaration & Use:

(C#)

(Javascript) numList = nums.ToList;                      // convert an array to list

Full MSDN Documentation for Generic List

Some direct links to useful Methods of the Generic List:

Add, Insert, Remove, RemoveAll, RemoveAt, Contains, IndexOf, LastIndexOf, Reverse, Sort, Clear, AddRange, GetRange, InsertRange, RemoveRange, ToArray

Also, how to count certain item in List

Generic Dictionary
The Generic Dictionary is to the Hashtable what the Generic List is to the ArrayList. The Generic Dictionary provides you with a structure for quickly looking up items from a collection (like the Hashtable), but it differs from the Hashtable in that you must specify explictly the types for the Keys and Values up-front, when you declare it.

Because of this, you get similar benefits to those mentioned in the Generic List. Namely, no annoying casting needed when using the Dictionary, and a significant performance increase compared to the Hashtable.

Because you need to specify the types for both the Keys and the Values, the declaration line can end up a little long and wordy. However, you can use type inference to alleviate this:

(C#) (Javascript)

Again, to use this, you'll need to include the Generic Collections package by including this line at the top of your script:

(C#)

(Javascript)

Basic Declaration & Use:

(C#)

(Javascript)

Full MSDN Documentation for Dictionary(TKey,TValue)

Some direct links to useful Methods of the Generic Dictionary:

Add, Remove, ContainsKey, ContainsValue, Clear,

2D Array
So far, all the examples of Arrays and Collections listed above have been one-dimensional structures, but there may be an occasion where you need to place data into an array with more dimensions. A typical game-related example of this is a tile-based map. You might have a 'map' array which should have a width and a height, and a piece of data in each cell which determines the tile to display. It is also possible to have arrays with more than two dimensions, such as a 3D array or a 4D array — however if you have a need for a 3D or 4D array, you're probably advanced enough to not require an explanation of how to use them!

There are two methods of implementing a multi-dimensional array. There are "real" multi-dimensional arrays, and there are "jagged" arrays. The difference is this:

With a "real" 2D array, your array has a fixed "width" and "height" (although they are not called width & height). You can refer to a location in your 2d array like this: myArray[x,y].

In contrast, "jagged" arrays aren't real 2D arrays, because they are created by using nested one-dimensional arrays. In this respect, what you essential have is a one-dimensional outer array which might represent your "rows", and each item contained in this outer array is actually an inner array which represents the cells in that row. To refer to a location in a jagged array, you would typically use something like this: myArray[y][x].

Usually, "real" 2D arrays are preferable, because they are simpler to set up and work with; however there are some valid cases for using jagged arrays. Such cases usually make use of the fact that, with a jagged array, each "inner" array doesn't have to be the same length (hence the origin of the term "jagged").

Note that Javascript doesn't currently support the syntax for declaring the type of jagged arrays directly. You can use type inference, however:

In this case myJaggedArray is assigned a type of int[][], because the supplied values are all ints. You can also use jagged arrays declared in C# scripts without any problems.

Basic Declaration & Use of "real" 2D arrays in C#:

In Javascript:

So which one should I use?
If there is no specific reason not to, use the generic containers.

Specific reasons might be:
 * optimizing memory efficiency and speed by using built-in arrays — since built-in arrays can be significantly faster than generic Lists for primitive types (up to 5X faster), it's a good idea to use them when you need the best possible speed.
 * changing Lists to Arrays isn't hard in the cases where Arrays are good enough for the job (known size), so why not still use Lists during development and save Array'ing to optimization phase?
 * third-party library uses ArrayLists and Hashtables (boooo, MiniJSON.cs, boooo), so I just want to avoid bunch of nasty-looking typecasts to stuff what they return into typed containers
 * well, you will need a typecast eventually anyway, and passing around Objects somehow implicitly trusting they are what they should is just asking for trouble
 * LINQ's Cast may help you with the problem of getting typed data out of untyped container

There's no good reason to use the JS Array class. It's relatively slow and won't work with #pragma strict code, unless you manually cast to the correct type when retrieving items. Use a generic List instead.

Similarly, ArrayList and Hashtable should be avoided, especially if you want to publish to the Windows store, since those collection types are obsolete and have been removed from the version of .NET that's used for Windows store apps. ("Real" .NET is used in that particular case, rather than Mono.) Instead, use generic List and Dictionary, respectively.