glib hash memory footprint

Hi all,

I am working on a project where we create some large string->uint hash tables (> 4 million elements). One thing I've noticed is the large memory overhead of glib's hash table.

glib's hash table uses up to 200MB to store the data, while a straight linear probing hash implementation, with the same hashing function, takes about 90MB, with the same performance. As far as I can see, the main reason for this is that glib's hash implementation uses linked lists (separate chaining), so every entry carries a separately allocated node plus a next pointer. So my questions are:

1. Is there some reason to use linked lists instead of linear probing (putting colliding elements in the next free bucket)?

2. Would glib/gtk team be interested in a linear/custom probing hash implementation?

One more thing I would like to have in the API is the ability to arbitrarily change the equality function of a hash table (between calls to g_hash_table_lookup). There are some dirty tricks that can be done with this, like looking up entries with or without case sensitivity, without keeping two different hash tables. Do you think a g_hash_table_set_equal_func() function could be part of the API?

Should I create a bugzilla entry for these? Another advantage of a linear probing implementation is that g_hash_table_destroy() could be made faster (the current implementation is quite slow, since it has to walk and free every chain node).

Davi de Castro Reis
