I was wondering whether numpy could be used to build a very basic cube model, in which every cross-combination of dimension values is stored together with its computed value.
Take the following example data:
AUTHOR       BOOK     YEAR  SALES
Shakespeare  Hamlet   2000  104.2
Shakespeare  Hamlet   2001   99.0
Shakespeare  Romeo    2000   27.0
Shakespeare  Romeo    2001   19.0
Dante        Inferno  2000   11.6
Dante        Inferno  2001   12.6
From that, I'd like to be able to build something like:
                      YEAR          TOTAL
AUTHOR       BOOK     2000   2001
(ALL)        (ALL)    142.8  130.6  273.4
Shakespeare  (ALL)    131.2  118.0  249.2
Dante        (ALL)     11.6   12.6   24.2
Shakespeare  Hamlet   104.2   99.0  203.2
Shakespeare  Romeo     27.0   19.0   46.0
Dante        Inferno   11.6   12.6   24.2
I'm hoping that something like meshgrid might get me 75% of the way there. Basically, I'd like to see whether it's possible to use numpy (not pandas) to build a structure of pre-computed values from which I could retrieve the result above for every possible combination. For the sake of simplicity, let's consider SUM as the only possible calculation. Perhaps this is a roundabout way of asking, but could numpy be the backbone for doing this, or do I need to use something else?
And finally, if it's not possible in numpy, how might this be stored in an MDA?
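To make the question concrete, here is a rough sketch of the kind of structure I have in mind. It is only one possible layout, not necessarily the right approach: a dense array with one axis per dimension, where the last slot on each axis is reserved for (ALL). The names `rows`, `cube`, and `lookup` are placeholders I made up for illustration.

```python
import numpy as np

# Example rows from the table above: (author, book, year, sales).
rows = [
    ("Shakespeare", "Hamlet", 2000, 104.2),
    ("Shakespeare", "Hamlet", 2001, 99.0),
    ("Shakespeare", "Romeo", 2000, 27.0),
    ("Shakespeare", "Romeo", 2001, 19.0),
    ("Dante", "Inferno", 2000, 11.6),
    ("Dante", "Inferno", 2001, 12.6),
]

# Integer-encode each dimension: np.unique returns the sorted labels and,
# with return_inverse=True, the position of each row's label in that list.
authors, a_idx = np.unique([r[0] for r in rows], return_inverse=True)
books, b_idx = np.unique([r[1] for r in rows], return_inverse=True)
years, y_idx = np.unique([r[2] for r in rows], return_inverse=True)
sales = np.array([r[3] for r in rows])

# Dense cube with one extra slot per axis (index -1) reserved for (ALL).
cube = np.zeros((len(authors) + 1, len(books) + 1, len(years) + 1))

# Scatter-add the base cells; np.add.at accumulates repeated coordinates.
np.add.at(cube, (a_idx, b_idx, y_idx), sales)

# Fill the (ALL) slots one axis at a time. Each later step also sums the
# (ALL) slots filled by earlier steps, so the corner ends up as the grand total.
cube[-1, :, :] = cube[:-1, :, :].sum(axis=0)
cube[:, -1, :] = cube[:, :-1, :].sum(axis=1)
cube[:, :, -1] = cube[:, :, :-1].sum(axis=2)

# Label -> index maps for lookups (plain Python keys via tolist()).
a_pos = {name: i for i, name in enumerate(authors.tolist())}
b_pos = {name: i for i, name in enumerate(books.tolist())}
y_pos = {year: i for i, year in enumerate(years.tolist())}


def lookup(author=None, book=None, year=None):
    """Return the pre-computed SUM for one combination; None means (ALL)."""
    a = -1 if author is None else a_pos[author]
    b = -1 if book is None else b_pos[book]
    y = -1 if year is None else y_pos[year]
    return cube[a, b, y]


print(round(lookup(), 1))                      # grand total
print(round(lookup(author="Shakespeare"), 1))  # Shakespeare, all books, all years
print(round(lookup(year=2000), 1))             # all authors and books in 2000
```

One thing I'm unsure about with this layout is that it is dense: it also allocates cells for combinations that never occur in the data (for example Dante and Hamlet), which simply hold 0.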
from Build a basic cube with numpy?