DMelt:DataAnalysis/8 Metadata

From HandWiki
Member

Metadata

Metadata is a short record which captures main characteristics of an entire data record or objects inside each data record. These characteristics appear to be useful when one needs to find a necessary record as fast as possible, without reading the entire data records. Read Metadata article.

Let us consider a script which fills an event record with many events. It can look as

from jhplot.io import *
from jhplot import *

ps=math.Poisson(500)
def event(entry):
  p1,p2,p3=P0D('a'),P0D('b'),P0D('c')
  p1.randomUniform(ps.next(),0,1)
  p2.randomNormal(ps.next(),0,1)
  p3.randomNormal(ps.next(),0,1)
  return [str(entry),p1,p2,p3]

f=HFile('data.jser','w')
for i in range(5000):
  ev=event(i)
  if (i%100 == 0):
     print 'event=',ev[0]
  f.write(ev)
f.close()

In this example, event consists of 3 data containers with Poisson random numbers. We write all such events into a file. in a zipped serialized form. How to insert a data record characterizing the sum of 3 random values, x+y+x, such that after reading this small record one can navigate to the necessary event without running over all 50000 events?