How do I convince my data engineer to not modify data before including it in our db?

Taringano@lemm.ee · 1 year ago

That’s fair.

When would that be useful?

Consider that we have no space restrictions and no need for absurd speeds. All our competitors store the data as it was originally entered. We share data sources, and theirs displays nicely while ours displays everything in lowercase and so on, as mentioned.

CaptainBuckleroy@lemm.ee · edit-2 1 year ago

Got it, useful info.

I’m a software engineer, but here’s a bunch of stuff to consider, in no particular order.

Maybe the data engineer isn’t the one to convince?

If it saves time, how much time? Would the tools you use (I’m using the term broadly here) work differently? For example, analytics might count IBM, Ibm, and ibm as three different values.
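To make the counting concern concrete, here’s a rough Python sketch. The sample list is made up for illustration, and `casefold()` is just one possible normalization, not necessarily what your data engineer applies.

```python
from collections import Counter

# Hypothetical sample values, as they might arrive from a shared source.
names = ["IBM", "Ibm", "ibm", "IBM"]

# Counting the raw values treats each casing as a separate company.
raw_counts = Counter(names)

# Counting case-folded values merges them into a single bucket.
folded_counts = Counter(name.casefold() for name in names)

print(raw_counts)     # three distinct keys
print(folded_counts)  # one key
```

If your analytics already fold case at query time like this, the stored data doesn’t need to be lowercased for the counts to come out right.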

Is there a solution that’s the best of both worlds? If space isn’t an issue, can the original text be preserved and linked to each entry? For example, the formatted text could be used for Elasticsearch while the original text is kept for display.
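A minimal sketch of that “keep both” idea, using SQLite from the Python standard library only so it runs anywhere. The table name, column names, and the `normalize` rule are all assumptions for illustration; your real schema and search stack will differ.

```python
import sqlite3

def normalize(text: str) -> str:
    """Hypothetical normalization: collapse whitespace and fold case."""
    return " ".join(text.split()).casefold()

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE company ("
    " id INTEGER PRIMARY KEY,"
    " name_raw TEXT NOT NULL,"      # exactly as the source provided it
    " name_search TEXT NOT NULL)"   # normalized copy, used only for matching
)
conn.execute("CREATE INDEX idx_company_search ON company (name_search)")

def insert_company(conn: sqlite3.Connection, raw_name: str) -> None:
    # Store the original text untouched, plus a derived search key.
    conn.execute(
        "INSERT INTO company (name_raw, name_search) VALUES (?, ?)",
        (raw_name, normalize(raw_name)),
    )

def find_company(conn: sqlite3.Connection, query: str) -> list[str]:
    # Match on the normalized column, but return the original text.
    rows = conn.execute(
        "SELECT name_raw FROM company WHERE name_search = ? ORDER BY id",
        (normalize(query),),
    )
    return [row[0] for row in rows]

for name in ["IBM", "Acme  Widgets Ltd", "eBay"]:
    insert_company(conn, name)

print(find_company(conn, "ebay"))
print(find_company(conn, "ACME widgets ltd"))
```

The point is that normalization becomes a derived column you can rebuild at any time, whereas lowercasing before insert throws away information you can’t get back.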

Maybe “convincing” isn’t the right approach, but learning is?