Statistics
| Branch: | Tag: | Revision:

root / volk / include @ 44b525e7

Name Size
  volk

Latest revisions

# Date Author Comment
5585c712 06/15/2012 12:43 pm Tom Rondeau

filter: adding ssc and fsf versions of filter with associated new Volk kernels.

These routines work and pass QA. They could use some performance work. the FSF is just slightly slower than before; the SCC version is more noticably slower.

Both could benefit, probably, by using SSE2 intrinsics to handle the shorts.

1a5a1d4e 06/14/2012 08:13 pm Tom Rondeau

filter: added a ccf Volk dot product to use with ccf filters and used it in fir_filter_ccf.

Produces improved results to previous version.

c24cabd4 06/14/2012 04:25 pm Tom Rondeau

volk: fixes for 32f dot_prod

Accepts num_points like everything else and handles splitting up numbers itself, not expected to be done externally.

Adds AVX version, both aligned and unaligned.

7f9f0fc9 06/13/2012 09:49 pm Tom Rondeau

volk: dot_produce for floats does 16 at a time.

This was done to make this have the same performance as float_dotprod from before. This makes all flavors of the 32f dotprod work the same way.

Because it's expecting the input to have 4x more samples than specified, it's making qa for these fail.

9e2e896e 06/13/2012 06:53 pm Tom Rondeau

filter: process 4 vectors each time in volk dot_prod to speed up fir filters.

This makes the volk version of the SSE FIR filter the same speed as using the hand-crafted float_dotprod from before.

f919f9dc 04/13/2012 10:36 pm Tom Rondeau

Removed whitespace and added dtools/bin/remove-whitespace as a tool to do this in the future.

The sed script was provided by Moritz Fischer.

a2149950 03/22/2012 10:07 pm Johnathan Corgan

Merge branch 'master' into next

d8832807 03/21/2012 11:59 pm Josh Blum

volk: add include for config.h to cc file

volk_32f_s32f_convert_16i_a was also a problem,
previously volk_32f_s32f_convert_16i_u was fixed to have config.h.
I heard putting config h stuff into public headers was bad practice.
Including config.h into the generated cc file should fix this issue from now on.

ac9cd0eb 03/15/2012 10:03 pm Tom Rondeau

Merge branch 'master' into next

43224ac7 03/15/2012 09:25 pm Tom Rondeau

volk: turning off sse implementation of complex dot product for 32-bit machines until it's fixed.

View revisions

Also available in: Atom