Steve,
thanks for furthering the
discussion.
Your views make sense to me.
I'd like to examine the super
computer
cabling distance distribution that Petar shared with us yesterday in a
bit more detail. I've plotted it to allow folks to see it in graphical
form.
This data has several features that
are remarkably similar to that of general data center cabling.
1) The distribution is highly skewed
towards the shorter end of the distribution.
2) The distribution has a very long
tail relative to the position of the mode, the most frequent length, at
20m.
3) The mode is at a distance that is
one fifth of the maximum length.
The white dot on the graph
represents
the coordinate of equivalent coverage relative to the 100m objective to
the data center cabling distribution. Speaking to Steve's point that
questions the correctness of the 100m objective for HPC environments, I
would venture to say that a 25m objective, which is the roughly
equivalent
in coverage to the 100m objective we are attempting to apply to data
centers,
would not be satisfactory for the HPC environment, as it would leave a
significant portion of the channels without a low-cost solution.
It is clear that the 100m objective
is a near-perfect match to the needs of HPC. Yet I do not believe
that HPC should be the primary focus of our development. We must
be developing a solution that properly satisfies a much larger market
than
this or we are wasting our time. Indeed, given that latency is a
major performance concern for HPC, the vendors of such machines may
prefer
to use InfiniBand. This could mean that one of the primary customers
to which we have tuned our present objective will actually not use
Ethernet,
but will benefit anyway by driving InfiniBand to adopt the same 100m
PMD
specs that 802.3ba defines. This possibility reinforces my perspective
that we need to properly address a broader set of customers - those
that
operate in the general data center environment. It is clear from
all of the data and surveys that remaining only with a 100m solution
misses
the mark for this broader market. Continuing under this condition
will mean that the more attractive solution for links longer than 100m
in the general data center will be to deploy link aggregated
10GBASE-SR.
Its cost will be on par and it will reach the distances the customers
need in their data centers.
Is this the future you want for all
our efforts, or do you want to face the facts and address the issue
head
on with a solution that gives data center customers what they need?
Next week these decisions will be
placed
before the Task Force. I hope we choose wisely.
Regards,
Paul Kolesar
CommScope Inc.
Enterprise Solutions
1300 East Lookout Drive
Richardson, TX 75082
Phone: 972.792.3155
Fax: 972.792.3111
eMail: pkolesar@xxxxxxxxxxxxx
All,
I think Paul's suggestion is
a
good one; I would like to add some other input (in the form of
questions)
from my point of view:
1. Do we have the
right
MMF objective (support at least 100m on OM3 fiber)?
My data suggests that we
don't;
we have tried to come at this from two different directions, trying to
be as unbiased as possible in assessing the situation. I presented
Corning
sales data in November 2006 (see http://www.ieee802.org/3/hssg/public/nov06/swanson_01_1106.pdf).
This data showed a need to support a link length longer than 100m and I
recommended that we support 200m at that time.
We also polled our
customers,
offering three options, a low cost, single PMD at 100m on OM3, a
slightly
higher cost single PMD at 150-200m on OM3, and a third option that
would
specify two PMDs consisting of both option 1 and option 2. The results
were overwhelmingly in favor of Option 2, a single PMD at longer
length.
A small number supported Option 3 (2 PMDs) but NONE supported Option 1.
While it is true that many of our customers have a substantial portion
of their link lengths that are less than 100m, they all have link
lengths
longer than 100m. One customer noted that more than half of his data
center
had link lengths longer than 100m.
Kolesar presented his
company's
sales data in September 2006 (see http://www.ieee802.org/3/hssg/public/sep06/kolesar_01_0906.pdf).
His data also suggested that longer link lengths were needed and he
recommended
150m at that time.
All the data for datacenter
seems
to suggest that 100m is TOO SHORT to cover a significant portion of the
datacenter application.
Pepeljugoski presented new
data
yesterday on HPC link lengths that show 85% being less than 20m and 98%
less than 50m. This might suggest that 100m is TOO LONG for HPC
applications.This
leads to another question of whether there is any economic or technical
advantage to a shorter MMF objective for HPC?
2. Is there consensus on
supporting
a longer reach objective for MMF?
I think there is, others on
the
call yesterday did not. I base my opinion on the straw poll conducted
in
Munich:
Straw Poll #15: Should we
continue
to work on a proposal for an annex to extend the reach of a 40GBASE-SR4
and 100GBASE-SR10 in addition to the
proposal(“pepeljugoski_01_0508.pdf”)
as in “jewell_01_0508.pdf”.
Yes: 55
No: 3
3. Could we achieve
75% support
for adding a new MMF objective?
I don't know but if we
could not,
I would be forced to vote against adopting the current MMF baseline
proposal
(which I don't want to do) and I think others may also. This may or may
not lead to an impasse similar to what we experienced in 802.3ae.
I understand the concern
that adding
the objective without a clear consensus on how to support the new
objective
could lead to delay but I have found this committee to be very
resourceful
in driving to a solution after we have made a decision to go forward.
40G
is one recent example of a situation where no consensus turned very
quickly
to consensus.
I think adding a new
objective
is the right approach and in the long run will save the task force
valuable
development time.
4. Can we agree on the
right
assumptions on the 10G model to evaluate the various proposals?
Everyone seems to be using
slightly
different variations of the model to evaluate the capability of the
proposal;
we need to agree on a common approach of analysis.
5. Can we not let the
discussion
on OM4 cloud the decision?
We can get extended link
lengths
on OM3. By achieving longer lengths on OM3, even longer lengths will be
possible on OM4 with the same specification. What I don't want people
to
think is that OM4 is required to get longer lengths.
6. Summary
John D'Ambrosia has provided
advice
that if we want to move forward with a new MMF objective, July is the
time
to do it - if we delay the decision, it is guaranteed to delay the
overall
process. Some might think if we make the decision, it will delay the
overall
process but we don't know that yet. I don't think adding an informative
specification on a PMD is the right way to go - let's get the MMF
objective(s)
right - we owe it to ourselves and to our customers. To do anything
less
is just avoiding the issue. Let's get the objectives set, get the
assumptions
correct and utilize the process set up by Petrilla and Barbieri to
drive
toward the hard decisions that we are all very capable of making.
Sincerely,
Steve Swanson
From: Paul Kolesar
[mailto:PKOLESAR@xxxxxxxxxxxx]
Sent: Thursday, July 10, 2008 7:19 PM
To: STDS-802-3-HSSG@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3BA] XR ad hoc Phone Conference Notice
Alessandro,
I'd like to continue your thread with some observations that have
driven
me to certain conclusions, and to follow that with a suggestion about
how
to parse the approach and drive to a consensus position.
First let's consider what various customers are telling us. The
Corning
survey of their customers, which has been presented to the Ethernet
Alliance,
the XR ad-hoc, and will be presented next week to 802.3ba, shows that
the
large majority of customers want a single PMD solution that can provide
150m on OM3 and 250m on OM4. A minority were willing to accept a
two PMD solution set that delivers the lowest cost PMD to serve up to
100
m and a second PMD to serve the extended distances as above. Not
a single response indicated a preference for a solution limited to
100m.
We also hear strongly expressed opinions from various system vendors
that a longer distance solution is not acceptable if it raises cost or
power consumption of the currently adopted 100m PMD. Under these
conditions, and given the options presented and debated within the XR
ad-hoc,
I believe you are justified in concluding that a single PMD cannot
satisfy
all these constraints. Yet it is clear to me that the market will
demand a low-cost PMD that can support more than 100m to fulfill the
distance
needs of data centers. Therefore I conclude that the correct
compromise
position is to develop a two-PMD solution. If the committee does
not undertake this development, it is likely that several different
proprietary
solutions will be brought to the market, with the net result of higher
overall cost structures.
So let's consider how to choose from among the various proposals for an
extended reach PMD and let the determination of how to document it
within
the standard be addressed after that.
I would propose a series of polls at next week's meeting designed to
gauge
the preferences of the Task Force. I do not think that any XR proposal
will garner >75% at the outset, so I would propose the use of
Chicago
rules wherein members may vote for all the proposals they find
acceptable.
From this we can see which of the solutions is least acceptable.
Then through a process of elimination from the bottom, and repeated
application of Chicago rules for the remainder, finally determine the
most
acceptable solution.
Depending on the degree of maturity of the specifications or other
considerations
for the chosen solution, the Task Force will be better able to
determine
how it should be handled within the standard. For example, a proposal
with a maturity on par with the adopted baseline could be put forth
under
a new objective without undue concern of becoming a drag on the
timeline,
while a proposal of lesser maturity could be placed in an annex without
an additional objective.
Regards,
Paul Kolesar
CommScope Inc.
Enterprise Solutions
1300 East Lookout Drive
Richardson, TX 75082
Phone: 972.792.3155
Fax: 972.792.3111
eMail: pkolesar@xxxxxxxxxxxxx
Matt,
here is my *personal* read of the situation in the XR ad hoc:
a) I think there could be consensus on supporting XR, as long as we
pick
a solution that does not impact the cost structure of the 100m PMD.
Because
of that I also don't feel a single PMD is realistic at this point.
a) The trouble however is that there is no consensus (>75%) on any
of
the technical proposals. No one proposal has a clear lead over the
others.
Of the three options you list below, I think adding an objective for a
ribbon XR PMD could have a major impact on the project schedule,
because
it seems we are nowhere near technical consensus. We could drag the
discussion
for several TF meetings...I am not sure delaying the project over this
specific topic is worth it.
We can always resort to non-standard solutions to fulfill market
requirements
we can't address within IEEE, or come back in the future with another
CFI.
At the end of the conference call earlier today I requested that we get
together after hours next week to see if we can accelerate consensus
building.
All the data is on the table now, so if we don't show any material
progress,
I am not sure we should extend this ad hoc.
Alessandro
From: Matt Traverso
[mailto:matt.traverso@xxxxxxxxx]
Sent: Thursday, July 10, 2008 10:07 AM
To: STDS-802-3-HSSG@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3BA] XR ad hoc Phone Conference Notice
Colleagues,
I feel that we are coming to a situation similar to the impasse at 40G
vs. 100G where different participants call different segments of the
networking
industry their customer.
For MMF, I'd like to see an optimized solution at 100m per all of the
work
that has been done.
I'd like to understand if folks feel that a different status for the
extended
reach
a) Informative
b) Normative
c) New objective
would significantly alter the technically proposed solution from the Ad
Hoc. Opinions?
Chris,
The case of slow market/industry transition from LX4 to LRM is one of
the
reasons why I would like to see the industry adopt 40G serial from the
launch. The slow adoption of LRM has primarily been limited by end
customer knowledge of the solution. 40G serial technology is available.
thanks
--matt
Hi Gourgen,
Some numbers might
help clarify
what close to 0 means.
For 2008,
Lightcounting gives
a shipment number of approximately 30,000 for 10GE-LRM (and for
10GE-LX4
it's about 60,000.) So close to 0 would apply if we were rounding to
the
nearest 100K. As an aside, 10GE-LRM supports 220m of MMF, not 300m.
300m of OM3 is
supported by
10GE-SR, which Lightcounting gives as approximately 400,000 in 2008, so
that would be close to 0 if we rounding to the nearest 1M.
Another interesting
sideline
in looking at these numbers is that 2 years after the 10GE-LRM standard
was adopted in 2006, despite the huge investment being made in 10GE-LRM
development, and despite very little new investment being made in
10GE-LX4,
the 10GE CWDM equivalent (i.e. 10GE-LX4, 4x3G) is chugging along at 2x
the volume of the 10GE Serial solution that was adopted to replace it.
This should put some
dim on
hopes that very low cost 40GE Serial technology can be developed from
scratch
in two years and ship in volume when the 40GE standard is adopted in
2010.
Chris
From: Gourgen Oganessyan
[mailto:gourgen@xxxxxxxxxxx]
Sent: Wednesday, July 09, 2008 8:02 PM
To: STDS-802-3-HSSG@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3BA] XR ad hoc Phone Conference Notice
Petar,
Well, sadly that's
what has
been happening in the 10G world, people are forced to amortize the cost
of 300m reach (LRM), while in reality the number of people who need
300m
is close to 0.
That's why I am
strongly in
support of your approach of keeping the 100m objective as primary goal.
Frank, OM4 can add as
much cost
as it wants to, the beauty is the added cost goes directly where it's
needed,
which is the longer links. Alternatives force higher cost/higher power
consumption on all ports regardless of whether it's needed there or
not.
Gourgen Oganessyan
Quellan
Inc.
Phone: (630)-802-0574
(cell)
Fax: (630)-364-5724
e-mail: gourgen@xxxxxxxxxxx
From: Petar Pepeljugoski
[mailto:petarp@xxxxxxxxxx]
Sent: Wednesday, July 09, 2008 7:51 PM
To: STDS-802-3-HSSG@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3BA] XR ad hoc Phone Conference Notice
Frank,
If I interpret correctly, you are saying that all users should amortize
the cost of very few who need extended reach.
We need to be careful how we proceed here - we should not repeat the
mistakes
of the past if we want successful standard.
Regards,
Peter
Petar Pepeljugoski
IBM Research
P.O.Box 218 (mail)
1101 Kitchawan Road, Rte. 134 (shipping)
Yorktown Heights, NY 10598
e-mail: petarp@xxxxxxxxxx
phone: (914)-945-3761
fax: (914)-945-4134
Hi Jeff;
Thanks for your comment. You missed one critical point that there is
cost
increase from OM3 to OM4. If you take ribbon cable cost in perspective,
OM4 option is possibly the largest of the 4 options.
Besides, the use of OM4 requires to tighten TX specs which impact TX
yield,
so you are actually compromising the primary goal.
Frank
From: Jeff Maki [mailto:jmaki@xxxxxxxxxxx]
Sent: Wednesday, July 09, 2008 7:02 PM
To: STDS-802-3-HSSG@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3BA] XR ad hoc Phone Conference Notice
Dear MMF XR Ad Hoc Committee Members,
I believe our current objective of "at least 100 meters on OM3 MMF"
should remain as a primary goal, the baseline. Support for any form
of extended reach should be considered only if it does not compromise
this
primary goal. A single PMD for all reach objectives is indeed a good
starting premise; however, it should not be paramount. In the
following
lists are factors, enhancements, or approaches I would like to put
forward
as acceptable and not acceptable for obtaining extended reach.
Not Acceptable:
1. Cost increase for the baseline PMD (optic) in order to obtain
greater
than 100-meter reach
2. EDC on the system/host board in any case
3. CDR on the system/host board as part of the baseline solution
4. EDC in the baseline PMD (optic)
5. CDR in the baseline PMD (optic)
Acceptable:
1. Use of OM4 fiber
2. Process maturity that yields longer reach with no cost increase
In summary, we should not burden the baseline solution with cost
increases
to meet the needs of an extended-reach solution.
Sincerely,
Jeffery Maki
————————————————
Jeffery J. Maki, Ph.D.
Principal Optical Engineer
Juniper Networks, Inc.
1194 North Mathilda Avenue
Sunnyvale, CA 94089-1206
Voice +1-408-936-8575
FAX +1-408-936-3025
www.juniper.net
jmaki@xxxxxxxxxxx
————————————————